Reputation
Badges 1
17 × Eureka!CostlyOstrich36 I have a base training task that runs successfully. When I am trying to run a clone of the task on an agent through queue, the environment setup is fine but the training does not start and throws this error
Now that I have shared this with you..I finally saw that kubeflow is injecting this argparse stuff
Thanks for your answer. By main script, you mean the base task or the agent?
AgitatedDove14
` import os
os.environ['LC_ALL'] = "C.UTF-8"
os.environ['LANG'] = "C.UTF-8"
from clearml import Task
CLEARML_PROJECT = 'Vodafone Sentiment full'
CLEARML_TASK = 'HPO_BASE_TASK'
os.environ["CLEARML_PROJECT"] = CLEARML_PROJECT
os.environ["CLEARML_TASK"] = CLEARML_TASK
os.environ['MPLBACKEND'] = "TkAg"
Task.set_credentials(
api_host=" ",
web_host=" ",
files_host=" ",
key='******************',
secret='*********************'
)
task = Task.init(proje...
AgitatedDove14 even the base task does not have any Arg named "input_train_data". The base task is self-contained i.e. it downloads training/eval directly data and has direct access to it.
AgitatedDove14 I followed the above format but it still does not work. i'm getting increasingly sure that this is related to huggingface's trainer API. Can you share an example for using huggingface's trainer API if possible? TIA
yes, that's the base task which ran successfully. It does have both
could you maybe point me to an example of HPO that uses transformers? Can 't find anything online. maybe I can compare my version. Many thanks
also hpo controller:
` import os
from clearml import Task
os.environ['MPLBACKEND'] = "TkAg"
CLEARML_PROJECT = "Vodafone Sentiment full"
CLEARML_TASK = "HPO optimizer Controller"
os.environ["CLEARML_PROJECT"] = CLEARML_PROJECT
os.environ["CLEARML_TASK"] = CLEARML_TASK
Task.set_credentials(
api_host=" ",
web_host=" ",
files_host=" ",
key='88888888888',
secret='888888888888888'
)
from clearml.automation import UniformParameterRange, UniformIntegerParameterRange, Discret...
The base task does have 'Task.init' and both have clearml installed
sorry, what do you mean :manually edit it back to your code