Reputation
Badges 1
17 × Eureka!sorry, what do you mean :manually edit it back to your code
CostlyOstrich36 I have a base training task that runs successfully. When I am trying to run a clone of the task on an agent through queue, the environment setup is fine but the training does not start and throws this error
could you maybe point me to an example of HPO that uses transformers? Can 't find anything online. maybe I can compare my version. Many thanks
AgitatedDove14
` import os
os.environ['LC_ALL'] = "C.UTF-8"
os.environ['LANG'] = "C.UTF-8"
from clearml import Task
CLEARML_PROJECT = 'Vodafone Sentiment full'
CLEARML_TASK = 'HPO_BASE_TASK'
os.environ["CLEARML_PROJECT"] = CLEARML_PROJECT
os.environ["CLEARML_TASK"] = CLEARML_TASK
os.environ['MPLBACKEND'] = "TkAg"
Task.set_credentials(
api_host=" ",
web_host=" ",
files_host=" ",
key='******************',
secret='*********************'
)
task = Task.init(proje...
The base task does have 'Task.init' and both have clearml installed
yes, that's the base task which ran successfully. It does have both
also hpo controller:
` import os
from clearml import Task
os.environ['MPLBACKEND'] = "TkAg"
CLEARML_PROJECT = "Vodafone Sentiment full"
CLEARML_TASK = "HPO optimizer Controller"
os.environ["CLEARML_PROJECT"] = CLEARML_PROJECT
os.environ["CLEARML_TASK"] = CLEARML_TASK
Task.set_credentials(
api_host=" ",
web_host=" ",
files_host=" ",
key='88888888888',
secret='888888888888888'
)
from clearml.automation import UniformParameterRange, UniformIntegerParameterRange, Discret...
AgitatedDove14 even the base task does not have any Arg named "input_train_data". The base task is self-contained i.e. it downloads training/eval directly data and has direct access to it.
Now that I have shared this with you..I finally saw that kubeflow is injecting this argparse stuff
AgitatedDove14 I followed the above format but it still does not work. i'm getting increasingly sure that this is related to huggingface's trainer API. Can you share an example for using huggingface's trainer API if possible? TIA
Thanks for your answer. By main script, you mean the base task or the agent?