@<1523701087100473344:profile|SuccessfulKoala55> Could you give some advice?
As far as I known, ClearML will not record the whole cmd python -m torch.distributed.run --nproc_per_node 2 train.py --batch 64 --data coco.yaml
. And there is a file path issue as following. The cloned and enqueued task on the WebApp didn't pass --data coco.yaml
to the train.py
and result in the train.py
can not get data args! @<1523701205467926528:profile|AgitatedDove14> could you help?
Hi @<1523701205467926528:profile|AgitatedDove14> . Yes, Agent will execute the cloned task and Task.init()
inside my code, but I don't know which cmd it use, python -m torch.distributed.run --nproc_per_node 2 train.py --batch 64
or python train.py --batch 64
? Another question is how the task's name and project's name are setting for the WebAPP gives the names and Task.init()
also gives the names.
Hi ThoughtfulBadger56
If I clone and enqueue the cloned task on the webapp, does the clearml server execute the whole cmd above?
You mean agent will execute it? Do you have Task.init inside your code ?