Hi, I'm trying to use ClearML with PyTorch Lightning on multiple GPUs, but the server does not seem to monitor the experiment. I can see no progress i...
2 years ago
Hi Natan,
agent command: clearml-agent daemon --gpu all
I'm using 8 GPUs. The model runs on all of them, but the logging isn't working.
And everything works fine with regular PyTorch.
We used to use PyTorch and it worked just fine, but now we have moved to PyTorch Lightning (a kind of extension of PyTorch that gives Keras-like functionality).
We used PyTorch with multi-GPU (DDP).
And it doesn't work with 2 GPUs either.
I'm still getting the machine usage reports
With regular PyTorch it worked when running on all 8 GPUs.
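For anyone debugging a similar setup, one thing that may be worth checking is which DDP process is actually doing the reporting, since loggers usually report from the rank-zero process only. Below is a minimal sketch, not a confirmed fix. It assumes a single-node run where the launcher exports `RANK` and/or `LOCAL_RANK` environment variables (as torchrun and Lightning's subprocess DDP launcher typically do). The helper names `process_rank` and `is_rank_zero` are hypothetical and are not part of ClearML or PyTorch Lightning.

```python
import os


def process_rank(env=None):
    """Return (global_rank, local_rank) read from environment variables.

    Assumption: single-node run, so LOCAL_RANK is used as a fallback
    for RANK when RANK is not set. Both default to 0 when unset.
    """
    env = os.environ if env is None else env
    local_rank = int(env.get("LOCAL_RANK", 0))
    global_rank = int(env.get("RANK", local_rank))
    return global_rank, local_rank


def is_rank_zero(env=None):
    """True if this process looks like the rank-zero (main) process."""
    return process_rank(env)[0] == 0


if __name__ == "__main__":
    global_rank, local_rank = process_rank()
    # Printing this at the top of the training script shows which of the
    # spawned processes sees itself as the main one.
    print(f"pid={os.getpid()} rank={global_rank} local_rank={local_rank} "
          f"rank_zero={is_rank_zero()}")
```

If every process prints `rank_zero=True`, or none does, the rank environment is probably not what the logger expects, and that would be a useful detail to include in the report.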