Hi All! Is There Any Simple Way To Use

Unanswered

just to be clear, this works on my local machine:

distributed_args = torch.distributed.run.parse_args(sys.argv)
distributed_args.nproc_per_node = args.gpus
torch.distributed.run.run(distributed_args)

But not when clearml-agent runs it

So the args are patched on the "main" process, but only on the remote worker

  				
Posted 
	one year ago

					More  		
  Report
		
					PlainSeaurchin97
				
					0
					 × 1

210 Views

0 Answers

one year ago