CUDA is the driver itself. The agent doesn't install CUDA but installs a compatible torch assuming that CUDA is properly installed.
If I run nvidia-smi it returns valid output and it says the CUDA version is 11.2
Just to make sure, run the code on the machine itself to verify that python can actually detect the driver
Isn't the problem that CUDA 12 is being installed?
Solved that by setting docker_args=["--privileged", "--network=host"]
Hi @<1734020162731905024:profile|RattyBluewhale45> , what version of pytorch are you specifying?