Could you try to do:
CUDA_VERSION="11.1" clearml-agent ...
It asks the driver or find the cuda dll/so
By host you mean the machine on which the agent is running? How does clearml-agent find the cuda_version?
But I do not have anything linked correctly since I rely in conda installing cuda/cudnn for me
From the log it installed:cudatoolkit==11.1.1
based on the CUDA it found on the host machine: agent.cuda_version = 110
But for some reason it installed the pytorch from the conda "pytorch" repo without the cuda support.
But I do not have anything linked correctly since I rely in conda installing cuda/cudnn for me
fyi: NVIDIA-SMI 460.56 Driver Version: 460.56 CUDA Version: 11.2
Hi @<1523701868901961728:profile|ReassuredTiger98>
Could you send the full log ? Also what's the clearml-agent
version?