What you actually specified is torch the @ is kind of pip remark, pip will not actually parse it 🙂
use only the link https://download.pytorch.org/whl/cu100/torch-1.3.1%2Bcu100-cp36-cp36m-linux_x86_64.whl
I specified a
torch @ https://download.pytorch.org/whl/cu100/torch-1.3.1%2Bcu100-cp36-cp36m-linux_x86_64.whl and it didn't detect the link, it tried to install latest version: 1.6.0
What do you have in the
(you can see it printed at the beginning of the log)
agent.cuda_version = 0 agent.cudnn_version = 0
This is why it thinks it's CPU 🙂
It is configured as CPU (i.e. no CUDA)
(since you are using venv mode, if the cuda is not detected at startup time, it will not install the GPU version, as it has no CUDA support)
python3 -m trains_agent --config-file "~/trains.conf" daemon --queue default --log-level DEBUG --detached --gpus 1 > ~/trains-agent.startup.log 2>&1
what do you see in the console when you start the trains-agent , it should detect the cuda version
This is also set in the command line.
--cpu-only or maybe without any --gpus flag at all
OK but nowhere I specified that, I just checked my trains.conf file
agent.cuda_version = 110 agent.cudnn_version = 0
I just started one and it wrote:
okay, now it should work 🙂
yes, I use --gpus parameter
cuda 10.1, I guess this is because no wheel exists for torch==1.3.1 and cuda 11.0
how can I enforce a specific wheel to be installed?
You mean like specific CUDA wheel ?
you can simple put the http link to the wheel in the "installed packages", it should work
I have 11.0 installed but on another machine with 11.0 installed as well, trains downloads torch for cuda 10.1, I guess this is because no wheel exists for torch==1.3.1 and cuda 11.0
AgitatedDove14 one last question: how can I enforce a specific wheel to be installed?
You can always force it with environment variable
I don't know why it didn't detect it in first place
That depends on what you have installed 🙂
btw shoulnd't it be
thanks, I will do that
ho, that might be it then, thanks!
You're welcome 🙂