Hey guys! I'm having some issues with PyTorch and ClearML. I am starting a new task using Task.create and setting PyTorch as a requirement under `packages`. For some reason PyTorch with CUDA 12 is being installed, but I need CUDA 11. Do you know how to se

Answered

Hey guys! I'm having some issues with PyTorch and ClearML. I am starting a new task using `Task.create` and setting PyTorch as a requirement under `packages`. For some reason, PyTorch with CUDA 12 is being installed, but I need CUDA 11. Do you know how to make it install the CUDA 11 build?

  				
Posted 7 months ago by RattyBluewhale45
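One way to approach the question above is to pin a CUDA-specific wheel in the `packages` list. The sketch below is an assumption-laden illustration, not a confirmed fix: the helper names `torch_requirements` and `create_task`, the torch version `2.0.1`, and the `cu118` tag are all chosen for the example, and whether the agent honors an `--extra-index-url` line or a `+cu118` local version tag passed through `packages` is untested here and may depend on the agent version.

```python
from typing import List

# Assumption: PyTorch publishes CUDA-specific wheels under this index,
# where "cu118" selects the CUDA 11.8 builds.
PYTORCH_INDEX = "https://download.pytorch.org/whl/{cuda_tag}"


def torch_requirements(torch_version: str, cuda_tag: str = "cu118") -> List[str]:
    """Return pip requirement lines that pin a CUDA-specific torch build."""
    return [
        f"--extra-index-url {PYTORCH_INDEX.format(cuda_tag=cuda_tag)}",
        f"torch=={torch_version}+{cuda_tag}",
    ]


def create_task(project: str, name: str, script: str):
    """Hypothetical wrapper: create a ClearML task with the pinned packages."""
    # Imported lazily so the helper above can be used without clearml installed.
    from clearml import Task

    return Task.create(
        project_name=project,
        task_name=name,
        script=script,
        packages=torch_requirements("2.0.1"),
    )


# Show the requirement lines that would be passed to Task.create.
print(torch_requirements("2.0.1"))
```

If the agent ignores option lines in `packages`, the extra index may have to be configured on the agent side instead.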

Answers 41

I can install the correct torch version with this command:
`pip install --pre torchvision --force-reinstall --index-url None`

  				
Posted 7 months ago by RattyBluewhale45
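To confirm which build actually ended up in the environment after a manual install like the one above, the CUDA tag can be read from the torch version string. This is a minimal sketch: the helper names `cuda_tag` and `is_cuda11` are made up for the example, and it assumes the installed wheel reports its CUDA build as a `+cuXYZ` suffix, which is not guaranteed for every distribution of torch.

```python
import re
from typing import Optional


def cuda_tag(version: str) -> Optional[str]:
    """Extract the CUDA tag (for example 'cu118') from a torch version string."""
    match = re.search(r"\+(cu\d+)", version)
    return match.group(1) if match else None


def is_cuda11(version: str) -> bool:
    """Return True when the version string carries a CUDA 11.x tag."""
    tag = cuda_tag(version)
    return tag is not None and tag.startswith("cu11")
```

Inside the task environment, `is_cuda11(torch.__version__)` gives a quick check, and `torch.version.cuda` reports the CUDA version the wheel was built against more directly.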

docker="nvidia/cuda:11.8.0-base-ubuntu20.04"

  				
Posted 7 months ago by RattyBluewhale45

It's hanging at


Installing collected packages: zipp, importlib-resources, rpds-py, pkgutil-resolve-name, attrs, referencing, jsonschema-specifications, jsonschema, certifi, urllib3, idna, charset-normalizer, requests, pyparsing, PyYAML, six, pathlib2, orderedmultidict, furl, pyjwt, psutil, python-dateutil, platformdirs, distlib, filelock, virtualenv, clearml-agent
Successfully installed PyYAML-6.0.2 attrs-23.2.0 certifi-2024.7.4 charset-normalizer-3.3.2 clearml-agent-1.8.1 distlib-0.3.8 filelock-3.15.4 furl-2.1.3 idna-3.7 importlib-resources-6.4.0 jsonschema-4.23.0 jsonschema-specifications-2023.12.1 orderedmultidict-1.0.1 pathlib2-2.3.7.post1 pkgutil-resolve-name-1.3.10 platformdirs-4.2.2 psutil-5.9.8 pyjwt-2.8.0 pyparsing-3.1.2 python-dateutil-2.8.2 referencing-0.35.1 requests-2.31.0 rpds-py-0.20.0 six-1.16.0 urllib3-1.26.19 virtualenv-20.26.3 zipp-3.20.0
WARNING: You are using pip version 20.1.1; however, version 24.2 is available.
You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.

  				
Posted 7 months ago by RattyBluewhale45

CostlyOstrich36 do you have any ideas?

  				
Posted 7 months ago by RattyBluewhale45

I am running the agent with `clearml-agent daemon --queue training`

  				
Posted 7 months ago by RattyBluewhale45

or cu11x

  				
Posted 7 months ago by RattyBluewhale45

unrelated to the agent itself

  				
Posted 7 months ago by CostlyOstrich36

This one seems to be compatible: `nvcr.io/nvidia/pytorch:22.04-py3`

  				
Posted 7 months ago by RattyBluewhale45

Thank you I will try that

  				
Posted 7 months ago by RattyBluewhale45

ERROR: This container was built for NVIDIA Driver Release 530.30 or later, but
       version 460.32.03 was detected and compatibility mode is UNAVAILABLE.

       [[System has unsupported display driver / cuda driver combination (CUDA_ERROR_SYSTEM_DRIVER_MISMATCH) cuInit()=803]]

  				
Posted 7 months ago by RattyBluewhale45
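The error above says the container expects a newer host driver than the one installed. As a rough illustration, the two versions can be pulled out of the message and compared. The function names `parse_version` and `check_driver` are invented for this sketch, and the regular expressions assume the message wording shown in the log.

```python
import re
from typing import Optional, Tuple

Version = Tuple[int, ...]


def parse_version(text: str) -> Version:
    """Turn a dotted version such as '460.32.03' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def check_driver(message: str) -> Optional[Tuple[Version, Version, bool]]:
    """Return (required, detected, ok) parsed from the container error, or None."""
    required = re.search(r"Driver Release (\d+(?:\.\d+)*)", message)
    detected = re.search(r"version (\d+(?:\.\d+)*) was detected", message)
    if not (required and detected):
        return None
    req = parse_version(required.group(1))
    det = parse_version(detected.group(1))
    # Tuples compare element by element, so (460, 32, 3) < (530, 30).
    return req, det, det >= req


log = (
    "ERROR: This container was built for NVIDIA Driver Release 530.30 or later, but\n"
    "       version 460.32.03 was detected and compatibility mode is UNAVAILABLE."
)
print(check_driver(log))
```

For this log the detected driver is older than the required one, so the options are to upgrade the host driver or to pick an image built against an older CUDA release that the 460 driver series supports.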
