Hi All, I Was Trying To Use Clearml-Task To Run A Custom Docker(With Poetry To Install All The Python Dependencies And Activated The Environment) Using Clearml Gpu, But It Seems Like Clearml Always Create A Virtual Environment And Run The Python Script Fr

Answered

Hi all, I was trying to use clearml-task to run a custom docker(with poetry to install all the python dependencies and activated the environment) using clearml GPU, but it seems like clearml always create a virtual environment and run the python script from /root/.clearml/venvs-builds/3.10/bin/python . Is there a way that I can have the clearml-task to automatically activated a virtual environment use the activated custom virtual environment in my docker and run the scripts from there instead of always creating a new venv inheriting from the clearml system_site_packages? I noticed that clearml.conf has a configuration agent.docker_use_activated_venv , but I am not sure how to enable it from clearml-task

  				
Posted 
	one year ago

					More  		
  Report
		
					EnchantingPenguin77
				
					0
					 × 1

Votes Newest

Answers 38

There is nothing on the queue and worker

  				
Posted 
	one year ago

					More  		
  Report
		
					EnchantingPenguin77
				
					0
					 × 1

I've added gpu:True to my hydra config file but the GPU is still not used

  				
Posted 
	one year ago

					More  		
  Report
		
					EnchantingPenguin77
				
					0
					 × 1

AgitatedDove14 Yes I cansee the worker:

  				
Posted 
	one year ago

					More  		
  Report
		
					EnchantingPenguin77
				
					0
					 × 1

It seems like CPU is working on something, I saw the usage is spiking periodically but I didn't run any task this morning

  				
Posted 
	one year ago

					More  		
  Report
		
					EnchantingPenguin77
				
					0
					 × 1

Thanks AgitatedDove14 . I just got an issue running clearml-task remotely, it has been working fine before today, but now every time I run clearml-task, it shows pending, and I've been waiting for 3 hours the status is still pending. The autoscalers was charging the hourly rate even though the task is still pending for 3 hours. From the console log of Clearml GPU instance, I saw it is listening to the queue, but there is no log even after 3 hours. There is nothing else I am running beside this one task, and seems like the worker never spin up again

2023-08-03 04:41:00,624 - clearml.Auto-Scaler - INFO - Spinning new instance resource='default', prefix='38ae71a80baf4a58893631d23c0c6e72_3090_1', queue='test-gpu'
2023-08-03 04:41:00,625 - clearml.Auto-Scaler - INFO - Creating instance for resource default
2023-08-03 04:41:01,027 - clearml.Auto-Scaler - INFO - New instance b97e702d-e2b3-4f28-adab-be59648601ea listening to test-gpu queue

  				
Posted 
	one year ago

					More  		
  Report
		
					EnchantingPenguin77
				
					0
					 × 1

And how did you connect your example,yaml?

  				
Posted 
	one year ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I did use --args to clearml-task command for this run, but it looks like the docker didn't take it

  				
Posted 
	one year ago

					More  		
  Report
		
					EnchantingPenguin77
				
					0
					 × 1

EnchantingPenguin77 can you provide the full log?

  				
Posted 
	one year ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Show more results

Write your answer

80K Views

38 Answers

one year ago