Are There Instructions Somewhere On How I Can Use Clearml-Agent To Run Jobs On My Google-Cloud Compute Engine?

Answered

Are there instructions somewhere on how I can use clearml-agent to run jobs on my google-cloud compute engine?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

Votes Newest

Answers 13

Actually, no. This is ti spin the clearml-server on GCP, not the agent

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I think I am missing one part — which command do I use on my local machine, to indicate the job needs to be run remotely? I’m imagining something like
clearml-remote run python3 my_train.py

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

AgitatedDove14 thanks yes I assume I would follow these instructions:
https://clear.ml/docs/latest/docs/deploying_clearml/clearml_server_gcp

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

(and a way to specify which remote server)

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

got it, nice, thanks

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

thanks, so I got clearml-task working, sent to a queue and the agent on gcp picked it up. I had a question — for a job that runs on the order of minutes, it’s not worth re-creating the whole python virtual env from scratch on the remote (that itself takes 5mins). So is the --folder ` option meant for running it in an existing folder in an existing virtual env?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

Yes, which looks like a lot, but you only need to d that once.
Auto scheduler would make (1) redundant (as it would spin the instance up/down based on the jobs in the queue)

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

... indicate the job needs to be run remotely? I’m imagining something like

clearml-task and you need to specify the queue to push your Task into.
See here: https://clear.ml/docs/latest/docs/apps/clearml_task

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I guess I follow these steps on a GCP instance?
https://clear.ml/docs/latest/docs/clearml_agent

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

Exactly !

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

So if I want to train with a remote agent on a remote machine, I have to:
spin up clearml-agent on the remote create a dataset using clearml-data, populate with data… from my local machine use clearml-data to upload data to google gs:// bucket modify my code so it accesses data from the dataset as here https://clear.ml/docs/latest/docs/clearml_data/clearml_data_sdk#accessing-datasetsAm I understanding right?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

👍

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					HurtWoodpecker30
				
					0
					 × 1

HurtWoodpecker30 currently in the open source only AWS is supported, I know the SaaS pro version supports it (I'm assuming enterprise as well).
You can however manually spin an instance on GCP and launch an agent on the instance (like you would on any machine)

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Write your answer

1K Views

13 Answers

2 years ago

one year ago