Hello, I’M Seeing This Issue When Launching A Machine With The App –

Answered

Hello, I’m seeing this issue when launching a machine with the app –
2022-07-05 09:55:49,759 - usage_reporter - INFO - Sending usage report for 60 usage seconds, 1 units 2022-07-05 10:56:40 2022-07-05 09:56:36,826 - clearml.Auto-Scaler - INFO - ignoring unknown worker: 'ip-172-16-30-141:0' 2022-07-05 09:56:36,862 - clearml.Auto-Scaler - INFO - Found 1 tasks in queue 'aws-cpu-clear-managed-autoscale' 2022-07-05 09:56:36,862 - clearml.Auto-Scaler - INFO - Spinning new instance resource='clear_managed_cpu_machines', prefix='clear_machines_pool_4', queue='aws-cpu-clear-managed-autoscale' 2022-07-05 09:56:37,846 - clearml.Auto-Scaler - ERROR - Failed to start new instance (resource 'clear_managed_cpu_machines'), Error: unsupported operand type(s) for *: 'NoneType' and 'int' Traceback (most recent call last): File "/root/.clearml/venvs-builds/3/task_repository/clearml-apps.git/apps/auto_scaler/auto_scaler.py", line 311, in supervisor instance_id = self.driver.spin_up_worker(resource_conf, worker_prefix, queue, task_id=task_id) File "/root/.clearml/venvs-builds/3/task_repository/clearml-apps.git/apps/auto_scaler/cloud_driver.py", line 150, in spin_up_worker instance_id, region = self._spin_up_worker(resource_conf, worker_prefix, queue_name, task_id) File "/root/.clearml/venvs-builds/3/task_repository/clearml-apps.git/apps/auto_scaler/aws_driver.py", line 134, in _spin_up_worker "MaxAttempts": int(resource_conf["regular_instance_rollback_timeout"] * 60 / SPOT_INSTANCE_WAITER_DELAY) + 1 TypeError: unsupported operand type(s) for *: 'NoneType' and 'int' 2022-07-05 09:56:38,344 - clearml.Auto-Scaler - INFO - Idle for 60.00 secondsIt also repeats every ~15 minutes and leaves running machines behind (seemingly launches new machine every time). So I’m left with +/- 105 running unusable cloud machines from yesterday just because they haven’t been shut down by the launcher script 🤯

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					SparklingHedgehong28
				
					0
					 × 1

Votes Newest

Answers 6

Hi SparklingHedgehong28 - are you using your own self-hosted server? Are you running this AutoScaler yourself?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

SparklingHedgehong28 is this a new instance created or a clone from some other former scaler instance?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					TimelyPenguin76
				
					0
					 Administrator

It’s a clone of a previous one, since I’ve failed -> cloned -> changed params -> failed -> clone -> …

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					SparklingHedgehong28
				
					0
					 × 1

Nope, this is the ‘autoscaler app’ from the web interface of the SaaS. Nothing self-hosted at the moment.

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					SparklingHedgehong28
				
					0
					 × 1

You can also try with a new one, not a cloned task, just create a new instance with the ➕

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					TimelyPenguin76
				
					0
					 Administrator

It seems for some reason this value does not have a default value... I think the best thing for now is to clone and provide it explicitly. We'll obviously fix that for the next release 🙂

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

Write your answer

1K Views

6 Answers

2 years ago

one year ago