I Seem To Be Missing Something ... I'Ve Only Got One Task Running To Train A Segmentation Model On My Local Machine, And In A Few Days It'S Hit Over 1.15M Api Calls. It Looks Like It'S Sending Every Single Console Output ... Are There Settings To Control

Answered

I seem to be missing something ... I've only got one task running to train a segmentation model on my local machine, and in a few days it's hit over 1.15M API calls. It looks like it's sending every single console output ... are there settings to control what gets logged? I only care about the results from each epoch. I don't need each line of the console posted up ( that's 99% of the API usage right there ). I can't find a way to prevent this and can see each line in the clearml console that's already in my terminal window ( each tick in the progress bar for each epoch seems to be an API call to post that local console output to clearml ). Any tips to stop console from getting sent?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Votes Newest

Answers 51

I am running this on a 3090 GPU locally, just been letting it run for about two weeks now I think. Just have the one GPU, ha ha. It's at epoch 368 out of the 1,000 I have it set to cap out on ( if it does not hit the default YOLO "patience" limit of 50 before then and self terminate ).

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

My training is on roughly 50 classes as a subset of the Open Images Dataset for Segmentation

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

each epoch runs about 55 minutes, and that screenshot I posted earlier kind of show the logs for the rest of the info being output, if you wanted to check that out None

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Literally all there is, ha ha

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

I would love to be able to fine tune this as needed, but in my profile I only see a Billings & Usage, and it states at the top that "Usage data is updated once every day" ... and even then, all the shows under "Platform Usage" is number of calls performed, not what those calls were.

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Actually looking at the counts today, they've barely changed. So I think this actually fixed it, and was just that the counts are only updated daily so I needed to get 48 hours out from when I made the change to see clean results to assure no spill over counts from previous days.

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Under your profile you should be able to see it

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Maybe ClearML is using tensorboard in ways that I can fine tune? I saw there was a manual way if you were not using tensorboard to send over data, but the videos I saw from your team used this solution when demoing YOLOv8 on YouTube ( there were a few collab videos your team did with theirs, so I just followed their instructions ). But my gut is telling me that might be the issue for the remaining data being sent over that I have no insight into.

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

If you do not have a lot of workers, that I would guess console outputs

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I appreciate your help @<1523701205467926528:profile|AgitatedDove14> 🙂

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

I did notice that the last 24 hours I dropped quite a bit, so my theory that the 140K might have some spillover from previous day might have been correct. Last 24 hours went from 1.24M to 1.32M, so about half as much as the day before, with the same training running.

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

But I will try to set the reduce the number of log reports first

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

It was at 1.1M when I shut it down yesterday, and today it's at 1.24M

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

this one, right ? report_period_sec in ~/clearml.conf correct ?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

@<1572395184505753600:profile|GleamingSeagull15> see " Can I control what ClearML automatically logs? " in None (specifically the auto_connect_frameworks argument to Task.init() )

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

( under the None page )

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

I guess last followup question, is there a way to cap costs? Like if this is running at this scale, I am not sure I can use ClearML for my purpose if I am just going to get overage charged repeatedly ( which I am already looking like I will be doing ).

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

I'm not sure on the frequency it updates though

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Came to ClearML since it had slick dashboard and showed me the info that mattered. Loved that I could share the results of each epoch so we could make sure things were headed in the correct direction.

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

is number of calls performed, not what those calls were.

oh, yes this is just a measure of how many API calls are sent.
It does not really matter which ones

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I guess last followup question, is there a way to cap costs?

Scale tier ? (I know it is not per usage, but it is probably more than 15$ per user 🙂 )

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Show more results

Write your answer

144K Views

51 Answers

2 years ago