I Seem To Be Missing Something ... I'Ve Only Got One Task Running To Train A Segmentation Model On My Local Machine, And In A Few Days It'S Hit Over 1.15M Api Calls. It Looks Like It'S Sending Every Single Console Output ... Are There Settings To Control

Answered

I seem to be missing something ... I've only got one task running to train a segmentation model on my local machine, and in a few days it's hit over 1.15M API calls. It looks like it's sending every single console output ... are there settings to control what gets logged? I only care about the results from each epoch. I don't need each line of the console posted up ( that's 99% of the API usage right there ). I can't find a way to prevent this and can see each line in the clearml console that's already in my terminal window ( each tick in the progress bar for each epoch seems to be an API call to post that local console output to clearml ). Any tips to stop console from getting sent?

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Votes Newest

Answers 51

It'd be great if it just posted to clearml after each epoch is completed and the CSV with the results gets updated . I only care about using the dashboard to track completed progress . I can use my local computers terminal window to monitor current epoch training . No need to send that to clearml every second ;) Results once an hour or so is fine after each completes :)

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

@<1572395184505753600:profile|GleamingSeagull15> see " Can I control what ClearML automatically logs? " in None (specifically the auto_connect_frameworks argument to Task.init() )

  				
Posted 
	one year ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

@<1523701087100473344:profile|SuccessfulKoala55> You are my hero !!! This is EXACTLY what I needed !!!

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

So, might be in the minority here, but seems like capturing stdout and sending that over to clearml via API should be disabled by default. Like I get maybe capturing stderr, but stdout? In a training scenario, that's MILLIONS of API calls just in progress bar indicators, right? Like it might actually be better for the ClearML servers just in general to make the user turn that on if they want it, otherwise we're just blasting your servers. In my case, I did not even know it was sending that over until I got into digging where these API calls were coming from, and saw the CONSOLE tab in clearml that had every single line of stdout captured.

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

FYI, I did not even know to look into this until I logged in and saw that I was being throttled because I had hit my monthly limit with API calls ( on my very first use of your platform ), and my last dozen or so epochs were just not even logged ( also a bummer ). I only had that one model in training, and thought there was no way I sent over a million API requests, so had to figure out where those were coming from, and tracked it down to that STDOUT, and was like ... wait, what?!?! Found that console tab, which I did not even use before, and saw that screenshot I posted, and was like ... well, there's your problem, ha ha

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Hi @<1572395184505753600:profile|GleamingSeagull15>
Try adjusting:
None
to 30 sec
It will reduce the number of log reports (i.e. API calls)

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Thanks, will do. Heck, for my use case, I only need like once every 10 minutes.

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Is there a place in ClearML that shows Platform Usage? Like, what's actually taking up the API calls?

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

well from 2 to 30sec is a factor of 15, I think this is a good start 🙂

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Under your profile you should be able to see it

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I'm not sure on the frequency it updates though

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I would love to be able to fine tune this as needed, but in my profile I only see a Billings & Usage, and it states at the top that "Usage data is updated once every day" ... and even then, all the shows under "Platform Usage" is number of calls performed, not what those calls were.

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

( under the None page )

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Literally all there is, ha ha

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

is number of calls performed, not what those calls were.

oh, yes this is just a measure of how many API calls are sent.
It does not really matter which ones

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

(Not sure it actually has that information)

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

well, in my case, if I am trying to make sure I do not go over the allotted usage, it matters, as I am already hitting the ceiling and I have no idea what is pushing this volume of data

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Since it's literally something we have to pay for ( which I signed up to do ) I would love to know what drives this cost

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

If you do not have a lot of workers, that I would guess console outputs

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

In case of scalars it is easy to see (maximum number of iterations is a good starting point

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

might be a feature request then, as ya, having transparency into something we are charged for would be nice. At this point, I have zero idea what is driving this usage and just want to make sure the costs for training do not bloat too much. I personally am just using ClearML as a central dashboard for a few people. I don't need it to be live data, I just need a rough overview of progress. Even if it only posted updates to ClearML once an hour, that is honestly fine.

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

I guess last followup question, is there a way to cap costs? Like if this is running at this scale, I am not sure I can use ClearML for my purpose if I am just going to get overage charged repeatedly ( which I am already looking like I will be doing ).

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Hmm if this is case, you can add some prints in here:
None
the service/action will tell you what you are sending
wdyt?

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I guess last followup question, is there a way to cap costs?

Scale tier ? (I know it is not per usage, but it is probably more than 15$ per user 🙂 )

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

hmmm, this is just a personal project, honestly was just hoping this would let me take the results of each epoch and put it in a central dashboard. Having this generate 1M+ api calls and only being like 1/4 of the way though training is a bit much. Current pricing is $1/100K API calls at the PRO tear, which I am on ... so it would be like another $50 just in API calls at this pace 😞 Would love to just cap it at a fixed amount for a month for API calls.

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

But I will try to set the reduce the number of log reports first

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Just wish I could actually see somewhere what is being sent over API so I could know where to focus my efforts to refine this kind of stuff 😉

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Would love to just cap it at a fixed amount for a month for API calls.

Try the timeout configuration, I think this shoud solve all your issues, and will be fairly easy to set for everyone

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

this one, right ? report_period_sec in ~/clearml.conf correct ?

  				
Posted 
	one year ago

					More
				  		
  Report
		
					GleamingSeagull15
				
					0
					 × 1

Correct

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Show more results

Write your answer

35K Views

51 Answers

one year ago