Is There a Way to Save the Models Completely on the ClearML Server?

Answered

Is there a way to save the models completely on the ClearML server?

It seems that ClearML Server does not store the models or artifacts itself, but they are stored somewhere else (e.g., AWS S3-bucket) or on my local machine and ClearML Server is only storing configuration parameters and previews (e.g., when the artifact is a pandas dataframe). Is that right?

  				
Posted 2 years ago by VivaciousBadger56


Answers 45

From the one you sent - None

  				
Posted 2 years ago by UnevenDolphin73

We have the following, works fine (we also use internal zip packaging for our models):

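from clearml import OutputModel
# Register an output model on the running task (self.task is the clearml Task)
model = OutputModel(task=self.task, name=self.job_name, tags=kwargs.get('tags', self.task.get_tags()), framework=framework)
model.connect(task=self.task, name=self.job_name)
# cc_model.save() is our internal packaging step; it is expected to return the path of the saved zip
model.update_weights(weights_filename=cc_model.save())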

  				
Posted 2 years ago by UnevenDolphin73

I have already been trying to contribute (I have three pull requests), but honestly it feels a bit weird that I need to update documentation about something I do not understand while I am actually trying to evaluate whether ClearML is the right tool for our company...

  				
Posted 2 years ago by VivaciousBadger56

@SuccessfulKoala55: That is the link I posted as well. But this should also be mentioned wherever the docs discuss external or non-external storage, and everywhere we talk about models or artifacts, etc. Not necessarily in detail, but at least with a sentence and a link.

  				
Posted 2 years ago by VivaciousBadger56

@VivaciousBadger56 I'm not sure I'm following you - is the issue not being able to upload to the ClearML server or to load the downloaded file?

  				
Posted 2 years ago by SuccessfulKoala55

FWIW, we prefer to set it in the agent’s configuration file, then it’s all automatic

  				
Posted 2 years ago by UnevenDolphin73

It is documented at None ... super deep in the code. If you don't know that output_uri in TASK's (!) init is relevant, you would never know...

  				
Posted 2 years ago by VivaciousBadger56

@VivaciousBadger56 It seems like whatever you pickled in the zip file relies on some additional files that are not pickled.

  				
Posted 2 years ago by UnevenDolphin73

We're certainly working hard on improving the documentation (and I do apologize for the frustrating experience)

  				
Posted 2 years ago by SuccessfulKoala55

@VivaciousBadger56 regarding: None
Is this a discussion or a PR?
(General ranting is saved for our Slack channel 🙂)

  				
Posted 2 years ago by AgitatedDove14

Yes, you're correct, I misread the exception.
Maybe it hasn't completed uploading? At least for Datasets one needs to explicitly wait IIRC

  				
Posted 2 years ago by UnevenDolphin73
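
For readers hitting the same "maybe it hasn't completed uploading" question: a minimal sketch of waiting explicitly, assuming a regular `clearml.Task` object. `upload_and_wait` and `FakeTask` are hypothetical names used only for illustration; the stand-in class lets the snippet run without a ClearML server, and the keyword arguments are the ones `Task.upload_artifact` and `Task.flush` accept as far as I know.

```python
def upload_and_wait(task, name, path):
    """Upload a file as an artifact and block until the upload has finished."""
    # wait_on_upload=True makes the call return only after the file is uploaded
    task.upload_artifact(name=name, artifact_object=path, wait_on_upload=True)
    # flush(wait_for_uploads=True) additionally waits for any other pending uploads
    task.flush(wait_for_uploads=True)


class FakeTask:
    """Hypothetical stand-in that records calls instead of talking to a server."""

    def __init__(self):
        self.calls = []

    def upload_artifact(self, **kwargs):
        self.calls.append(("upload_artifact", kwargs))
        return True

    def flush(self, **kwargs):
        self.calls.append(("flush", kwargs))
        return True


# With a real task you would pass the object returned by Task.init(...)
fake = FakeTask()
upload_and_wait(fake, "model-package", "model.zip")
```

Whether this was the actual cause in this thread is not established; it only shows how to rule out an unfinished upload.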

@UnevenDolphin73: Thanks, but it does not mention the File Storage of "ClearML Hosted Server".

  				
Posted 2 years ago by VivaciousBadger56

I can only say I’ve found ClearML to be very helpful, even given the documentation issue.
I think they’ve been working on upgrading it for a while, hopefully something new comes out soon.
Maybe @AgitatedDove14 has further info 🙂

  				
Posted 2 years ago by UnevenDolphin73

Unbelievable! That worked.

  				
Posted 2 years ago by VivaciousBadger56

The documentation is messy, I’ve complained about it in the past too 🙈

  				
Posted 2 years ago by UnevenDolphin73

Heh, good @VivaciousBadger56 😁
I was just repeating what @CostlyOstrich36 suggested, credits to him

  				
Posted 2 years ago by UnevenDolphin73

Heh, well, John wrote that in the first reply in this thread 🙂
And in Task.init main documentation page (nowhere near the code), it says the following -

  				
Posted 2 years ago by UnevenDolphin73

I wouldn't put it past ClearML automation (a lot of stuff depends on certain suffixes), but I don't think that's the case here, hmm.

  				
Posted 2 years ago by UnevenDolphin73

Hi all, sorry for not being so responsive today 🙏

  				
Posted 2 years ago by SuccessfulKoala55

@UnevenDolphin73: I see. I did not make the connection that output_uri=True is what I was missing. I thought this was the default. But the default is actually "None", which is different than "True".

  				
Posted 2 years ago by VivaciousBadger56
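
To make the `output_uri` point above concrete, here is a minimal sketch, assuming the `clearml` package is installed and credentials are configured. The helper names `task_init_kwargs` and `start_task` are hypothetical; only `Task.init` and its `project_name`, `task_name`, and `output_uri` parameters come from ClearML itself. The import is done lazily so the helper can be inspected without a server.

```python
def task_init_kwargs(project_name, task_name, store_on_server=True):
    """Build keyword arguments for clearml.Task.init (hypothetical helper)."""
    kwargs = {"project_name": project_name, "task_name": task_name}
    if store_on_server:
        # output_uri=True asks ClearML to upload model files to the server's
        # file storage. Leaving it unset keeps the default (None), which,
        # as discussed in this thread, is not the same as True.
        kwargs["output_uri"] = True
    return kwargs


def start_task(project_name, task_name):
    """Start a task whose models are uploaded to the ClearML server."""
    # Lazy import: requires `pip install clearml` and a configured clearml.conf
    from clearml import Task
    return Task.init(**task_init_kwargs(project_name, task_name))
```

Calling `start_task("my-project", "train")` (names made up here) at the top of a training script should then be enough for subsequently saved models to be uploaded rather than only referenced locally.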

By the way, output_uri is also documented as part of the Task.init() docstring ( None )

  				
Posted 2 years ago by SuccessfulKoala55

Exactly 🙂

  				
Posted 2 years ago by UnevenDolphin73

@UnevenDolphin73: If I do, what should I configure, and how?

  				
Posted 2 years ago by VivaciousBadger56

@UnevenDolphin73: How do you figure? In the past, my colleagues and I just shared the .zip file via email / MS Teams and it worked. So I don't think so.

  				
Posted 2 years ago by VivaciousBadger56

@SuccessfulKoala55: I referenced this conversation in the issue None

  				
Posted 2 years ago by VivaciousBadger56

But, I guess @CostlyOstrich36 wrote that in a different chat, right?

  				
Posted 2 years ago by VivaciousBadger56

missing a configuration option

Which one, where? Any idea? I did not set output_uri - do I have to do that?

I am referring to

None
None

  				
Posted 2 years ago by VivaciousBadger56

"Messy" is putting it nicely.

  				
Posted 2 years ago by VivaciousBadger56

@UnevenDolphin73

  				
Posted 2 years ago by VivaciousBadger56

@SuccessfulKoala55 Also, I think that - in this case, but also in other cases - the issue is not just the documentation, but also the design of the SDK.

  				
Posted 2 years ago by VivaciousBadger56
