Hello, I've been using ClearML for a month now, and must say it's a really good product! I'm mostly working with Hugging Face Transformers, I integrated ClearML in my solution:

Answered

Hello,
I've been using ClearML for a month now, and I must say it's a really good product!
I'm mostly working with Hugging Face Transformers, and I have integrated ClearML into my solution:
- task initialization
- task logging (text, scalar and plot)

Now I'm wondering how to properly save the output model. Currently, one binary file is stored automatically because of the underlying call to torch.save. The problem is that transformers has multiple binary files that should all be stored in order to be reused afterwards.
Has anybody found a solution? Does it mean that I should use manual model logging?
Kind regards

  				
Posted 3 years ago by HungryArcticwolf62


Answers 7

Hi AgitatedDove14 , CostlyOstrich36
Thanks for the links. I see that clearml-serving supports a predefined list of engines, and transformers is not included. Do you have any documentation on how one would implement an engine and integrate it into the on-prem version?

  				
Posted 3 years ago by HungryArcticwolf62

HungryArcticwolf62 , I couldn't find anything relevant 😞
AgitatedDove14 , wdyt?

  				
Posted 3 years ago by CostlyOstrich36

Hi HungryArcticwolf62 ,
From what I understand, you simply want to access models afterwards. Correct me if I'm wrong.
What I think would solve your problem is the following:
task = Task.init(...., output_uri=True)
This should upload the model to the server and thus make it accessible to other entities within the system.
Am I on track?
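
For the multi-file case raised in the question, one possible approach is to save the Hugging Face model into a directory and register that whole directory as a single packaged model. The following is a minimal sketch, assuming `clearml` is installed and configured and that the directory was produced by `save_pretrained`. The function names, project name, and task name are hypothetical, and the use of `OutputModel.update_weights_package` should be checked against the ClearML SDK reference for your version.

```python
from pathlib import Path
from typing import List


def list_model_files(model_dir: str) -> List[str]:
    """Return the sorted relative paths of every file under model_dir."""
    root = Path(model_dir)
    return sorted(p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file())


def upload_model_dir(model_dir: str, project: str, task_name: str) -> None:
    """Register a whole directory as one packaged output model (sketch)."""
    # Imported lazily so list_model_files works without clearml installed.
    from clearml import OutputModel, Task

    # output_uri=True asks ClearML to upload model files to its file server.
    task = Task.init(project_name=project, task_name=task_name, output_uri=True)
    output_model = OutputModel(task=task, framework="PyTorch")
    # Package every file in model_dir and upload it as the model weights.
    output_model.update_weights_package(weights_path=model_dir)
    task.close()
```

A typical flow would be to call `model.save_pretrained(out_dir)` and `tokenizer.save_pretrained(out_dir)`, check the contents with `list_model_files(out_dir)`, and then call `upload_model_dir(out_dir, ...)`.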

  				
Posted 3 years ago by CostlyOstrich36

HungryArcticwolf62 a transformer model is, in the end, a PyTorch/TF model with pre/post-processing.
The PyTorch/TF model inference is done with Triton (probably the most efficient engine today), while clearml-serving runs the pre/post-processing on a different CPU machine (making sure we fully utilize all the HW). Does that answer the question?
Latest docs here:
https://github.com/allegroai/clearml-serving/tree/dev

expect a release after the weekend 😉
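
To illustrate the split described above, here is a rough sketch of a pre/post-processing class of the kind clearml-serving loads next to the model. The class and method names follow the pattern used in the clearml-serving examples as far as I understand them, so verify the exact signatures against the repository. The toy vocabulary and whitespace tokenizer are hypothetical stand-ins for a real Hugging Face tokenizer.

```python
import math
from typing import Callable, Dict, List, Optional


class Preprocess:
    """Sketch of a clearml-serving style pre/post-processing class."""

    def __init__(self) -> None:
        # Hypothetical toy vocabulary; a real setup would load a tokenizer here.
        self.vocab: Dict[str, int] = {"[UNK]": 0, "good": 1, "bad": 2, "product": 3}

    def preprocess(
        self,
        body: dict,
        state: dict,
        collect_custom_statistics_fn: Optional[Callable[[dict], None]] = None,
    ) -> List[int]:
        # Runs on the CPU side: turn request text into token ids for the engine.
        words = body["text"].lower().split()
        return [self.vocab.get(word, self.vocab["[UNK]"]) for word in words]

    def postprocess(
        self,
        data: List[float],
        state: dict,
        collect_custom_statistics_fn: Optional[Callable[[dict], None]] = None,
    ) -> dict:
        # Runs on the CPU side: convert raw logits into a label and probabilities.
        peak = max(data)
        exps = [math.exp(value - peak) for value in data]
        total = sum(exps)
        probs = [value / total for value in exps]
        return {"label": probs.index(max(probs)), "probabilities": probs}
```

In this arrangement only the model forward pass would run on the inference engine, while tokenization and decoding stay in plain Python.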

  				
Posted 3 years ago by AgitatedDove14

After you store the model in the ClearML server, accessing it later becomes almost trivial 🙂

  				
Posted 3 years ago by CostlyOstrich36

Actually, this clarifies what I'm trying to achieve. I'm trying to find a way to store the model (I will try using the output_uri argument), and also a way to serve models using clearml-serving. Since I don't yet know how clearml-serving works, I first wanted to archive the correct files.

  				
Posted 3 years ago by HungryArcticwolf62

HungryArcticwolf62 the new clearml-serving is almost out (ETA late next week); you can already start playing with it here:
https://github.com/allegroai/clearml-serving/tree/dev
Example:
train+serve
https://github.com/allegroai/clearml-serving/tree/dev/examples/sklearn

  				
Posted 3 years ago by AgitatedDove14
