Is There A Tutorial For Clearml Serving? I Followed The Steps On Its Repo But I Still Don'T Understand It. Also The Serving Engine Keeps Failing After A While. I Also Don'T Know How To Access The Serving Engine Or How To Send Inference Requests To It.

Answered

Is there a tutorial for ClearML Serving? I followed the steps on its repo but I still don't understand it. Also the serving engine keeps failing after a while. I also don't know how to access the serving engine or how to send inference requests to it.

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					VexedCat68
				
					0
					 × 1

Votes Newest

Answers 8

Hi Fawad, maybe this can help you get started! They're both c++ and python examples of triton inference. Be careful though, the pre and postprocessing used is specific to the model (in this case yolov4) and you'll have to change it to your own model's needs

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					ExasperatedCrab78
				
					0
					 × 1

Do you have a log of the triton server?

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					CostlyOstrich36
				
					0

I'll take a look thank you

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					VexedCat68
				
					0
					 × 1

Also the tutorial mentioned serving-engine-ip as a variable but I have no idea what the ip of the serving engine is.

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					VexedCat68
				
					0
					 × 1

I want to serve using Nvidia Triton for now.

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					VexedCat68
				
					0
					 × 1

https://github.com/isarsoft/yolov4-triton-tensorrt/tree/master/clients

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					ExasperatedCrab78
				
					0
					 × 1

I'm currently installing nvidia docker on my machine, where the agent resides. I was also getting an error regarding gpu not being available in docker since the agent was running on docker mode. I'll share update in a bit. Trying to re run the whole set up

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					VexedCat68
				
					0
					 × 1

I've finally gotten the triton engine to run. I'll be going through nvidia triton docs to find how to make an inference request. If you have an example inference request, I'll appreciate if you can share it with me.

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					VexedCat68
				
					0
					 × 1

Write your answer

2K Views

8 Answers

3 years ago

2 years ago