Is Clearml-Serving Using Either System Or Cuca Shared Memory? Or Planning To? In Our Experiments Using Perf_Analyzer The Shared Memory Experiments Showed A Huge Improvement And If We Wanted To Look Into This, Do You Have Any Pointers Of Where We Can Do T

Answered

Is ClearML-Serving using either System or CUCA shared memory? Or planning to? In our experiments using perf_analyzer the shared memory experiments showed a huge improvement

And if we wanted to look into this, do you have any pointers of where we can do the whole registering/transferring of data into shmem?

  				
Posted 
	one year ago

					More
				  		
  Report
		
					TimelyRabbit96
				
					0
					 × 1

Votes Newest

Answers 6

@<1523701205467926528:profile|AgitatedDove14> Thanks

  				
Posted 
	one year ago

					More
				  		
  Report
		
					SillyRobin38
				
					0
					 × 1

I see, okay so using

shm_size: '2gb'

we still need to modify the infrence logic to register and input and output on shmem, no?

  				
Posted 
	one year ago

					More
				  		
  Report
		
					TimelyRabbit96
				
					0
					 × 1

@<1657918706052763648:profile|SillyRobin38> ^

  				
Posted 
	one year ago

					More
				  		
  Report
		
					TimelyRabbit96
				
					0
					 × 1

Sorry @<1657918706052763648:profile|SillyRobin38> I missed this reply

Is ClearML-Serving using either System or CUCA shared memory? O

This needs to be set on the docker-compose:
and I think this line actually includes ipc: host which means there is no need to set the shm_size, but you can play around with it and let me know if you see a difference
None

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Hi @<1547028116780617728:profile|TimelyRabbit96>
Notice that if running with docker compose you can pass an argument to the clearml triton container an use shared mem. You can do the same with the helm chart

  				
Posted 
	one year ago

					More
				  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

@<1523701205467926528:profile|AgitatedDove14> Actually our meant is something like the following example from the triton client examples:

None

Does clearml has any example for using shared memory? or it's out of context for clearml?

  				
Posted 
	one year ago

					More
				  		
  Report
		
					SillyRobin38
				
					0
					 × 1

Write your answer

1K Views

6 Answers

one year ago