I see, okay so using
shm_size: '2gb'
we still need to modify the infrence logic to register and input and output on shmem, no?
@<1523701205467926528:profile|AgitatedDove14> Thanks
Sorry @<1657918706052763648:profile|SillyRobin38> I missed this reply
Is ClearML-Serving using either System or CUCA shared memory? O
This needs to be set on the docker-compose:
and I think this line actually includes ipc: host which means there is no need to set the shm_size, but you can play around with it and let me know if you see a difference
None
@<1523701205467926528:profile|AgitatedDove14> Actually our meant is something like the following example from the triton client examples:
Does clearml has any example for using shared memory? or it's out of context for clearml?
@<1657918706052763648:profile|SillyRobin38> ^
Hi @<1547028116780617728:profile|TimelyRabbit96>
Notice that if running with docker compose you can pass an argument to the clearml triton container an use shared mem. You can do the same with the helm chart