Unanswered
Can We Use The Simple Docker-Compose.Yml File For Clearml Serving On A Huggingface Model (Not Processed To Tensorrt)?
Sure! This is an example of running a custom model. It basically boils down to defining a preprocess, process and postprocess
function. Inside the process
function can be anything, including just a basic call to huggingface to run inference 🙂
I have not tested this myself mind you, but I see no reason why it wouldn't work!
In fact, I think even Triton itself supports running on CPU these days, so you still have the option :)
165 Views
0
Answers
one year ago
one year ago