Is it necessary to serve a Keras model using the Triton engine? I'm trying to serve an endpoint and debug it, but the error I get isn't helping much. Is there a flag I can pass to see more logs?

Posted 10 months ago
Hi @StoutGorilla30

Is it necessary to serve a Keras model using the Triton engine?

It is not, but it is the most efficient way to serve Keras models, which is why clearml-serving uses Nvidia Triton by default (we are talking about a 10x difference).
I would start with the keras example, check that it works, and then work your way toward your own model (notice that you always need to provide the model's input and output layers).
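
If you are not sure what your model's input and output layers look like, something along these lines might help you list them. This is only a minimal sketch: it assumes a Keras-style model object with `inputs` and `outputs` lists whose items have `name`, `shape` and `dtype`. The helper `describe_io` and the layer names in the stand-in model are hypothetical, and the names Triton expects may not match the tensor names exactly, so compare the result against the keras example.

```python
from types import SimpleNamespace


def describe_io(model):
    """Return (inputs, outputs) as lists of dicts with name, shape and dtype.

    Works with any object exposing Keras-style `inputs` / `outputs` lists
    whose items have `name`, `shape` and `dtype` attributes.
    """
    def _describe(tensor):
        # Unknown dimensions (None, typically the batch axis) are written as -1.
        shape = [-1 if dim is None else int(dim) for dim in tensor.shape]
        # dtype may be a plain string or an object with a `.name` attribute.
        dtype = getattr(tensor.dtype, "name", tensor.dtype)
        return {"name": tensor.name, "shape": shape, "dtype": str(dtype)}

    inputs = [_describe(t) for t in model.inputs]
    outputs = [_describe(t) for t in model.outputs]
    return inputs, outputs


# Hypothetical stand-in so the sketch runs without TensorFlow installed.
# With a real model you would pass the loaded Keras model instead.
fake_model = SimpleNamespace(
    inputs=[SimpleNamespace(name="dense_input", shape=(None, 784), dtype="float32")],
    outputs=[SimpleNamespace(name="activation_2", shape=(None, 10), dtype="float32")],
)

model_inputs, model_outputs = describe_io(fake_model)
print("inputs:", model_inputs)
print("outputs:", model_outputs)
```

The printed names, shapes and types are the kind of details you would then pass along when registering the endpoint.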

You could do everything with a custom endpoint instead, but I would not suggest that for anything at scale.

Posted 10 months ago
