Hi @<1523701205467926528:profile|AgitatedDove14> !
Thank you again for coming back to me with your support!
- 👍 Thank you, I have noticed that (when using
model auto-update
) different model versions (with their ownmodel_id
) appear under "model_monitoring_eps" section of the serving service. - 👍 It's now clear to me that I was always accessing the static endpoint (that I created with
model add
) with mycurl
command.
I retested automatic model deployment with your suggestion of not usingmodel add
at all, but instead onlymodel auto-update
to start the endpoint (since I precisely want automatic and not manual model deployment as you explained, in order to automatically select latest model with "released" tag).
Your solution looks very promising! 🏆 Unfortunately, I encounter following RPC error when using the curl
command ( curl -X POST "
None " -H "accept: application/json" -H "Content-Type: application/json" -d '{"url": "
None "}'
):
{"detail":"Error processing request: <AioRpcError of RPC that terminated with:\n\tstatus = StatusCode.NOT_FOUND\n\tdetails = \"Request for unknown model: 'test_model_pytorch_auto/1' is not found\"\n\tdebug_error_string = \"UNKNOWN:Error received from peer ipv4:172.18.0.5:8001 {grpc_message:\"Request for unknown model: \\'test_model_pytorch_auto/1\\' is not found\", grpc_status:5, created_time:\"2023-11-22T09:47:10.632169727+00:00\"}\"\n>"}
This proves that the endpoint "test_model_pytorch_auto/1" effecively exists ( 🎉 ), because in contrast when I try to perform inference with an endpoint that doesn't exist, e.g. "test_model_pytorch_auto/2" (with curl -X POST "
None " -H "accept: application/json" -H "Content-Type: application/json" -d '{"url": "
None "}'
), I get following error message (see attached picture):
{"detail":"Error processing request: Model inference endpoint 'test_model_pytorch_auto/2' not found"}
I don't have much insight into why the inference isn't working (since I'm using the exact same command you suggest and also the same one that's in the tutorial , and I'd be curious to know if you have any idea why the "test_model_pytorch_auto/1" model can't be found 🤔 . There is, after all, a link between the "train pytorch model" and the serving service when I look under the "LINEAGE" section of the model (see attached image).
Have you ever come across a similar case?
Thanks again in advance for your help! 🙇
Have a nice day ☀