Hi, yet again i come with a problem i cant see a fix for this issue that is bugging me for days.
I want to serve a Gpt2 model and i have the onnx uploaded to the server. When i try to mount the endpoint the server will try to find model.onnx as it is intended but it will not find since it got renamed to model.bin as moment of the service starting but if i go
clearml-serving --id 12e416036c4b4cd38b9fd3a46c85a583 model add --engine triton --endpoint "gpt2_onnx" --model-id 75159e2de62142fb9958e416807e3d1a --preprocess preprocess.py --input-size "[-1, -1]" "[-1, -1]" "[-1, -1]" --input-type int64 int64 int64 --input-name "input_ids" "attention_mask" "token_type_ids" --output-size "[-1, -1, 50257]" --output-type float32 --output-name "logits" --aux-config platform=\"onnxruntime_onnx\" max_batch_size=8 default_model_filename=\"model.bin\"
i will get an error "Error: ERROR: You have default_model_filename
in your config pbtxt, please remove it. It will be added automatically by the system."
Is there any fix for this?
Thanks again