Seems like the issue is that the 2 compose apps use different networks, which are not accessible from each other
I wonder if I just need to join 2 docker-compose files to run everything in one session
Actually that could also work
But for reference, when I said IP I meant the actual host network IP, not 127.0.0.1 (which is the same as localhost)
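To make the host-IP point concrete: inside a container, localhost / 127.0.0.1 refers to the container itself, not to the machine running clearml-server. Here is a minimal sketch that flags such values before you put them in example.env. The function name and the sample address 192.168.1.10 are made up for illustration, not taken from clearml-serving.

```python
import ipaddress
from urllib.parse import urlparse


def points_at_loopback(url: str) -> bool:
    """Return True if the URL's host is localhost or a loopback IP.

    Hypothetical helper: a loopback host in CLEARML_API_HOST would make the
    serving container try to reach itself instead of the clearml-server host.
    """
    host = urlparse(url).hostname
    if host is None:
        return False
    if host.lower() == "localhost":
        return True
    try:
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # Not an IP literal (a DNS name); this sketch does not resolve it.
        return False


# Example with assumed values: the first would not work from inside a container.
for candidate in ("http://127.0.0.1:8008", "http://192.168.1.10:8008"):
    print(candidate, "loopback" if points_at_loopback(candidate) else "ok")
```

If any of the three host values comes back as loopback, replace it with the host network IP.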
you should also use my example.env
Hi @<1523706266315132928:profile|DefiantHippopotamus88>
The idea is that clearml-server acts as a control plane and can sit on a different machine; obviously you can run both on the same machine for testing. Specifically, it looks like clearml-serving is not configured correctly, as the error points to an issue with the initial handshake/login between the triton containers and the clearml-server. How did you configure the clearml-serving docker compose?
my example.env
CLEARML_WEB_HOST="
"
CLEARML_API_HOST="
"
CLEARML_FILES_HOST="
"
CLEARML_API_ACCESS_KEY="91SFEX4BYUQ9YCZ9V6WP"
CLEARML_API_SECRET_KEY="4WTXT7tAW3R6tnSi8hzSKNjgkmgUoyv22lYT2FIzIfLoeGERRO"
CLEARML_SERVING_TASK_ID="450231049bba42f69c6507cb774f7dc6"
my clearml.conf
api {
web_server:
api_server:
files_server:
# test 3
credentials {
"access_key" = "91SFEX4BYUQ9YCZ9V6WP"
"secret_key" = "4WTXT7tAW3R6tnSi8hzSKNjgkmgUoyv22lYT2FIzIfLoeGERRO"
}
}
do I need to change anything else?
same thing
clearml-serving-inference | Retrying (Retry(total=236, connect=236, read=240, redirect=240, status=240)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f899dc4e8b0>: Failed to establish a new connection: [Errno 111] Connection refused')': /auth.login
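"Connection refused" (Errno 111) means nothing accepted the TCP connection at that address and port, so the failure happens before any credentials are checked. A rough sketch of a plain TCP reachability check you could run from inside the serving container; the helper name is hypothetical and the host in the usage comment is a placeholder for your host IP.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


# Usage (placeholder address, 8008 being the API port used in this thread):
# print(can_connect("192.168.1.10", 8008))
```

If this returns False from the container but True from the host, the problem is the address or network the container uses, not the keys.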
I tried that, it didn't work. I was confused by the separate port parameter:
CLEARML_SERVING_PORT: ${CLEARML_SERVING_PORT:-8080}
which is the only port-related setting in docker-compose-triton.yml
Can I test /auth.login somehow independently, using curl or any other way? Which address is it supposed to have, and which creds should I use?
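One way to try the login call on its own, as a sketch only: it assumes the API server accepts HTTP Basic auth on /auth.login with the access key as the user name and the secret key as the password, which I have not confirmed here. The helper name and the sample host are hypothetical; use the values from your example.env.

```python
import base64
import urllib.request


def build_login_request(api_host: str, access_key: str, secret_key: str) -> urllib.request.Request:
    """Build (but do not send) a request to <api_host>/auth.login.

    Assumption: the server accepts HTTP Basic auth with the key pair.
    """
    token = base64.b64encode(f"{access_key}:{secret_key}".encode()).decode()
    url = api_host.rstrip("/") + "/auth.login"
    return urllib.request.Request(url, headers={"Authorization": f"Basic {token}"})


# To actually send it (placeholder host and keys):
# req = build_login_request("http://192.168.1.10:8008", "<access_key>", "<secret_key>")
# with urllib.request.urlopen(req, timeout=5) as resp:
#     print(resp.status, resp.read()[:200])
```

A connection error here points at the address or port; an HTTP error response would point at the credentials instead.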
I only got something like this:
clearml-serving-triton | I0701 08:32:58.580705 46 server.cc:250] Waiting for in-flight requests to complete.
clearml-serving-triton | I0701 08:32:58.580710 46 server.cc:266] Timeout 30: Found 0 model versions that have in-flight inferences
clearml-serving-triton | I0701 08:32:58.580713 46 server.cc:281] All models are stopped, unloading models
clearml-serving-triton | I0701 08:32:58.580717 46 server.cc:288] Timeout 30: Found 0 live models and 0 in-flight non-inference requ
What are you getting with:
curl http://<ip>:8008/auth.login
that's strange, maybe you should upgrade it
How do you start docker-compose?
docker-compose --env-file example.env -f docker-compose.yml up -d
I have to step away for a couple of hours
please let me know if you find something wrong
It should also work with host IP and two docker compose files.
I'm not sure where to push a unified docker compose?