Reputation
Badges 1
53 × Eureka!there’s a PR coming with example values: https://github.com/allegroai/clearml-helm-charts/pull/234
I mean this blob is then saved on the fs
SuccessfulKoala55 after looking at the issue I’m a bit confused 😄 ; as far as I can see there is no way to pass any parameter to clearml-agent in daemon mode to push log to stdout. Can you confirm it? (If yes I need to find some workaround)
I wouldn’t say it’s related RBAC because the issue seems Networking realted so connection timed out
maybe this can cause the issue?
Do you have Ingresses enabled?
but I will try to find something good for you
Hi Ofir, ty for feedback
Will cook something asap
I’m going to investigate (and fix it if possible) in some day
how do you point tasks to git repo?
usually you can see if you are getting timeouts or wrong http code
Can you pls share all 3 health checks ?
# Point to the internal API server hostname APISERVER= `
Path to ServiceAccount token
SERVICEACCOUNT=/var/run/secrets/kubernetes.io/serviceaccount
Read this Pod's namespace
NAMESPACE=$(cat ${SERVICEACCOUNT}/namespace)
Read the ServiceAccount bearer token
TOKEN=$(cat ${SERVICEACCOUNT}/token)
Reference the internal certificate authority (CA)
CACERT=${SERVICEACCOUNT}/ca.crt
Explore the API with TOKEN
curl --cacert ${CACERT} --header "Authorization: Bearer ${TOKEN}" -X GET ${A...
I’m just going to include these credentials in pod template too
still need time because I have two very busy days
the goal is to get healthchecks green so ALB should be able to work
if it turns 503 it’s not network but something on top of it
Hi, atm we still didn’t tested chart against a k8s 1.29.*. That’s why we didn’t ;t included it for now; I plan to add support by this week since I’m m preparing for a new testing cluster so if you waiut a couple of day you should be good.\
this configuration object is stored as a file in /root/.trains ?
can you change the path in ALB healthcheck pls?
uh it would be interesting if you can include it in chart and push a PR :D
first I noticed a mistake I did when suggesting config, this:
There’s an incomplete PR for this None .
for fileserver the persistent volume need to be provisioned by a storageclass. Ususally I always set it to standard becase it’s commonly used in public cloud providers