
Reputation
Badges 1
53 × Eureka!I’m going to investigate (and fix it if possible) in some day
Ok, let’s try to deep dive into it, what is the Helm chart version used for this deployment?
moreover url exposed by nginx should be under https
about clearml-agent, just set resources in basepodtemplate (cpu gpu ram) so you will have a specific definition
not a big issue but you maybe worth a quick fix
I need to investigate, ScrawnyLion96 can you pls open an issue on https://github.com/allegroai/clearml-helm-charts ?
I’m just going to include these credentials in pod template too
h i, if I’m not wrong, mongodb doesn’t have an ARM image (and you are using a Silicon one like me ofc 😄 )
not official but should work
In k8s there’s no services but just clearml-agent (k8sglue). you can set any definition you want for spawned pods in this section: https://github.com/allegroai/clearml-helm-charts/blob/503ab437adc5d4f9b7b1037e2af143d47da24048/charts/clearml-agent/values.yaml#L132
look also at the monitoring tab
did you tried to create a debug pod with a mount using ceph storageclass? you can start from here https://downey.io/notes/dev/ubuntu-sleep-pod-yaml/ then add the pvc and the mount. then you should exec into the pod and try to write a dummy file on the mount; I suspect the problem is there
ok so they are executed as expected
Hi BeefyHippopotamus73 , on EKS it’s preferrable to use ALB but you can also work with your nginx. You need DNS records with hostnames you setup pointing to that External IP. If you just need to test, you can simply add entries in you client machine /etc/hosts
file (if you are on *nix)
can you change the path in ALB healthcheck pls?
I can add some configurable value then ASAP 👍 will do in next days
with the right svc names
yes, exactly, agent creates and manages task pod lifecycle
ReassuredArcticwolf33 PR is coming https://github.com/allegroai/clearml-helm-charts/pull/84
but I can be wrong, give me 30 mins while I recreate same local installation with samecchart so I can see if something is wrong
can you pls put the entire helm list -A
output command?
elastic is not being scheduled