Reputation
Badges 1
53 × Eureka!this is the PR: https://github.com/allegroai/clearml-helm-charts/pull/80 https://github.com/allegroai/clearml-helm-charts/pull/80 will merge it soon so agent chart 1.0.1 will be released
This is pretty weird. If pv containing mongodb data is still the same data must be there. what storageclass provider are you using?
this is strange, I have a lot of clusters that went trough nodes issue but I never lost data
and are you sure you mongodb respawned on same node?
not sure if provisioner did something unexpected
Let me check and I will eventually get back with a PR
at task completion do you get state Completed in UI?
there are processes listening on host ports?
with that said Iβd start trying to work on localhost just to focus on the real problem and then I would move outside
It happened to me when trying many installations; can you login using http://app.clearml.home.ai/login url directly ?
With Helm we are not running in service-mode. If pod get evicted or killed we should investigate what is the reason behind that; there are any logs on kille dpod that can help us understand better the situation?
especially if itβs evicted, it should be due increasing resource usage
Iβm going to investigate this specific use case and will get back to you
Basically you can install lates clearml chart
pretty weird; I have some issues with ceph in the past but never something like that
ty AgitatedDove14 , your fixes work like a charm. As reward I opened another one https://github.com/allegroai/clearml/issues/423 sorry for that π
uh, using clearml-task
params π
I donβt think itβs possible to setup queues in advance with any ClearML chart env var but Iβm not 100% sure. SuccessfulKoala55 can you pls clarify this?
Iβm going to investigate (and fix it if possible) in some day
I think yes, at least this is whatI saw in docs
output_dest works:)
About last point: I would like to retrieve the pipeline 'output_dest' and use it as a parameter in adding steps, is that possible?