Hi folks, I did a deployment of ClearML using the K8s Helm chart, and I set up the agent using the K8s glue.
I ran a task locally, then went to the UI, cloned the experiment, and scheduled it in the default queue.
After Doing This, I See That The Experiment Is Q
Martin, I told you, unfortunately I can't access the resources in the cluster
😞
So it seems there is some misconfiguration of the k8s glue: we can see it can "talk" to the clearml-server, but it seems to fail to actually create the k8s pod/job. I would start by debugging the k8s glue (not the services agents). Regardless, I think the next step is to get a log of the k8s glue pod, so we can better understand the issue.
wdyt?
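For whoever does have cluster access, here is a rough sketch of how the glue pod's log could be pulled and skimmed for obvious failures. It only wraps the standard `kubectl logs` command with its `--namespace`, `--selector`, `--tail` and `--all-containers` flags. The namespace and label selector you pass in depend on how the Helm chart was installed, so the values shown in the usage note are placeholders, and the keyword list is just a guess at what a failed pod/job creation might print.

```python
import shutil
import subprocess


def build_logs_command(namespace, selector, tail=200):
    """Build a `kubectl logs` command for all pods matching a label selector."""
    return [
        "kubectl", "logs",
        "--namespace", namespace,
        "--selector", selector,
        f"--tail={tail}",
        "--all-containers=true",
    ]


def fetch_glue_logs(namespace, selector, tail=200):
    """Run kubectl and return stdout plus stderr as a single string."""
    if shutil.which("kubectl") is None:
        raise RuntimeError("kubectl was not found on PATH")
    result = subprocess.run(
        build_logs_command(namespace, selector, tail),
        capture_output=True,
        text=True,
        check=False,  # keep the output even if kubectl exits non-zero
    )
    return result.stdout + result.stderr


def suspicious_lines(log_text, keywords=("error", "forbidden", "failed", "exception")):
    """Return log lines containing any keyword (case-insensitive).

    The keyword list is an assumption, not something taken from ClearML.
    """
    return [
        line for line in log_text.splitlines()
        if any(keyword in line.lower() for keyword in keywords)
    ]
```

Something like `suspicious_lines(fetch_glue_logs("clearml", "app=my-glue-label"))` would then print the candidate lines, where both `"clearml"` and `"app=my-glue-label"` are hypothetical and need to be replaced with the real namespace and the label actually set on the glue pod (`kubectl get pods --show-labels` shows it).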