Good morning folks, I am setting up ClearML on a (self-hosted) K8s cluster using the
SarcasticSquirrel56
If I manually configure the pods for the different nodes, how do I make the ClearML server aware that those agents exist?
Basically, the agents register themselves with your clearml-server, and they also register which queue(s) they listen to. In other words, you choose between the different types of machines/GPUs by enqueuing the Task to different queues.
For example: Queue(1): "CUDA11_GPUx1" , Queue(2): "CUDA10_GPUx1"
Does that make sense?
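To make the queue idea concrete, here is a minimal sketch in Python. It assumes each agent was started listening on one of the two queues from the example (for instance with `clearml-agent daemon --queue CUDA11_GPUx1`) and that the machine running the script has a working `clearml.conf`. The helper names `pick_queue` and `enqueue_task` and the `QUEUES` table are made up for illustration; only `Task.enqueue` comes from the ClearML SDK.

```python
# Queue names taken from the example above. The mapping itself is an
# assumption for illustration: one queue per (CUDA major version, GPU count).
QUEUES = {
    ("11", 1): "CUDA11_GPUx1",
    ("10", 1): "CUDA10_GPUx1",
}


def pick_queue(cuda_major: str, gpus: int = 1) -> str:
    """Hypothetical helper: map a hardware requirement to a queue name."""
    try:
        return QUEUES[(cuda_major, gpus)]
    except KeyError:
        raise ValueError(
            f"no queue defined for CUDA {cuda_major} with {gpus} GPU(s)"
        ) from None


def enqueue_task(task_id: str, cuda_major: str, gpus: int = 1) -> str:
    """Send an existing Task to the queue that matches the requested hardware.

    Whichever agent listens on that queue will pull the Task and run it.
    """
    # Imported lazily so the mapping above can be used without the SDK installed.
    from clearml import Task

    queue_name = pick_queue(cuda_major, gpus)
    Task.enqueue(task_id, queue_name=queue_name)
    return queue_name
```

Calling `enqueue_task("<your-task-id>", "11")` would then hand the Task to whichever agent is listening on `CUDA11_GPUx1`; the server never needs to be told about the agents separately, because each agent announces itself when it starts polling its queue.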
EDIT:
I guess to achieve what I want, I could disable the agent using the helm chart values.yaml
and then define pods for each of the agents on their respective nodes
That might be the case. I have to admit I can't remember how flexible the helm chart is in this regard...
2 years ago
one year ago