Hey there! Quick question about clearml-agent, docker and conda. I’m trying to use conda as package manager with an agent, but I get the following error mess...
2 years ago
Hi there! Is there an easy way to retrieve the site-package directory that was created by an agent from inside a task? Eg. task = Task.init(...) task.add_req...
2 years ago
Hi all! Having a bit of an ssh issue, basically I'm running the clearml k8s-glue agent in a pod in k8s, which happily spins up new pods on a managed node gro...
2 years ago
Hello, Does anyone know how to bypass package management by clearml if using docker mode? I want to achieve the following - Run agent in docker mode Push a t...
2 years ago
Hi, do you know more or less how many workers can server work with. Have you make such stress-tests? The thing is that we have some power run agent on (for e...
2 years ago
Hi all! When I try to run tasks for a agent on machine without GPU this error occurs: docker: Error response from daemon: OCI runtime create failed: containe...
2 years ago
Hello everyone ! As you can observe in attached snipped, in my code i freeze the env, and the agent install every cached dependency with the same version. Is...
2 years ago
Hello! I have a quick question about the ClearML Hyperparameter Optimizations module. Is it possible to use it without using the clearml agent system? In oth...
2 years ago
Hello, when running a task with a remote interpreter i get clearml.Task - INFO - No repository found, storing script code instead . I am using Pycharm and i ...
2 years ago
Has anyone got any experience with C++ extensions in Python when using ClearML? In our setup.py we have: ext_modules=[ Extension( "file_io.extio", sources=["...
2 years ago
Hi all! Could do with some help on running registered task on a clearml-agent. My workflow so far is as follows: Execute a local training run (from within a ...
2 years ago
[WebUI-based options injection not working] Hey everyone! Since our training repo has gotten quite complex, we configure all setup in an options.yml file whi...
2 years ago
Is there any examples of mounting an AWS EFS mount to a self hosted k8 agent deploy? https://github.com/allegroai/clearml-helm-charts/blob/main/charts/clearm...
2 years ago
Hi, I just started using clearml, and it is amazing! However I've run into an issue - I have a windows machine which I've setup as a worker, with an agent ru...
Hey, would it be possible to add a way to edit autoscaler configs without having to clone them ? This is really frustrating especially when you reached the q...
2 years ago
Hello everyone ! Problem description: I have my virtual environment (conda) in which I do have detectron2 installed. When I run the task locally it is workin...
Heya, is there any plan for ClearML to leverage the new https://developer.nvidia.com/blog/getting-the-most-out-of-the-a100-gpu-with-multi-instance-gpu/ tech ...
2 years ago
Hi, I was trying to test the autoscaler feature, but I am getting the following error: 2022-10-21 02:06:43,599 - clearml.Auto-Scaler - INFO - Failed to start...
2 years ago
Hi, I am trying to test the whole stack on my local machine but I am unable to make clearml-agent and clearml-serving work. Every time I get: > clearml_agent...
2 years ago
Hey, currently on a PRO SaaS deployment using pipelines with the GCP autoscaler and setting some credentials into the init scripts of my provisionned VMs (in...
2 years ago
Heya, trying to setup a GCP autoscaler for general-purpose CPU instances ( e2-standard-4 ) but I get into that error: googleapiclient.errors.HttpError: And I...
2 years ago
I have a general question about this part in dynamic GPU allocation. If for example I have a machine that has 8 GPUs and I have 3 queues: Queue1 will take 3G...
2 years ago
Hey there, since a bit I often find experiments being stuck while training a model. It seems to happen randomly and I could not find a reproducible scenario ...
2 years ago
why it is not running if I specify the agent on the docker mode , usually I just run "clearml-agent daemon" and it works but today I am trying "clearml-agent...
2 years ago
this is my code, it was working fine with the same clearML agent and server I don't know why to it gives error
2 years ago
I am trying to do a remote execution of a test task, but it fails during env setup due to trying to install an obscure version of pytorch. Been trying to sol...
Do I understand correctly that python versions must match between client (my mac, sends task for remote execution) and clearml-agent? I don’t really get how ...
When I run a task with Dataset.get the agent requests the dataset from a weird url. adamastor.gaiavf.local in this case. 2022-10-03 17:50:17,556 - clearml.st...
2 years ago
When I set agent management to conda it tries to create envs with python 3.1 and fails. Executing Conda: /home/adamastor/anaconda3/bin/conda create --yes --m...
2 years ago