Heya, trying to setup a GCP autoscaler for general-purpose CPU instances ( e2-standard-4 ) but I get into that error: googleapiclient.errors.HttpError: And I...
2 years ago
I have a general question about this part in dynamic GPU allocation. If for example I have a machine that has 8 GPUs and I have 3 queues: Queue1 will take 3G...
2 years ago
Hey there, since a bit I often find experiments being stuck while training a model. It seems to happen randomly and I could not find a reproducible scenario ...
2 years ago
why it is not running if I specify the agent on the docker mode , usually I just run "clearml-agent daemon" and it works but today I am trying "clearml-agent...
2 years ago
this is my code, it was working fine with the same clearML agent and server I don't know why to it gives error
2 years ago
I am trying to do a remote execution of a test task, but it fails during env setup due to trying to install an obscure version of pytorch. Been trying to sol...
Do I understand correctly that python versions must match between client (my mac, sends task for remote execution) and clearml-agent? I don’t really get how ...
When I run a task with Dataset.get the agent requests the dataset from a weird url. adamastor.gaiavf.local in this case. 2022-10-03 17:50:17,556 - clearml.st...
2 years ago
When I set agent management to conda it tries to create envs with python 3.1 and fails. Executing Conda: /home/adamastor/anaconda3/bin/conda create --yes --m...
2 years ago
Why does my task execution freeze after pip installation (running agent in foreground mode)? The task is: from clearml import Task task = Task.init(project_n...
2 years ago
Hey, loving ClearML so far. I create an agent with 1 gpu and I am sending a task to it. But it says that it couldn’t create a docker with gpu access. How can...
2 years ago
I have a question regarding running the code directly in the agent without running it on my local device. How can I do that? (usually, I run the code in my l...
2 years ago
I have a question regarding running a model on the agent with an external dataset, I have the dataset in my local device and I connected it with the clearML ...
2 years ago
I didnt spesfy the mode of the agent I just wrote "clearml-agent daemon"
2 years ago
I am trying to use ClearML in my work and I am facing some problems so could anyone help me with that? I have connected my workstation with the clearML serve...
2 years ago
Hi, I am trying to use agent with a sample, very simple task. But it stucks and task does not finish. In UI in console I see what I pasted on image. Do you k...
2 years ago
Hello everyone! I setup a GCP autoscaler on my Pro SaaS deployment but I keep getting this error: clearml_agent: ERROR: Server does not support --use-owner-t...
2 years ago
When using a TriggerScheduler with a add_task_trigger and schedule_function , how would I go about updating the trigger and function without a new task/exper...
2 years ago
I encountered a weird edge case with the AWS Auto-scaler, wondering if there are any solutions or if this is a known issue. Something as follows happened: Th...
2 years ago
Hi, What is the best way to get the agent to install a dependency from github. I have tried this in this sample script: import clearml clearml.Task.add_requi...
2 years ago
I'm trying to configure the glue agent to use AWS ECR via helm charts. Below is my configuration. It is not pulling the image though, it is failing with K8S ...
2 years ago
Hello. I am running clearml server and agents in k8s using the helm charts. The clearml server came preconfigured with the 2 queues: 'default' and 'k8s_sched...
2 years ago
Hi, I am trying to use the aws autoscaler to assign instance profiles to new machines. This is a better way than managing credentials. I added the configurat...
Hello everyone, how do I tell the agent that it needs to install a local module of the repo? If I put git+ in the requirements it will install the module ver...
2 years ago
In a nutshell, what do I need for the clearML agent to scale EC2 nodes in the k8 cluster, in terms of helm configuration? I assume AWS credentials, is there ...
2 years ago
Hi everyone, im trying to use the aws autoscaler service. Provided the pac but is not able to clone the repo. It is not using the pac (using gitlab)
Can I run an autoscaler listening to a regular queue (i.e. combine autoscaling with on premise machines)? Or does it run the danger of creating an EC2 instan...
2 years ago
I am trying to run a task that is completely detached from git - remotely. The script uploads fine but in the UI, the git repo appears as “origin”. When the ...
2 years ago
Hi! I am running a code from repository, which is cloned by the following command: git clone ssh://git @ :5109.git but ClearML Web UI shows that repository i...
2 years ago