Hi there CostlyOstrich36 , We have two agents, each with its own queue. These agents are running on a Kubernetes cluster, so t...
2 days ago
Hey team, I had a question about executing the pipelines using clearml agent setup in K8s. When we define a pipeline script to be executing remotely and subm...
7 days ago
How to configure ClearML agent to keep pods around after they finish/fail? I want to debug a pod that crashes, but it gets deleted quickly
11 days ago
Hi, When using yolo in a Docker based clearml agent, I get the error below. How do I need to setup clearml agent to have access to CV2? Starting Task Executi...
27 days ago
Hi everyone, I’m testing ClearML and encountered an issue when launching the agent in docker-mode: it seems to ignore additional Docker arguments. For exampl...
29 days ago
hello! i had trouble running clearml-agent on k8s. i fixed it by modifying the helm chart to allow specifying runtimeClassName (which is needed when using nv...
one month ago
Hi. I want to run clearml-agent inside a docker container. with the default image in my clearml.conf file, it is "image: "nvidia/cuda:11.0.3-cudnn8-runtime-u...
one month ago
hey, I'm using clearml GCP autoscaler and it seems that task.connect is very slow compared to the same setup in clearml AWS autoscalers. it might takes up to...
one month ago
Hi I got issue when I am trying to mount EBS to AWS EC2 instance when running ClearML pipeline, I've checked the dev/sdb is dev/nvme1n1 in ec2 and I was usin...
one month ago
Hello everyone, I deployed clearml and clearml-agent on k8s and I can't get the agent to use the gpus. I have 3 machines with a GPU each correctly configured...
one month ago
Hi, we are migrating from AWS to GCP machines and we experience issues with task.connect function. the issue is that on GCP machines that are spawn by the au...
👋 Hi everyone! Did anyone have this issue with clearml agent in k8s when trying to run a task remotely? │ k8s-glue ERROR: Could not push back task [11d1b317...
one month ago
Hi folks! I’m looking to migrate a clearml server setup from using local storage to Azure blob storage. I am able to get data to upload to azure, and the cle...
one month ago
Hi everyone, I'm experiencing an issue with ClearML running on K8S. After upgrading the ClearML server helm chart from version 7.11.5, I'm seeing the followi...
one month ago
Hello We are using enterprise Version and the question is about the Agents and the queues We know that we can configure container image at the agent level Th...
2 months ago
Hi everyone, I want to clone a git repo with clearml-agent that has to following url format https://: ACCESS_TOKEN
2 months ago
Hi everyone ! I am trying to launch multigpu training on remote clearm agent and stuck, because when i do accelerate launch train.py remote agent doesn’t ove...
2 months ago
Hi all, I was using the cleaning script, but it's (I think the elasticsearch, since it's a java process) consuming a lot of memory. So much that it's blockin...
2 months ago
Hi, I would like to know from you, maybe someone has encountered the problem that after deploying an agent inside Docker, the launch of the script itself occ...
2 months ago
Hi all, I'm trying to set up aws autoscaler to spin up ec2 instances from predefined ami so I was able to set up the autoscaler, but i am experiencing some i...
Hello! I am new to clearML and trying to set it up for my org, and have a question regarding git repository. So from what I understood, clearml agent spins u...
2 months ago
Any tips running my first tasks in clearml-agent daemon --docker ? My task is configured with my tried-and-true Docker image (from before using ClearML) with...
3 months ago
Hi everyone, I noticed that ES database of my local server instance is taking up huge amount of RAM (>32GB RAM usage for a ~80GB disk space). There is enough...
3 months ago
Hi everyone, I want to use clearml agent on my remote machine, where I've setup my codebase, along with cred files, dataset dir, dockerfile and docker run sc...
3 months ago
I set disable_ssh_mount: true, however the agent (running in docker mode) constantly mounts it. The issue here is that it alters authorized_keys (adds root
4 months ago
Hello, what is best practice for spinning up the ClearML agent after "remote machine" reboot?
4 months ago
Hi! How to correctly configure Clearml with PyTorch-Ignite to write checkpoints to the /mnt/nfs/shared Project Dir in a 3-agent cluster? I tried this task = ...
4 months ago
hi again, hoping someone can shed some light on my workflow planning. Im rebuilding my project from scratch in clearml, using remote gitlab, i have my person...
4 months ago
hi all, just started tinkering with clearml and wondering if this is suitable to deploy an ETL/ELT pipeline using the agent on my 2nd machine? or is clearml ...
4 months ago
I tried using clearml.automaton.TriggerScheduler in the following way: """Add a trigger to test the model once it is published.""" from clearml.automation im...
4 months ago