Hi, we are migrating from AWS to GCP machines and are experiencing issues with the task.connect function. The issue is that on GCP machines that are spawned by the au...
Hey Guys, Context: I was able to host the ClearML server (web server, file server, API server) in an AWS EC2 instance via terraform IaC and GitHub actions an...
8 months ago
👋 Hi everyone! Did anyone have this issue with clearml agent in k8s when trying to run a task remotely? │ k8s-glue ERROR: Could not push back task [11d1b317...
8 months ago
Hello Team, I am an enterprise user looking for help related to the model serving capability. I followed the tutorial to create the model, push it to the model regis...
8 months ago
Hey everyone! We're excited to announce the launch of our new documentation website! We've overhauled it, making your experience even smoother and more effi...
8 months ago
Hey Guys, noob question/problem ahead, so please forgive me in advance 🫠 Context: I was able to host the ClearML server (web server, file server, API server...
8 months ago
My linter (the default VS Code Python extension) doesn't understand TaskInstance = TypeVar("TaskInstance", bound="Task"). If I type clearml.Task.current_task...
8 months ago
Has anyone else benchmarked ClearML? I'm seeing catastrophic logging overhead: None for 50 numbers, ClearML's logging takes 3x as long as my entire LLM train...
8 months ago
Hi guys, I am trying to upload and serve a pre-existing third-party PyTorch model inside my ClearML cluster. However, after proceeding with the suggested seque...
8 months ago
Hi, I am trying to execute a pipeline with PipelineController and define it like this: pipe = PipelineController(name=config["clear_ml"]["pipeline_name"], project ...
8 months ago
Hi, is there a way to wait until a dataset finishes uploading before proceeding? I want to upload the dataset if it does not already exist and then process the ...
8 months ago
Hi, has anything changed in ClearML? I saw the web UI was updated, and since then I have been experiencing the package-not-found issue shown in the log. I have th...
8 months ago
How does clearml-agent install an -e Python package? I have a homemade Python package that I just moved to pyproject.toml, which I am not familiar with, so not ...
8 months ago
What is the recommended way of passing environment variables to Kubernetes pods executed using ClearML pipelines? I know I could create a “global” Kubernetes...
8 months ago
Hi, I'm using the Hugging Face Trainer. Is there a way to capture grad_norm per layer? Thanks!
8 months ago
Hello! I am trying to use an early-stopping or pruning mechanism in hyperparameter optimization. Are there any such mechanisms with OptimizerOptuna c...
8 months ago
Hi, I tried to build my own image and uploaded it to Harbor in order to use it as an image for pipelines and tasks, but the tasks didn't use the packages fro...
8 months ago
Hello, a question about pipelines. I have a repository with one pipeline using decorators, defined in pipeline.py . It uses multiple components that import c...
9 months ago
Hi everyone, I'm experiencing an issue with ClearML running on K8S. After upgrading the ClearML server helm chart from version 7.11.5, I'm seeing the followi...
8 months ago
Hi folks! I’m looking to migrate a clearml server setup from using local storage to Azure blob storage. I am able to get data to upload to azure, and the cle...
8 months ago
Hi everyone! I want to set up uv caching on the 1.9.3 agent, but I don't understand where I should add UV_CACHE_DIR in my clearml config. I have tried several opti...
8 months ago
Hi guys, I'm running into an issue when creating a new ClearML dataset version. I want to create a new version of a dataset from a local folder, but I don't want al...
8 months ago
Hi, I would like to know whether anyone has encountered the problem that, after deploying an agent inside Docker, the launch of the script itself occ...
9 months ago
Hi, I see some strange behavior where the training fails when running on ClearML (loss = nan) compared to running without ClearML. This is entirely reproducible...
8 months ago
Does anyone have experience with integrating ClearML and Slurm? If so, what pattern did you use? (Did you submit tasks and just use ClearML as a tracker, or di...
9 months ago
Sporadic failure to retrieve Scalars and Console logs. Context: self-hosted in Azure with 2 separate Azure Container Apps for the UI and API server. ElasticSe...
8 months ago
Hello, I wanted to test out clearml and installed the 7.14.2 helm chart. The deployment looks ok. I can access the webui and create workspace API credentials...
8 months ago
Hi all. After rebooting the server following a "no space on disk" error, I cannot see the plots of one of the experiments.
8 months ago
I'm using the Docker runtime to run clearml-agent. When I enqueue the task through the web UI, I get errors related to invalid git credentials. How do I make ...
8 months ago
Hi, I am wondering: after a task submitted to a remote server finishes running, will the Docker container / disk space (really I am more interested in the d...
8 months ago