Hi, I cannot manage to start trains-server 0.16 with the docker-compose file, the trains-elastic container fails with the following error:
5 years ago
Hi, if I am starting my training with the following command: python -u -m torch.distributed.launch --nproc_per_node=2 --use_env train.py --config configs/tra...
3 years ago
Hey, I have one question regarding the cleanup_service task in the DevOps project: Does it assume that the agent in services mode is in the trains-server mac...
5 years ago
Hi, I know that ClearML uses local changes for patching and running script. Can it also do the same with local commits?
2 years ago
My goal is to deploy it and have a blog post detailing the instructions
11 months ago
I had a good look at all the introduction video on youtube and had some questions. Context: if we are going to deploy and maintain ClearML servers our self i...
2 years ago
hi, i want to run a script remotely on my agent, but for it to work i need it to download to the agent the whole directory the script is in, is it possible?
4 years ago
Hello Admin, There are not enough videos about MLOPS hands on projects on YouTube. Please add more content.
one year ago
Hello Admin, I am unable to create pipeline using CPU/GPU/TPU. What is the issue?
one year ago
👋 Hi everyone! Do I have to create new credentials for each colab notebook?
one year ago
Hi everyone! ✋ I'm really enjoying ClearML 🎉 Currently I'm finding myself repeating the following a bit too often and as wondering if there is a way for Cle...
2 years ago
Hello, Any idea how to log tables with hyperlinks? When logging dataframes using report_table I get tables in the plots section. I would like the cell conten...
10 months ago
Typo: Was going crazy for a short amount of time yelling to myself: I just installed clear-agent init!
4 years ago
Has anybody had success serving a ClearML Server with Traefik HTTPS tunneling?
11 months ago
In a self hosted server I am getting a similar issue to this: None The agent-services keeps restarting repetitively. In the docker logs I am getting this err...
10 months ago
Anyone have info on how could i migrate my clearml data (models, datasets) from one GCP bucket to another ?
10 days ago
Just want to share with you something that really excited me - I'm doing a workshop for ClearML today (intro only, nothing that will be shocking for you guys...
4 years ago
Hello, in the following context: controller_task = Task.init(...) # This will clone the parent task, enqueue and wait for finished status data_processing_tas...
5 years ago
Hi all, we have clearml-server running on a kube pod, and then a GPU server running the clearml-agent which we use to queue jobs. For some reason, our kube p...
11 months ago
I'm experiencing some weird behavior from the automatic logging iterations. It seem to be capped at the number of batches rather than the epochs. How can I c...
4 years ago
Hi all - I have a large dataset and have preprocessed it and saved each item in .pt files, which are loaded using torch.load in my Dataset . The issue is tha...
11 months ago
Hi all, I have deployed a ClearML server on Linux VM and I am having an issue with logging plots. Specifically, a plot is logged, but in the UI it does not l...
9 months ago
Hi, is there a way to query some tasks ordered by a scalar metric? I tried with: Task.get_tasks( project_name='project', task_name='task', task_filter={'orde...
2 years ago
Hello, does anybody know what triggers a new model to be added in a project (working in Pytorch) ? I'm new to trains and adding it to my script generated a h...
5 years ago
Hi there, I have several experiments hanging/stuck in the middle or at the end of the training, with the last message logged being: train INFO: Engine run co...
one year ago
Hello everyone, I'm currently working on comparing plots from experiments within the ClearML web UI. Each of these plots displays a series of true values and...
2 years ago
Hi, if i want to report image with detection results, but it will take up too much hard disk space by upload or save new pic with detection results, is there...
5 years ago
Hi, I'm trying to configure an agent. After requesting to paste the credentials, I get: Error: could not verify credentials. What could be the reason for that?
5 years ago
Is there anyone who is using ClearML in a Jupyter Notebook. It looks like when using execute_remotely together with a jupyter noteebok, clearml tries to laun...
2 years ago
Hey! I'm having a weird issue when I run pip freeze locally it's showing version "clearml==0.17.5rc6" but when I initiate the task it's always starting with ...
4 years ago