Hello ClearML friends. I'm trying to setup a clearml agent on my workstation to queue jobs on my GPU. $ pip3 install clearml-agent $ clearml-agent init $ cle...
4 years ago
Ok, nvmd. As soon as I spend all the time to write the above message, I figured it out. In case you are curious: I ran the clearml agent in a session before ...
4 years ago
So, here's a question. Does clearml automatically save everything necessary to continue training a pytorch language model? Specifically, I've been looking at...
4 years ago
Hi, Is there a way to stop a clearml-agent from within an experiment? Or block it to prevent it running any other task?
4 years ago
Hey, I’m getting the following error when loading a model using model.get_local_copy() … raise ValueError("Could not retrieve a local copy of model weights {...
4 years ago
Quick one: Having some issues with comparing experiments. Can I only compare experiments if they are clones of the same parent task?
4 years ago
So, I have just started using ClearML for local data and experiment tracking and its been super helpful. Now that I am moving towards deploying and serving t...
4 years ago
Hi everyone! how can i automatically report system resources utilisation?
4 years ago
For some runs of my experiments the ressource monitoring exists, for other it does not. Any idea why this could be the case?
4 years ago
Off topic - Is there a slack / discord community for general ML discussions?
4 years ago
Hi, I am getting an error while running task.mark_stopped() , any idea why? (clearml 1.0.2, clearml-agent 1.0.0, python 3.6) File "/home/machine/.clearml/ven...
4 years ago
Hi all, I'm using clearml 1.0.3 with clearml-server <1 (how do I get the current running version?) In Pytorch-Lightning I use DDP and I see multiple tasks (a...
4 years ago
Hi, I encountered a bug on clearml-server 1.0.1: I tried to add in a project page a custom column in +HYPER PARAMETERS > Args > queue and got an error pop up...
4 years ago
Hi all, I use .get_local_copy() to get a local copy for each of my artifacts logged in a task. I currently have 160 files which I want to get a local copy. E...
4 years ago
Hi, What Is The Right Way Of Syncing A Dataset?
Whenever I Add New Archives And Try To Upload I Get:
Hi, what is the right way of syncing a dataset? Whenever I add new archives and try to upload I get: Error: Task object can only be updated if created or in_...
4 years ago
Hey, one more questions ) after migrate data to new clearml 1.0.2 (from 0.17.0-63, but image version of elastic(7.6.2), mongo(3.6.5), redis(5.0) the same), s...
4 years ago
Hello! I got the idea of publishing model/task. But there could be scenarios when it still should be archived/deleted. For instance death of project. Is it p...
4 years ago
So, I'm trying to do a several-step process, but it needs to run on a GPU queue in ClearML. How would I do that? Specifically, here's what I'm trying to do, ...
4 years ago
Hi, i was running my agent and had a few scripts for agent.extra_docker_shell_script. but when I looked through the logs, they were not executed. Any idea wh...
4 years ago
Hi, would it be possible to parse torch requirement when it’s part of the extras_require dict? In my code, I have the following: train_task._update_requireme...
4 years ago
OK, we've got a GPU Queue setup on one of our local machines. I managed to run a script on it, which was intended to download a clearML dataset stored in s3....
4 years ago
New Channel! After TrickySheep9 asked about discord/slack channels that are centered around ML/DL and data science, I figured that since there are so many of...
4 years ago
Hi, I'm attempting to upgrade my clearml server on offline env. I wish to retain all existing data. Can I check if it suffice to just docker-compose down --r...
4 years ago
any guides on how to aggregate results of different experiments?
4 years ago
Does clearML have an integrated approach to querying model metrics? The objective is to log all model run outputs in order to be able to query them for deplo...
4 years ago
Hi, We have a clearml agent running on an ec2 machine. Until now it worked great. We are two team members using it without a problem via the saas UI. Startin...
4 years ago
hi, i'm wondering why is the agent being terminated after some time? i want it to always be running in the background and listening to a queue, and not have ...
4 years ago
Hi, is it possible to start a clearml-agent (not in docker mode) on a machine with a gpu, but enforce the clearml-agent to not “see” the gpu? So that the exp...
4 years ago
is there any way to export CSV with max metrics and hyperparameters for selected experiments?
4 years ago
Hi, i have been getting the following for a while. Is there a more detailed log i can look into? This happens on both https and http. 2021-05-27 08:47:02,539...
4 years ago