Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
GrievingTurkey78
Moderator
34 Questions, 125 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

119 × Eureka!
0 Votes
2 Answers
960 Views
0 Votes 2 Answers 960 Views
Hi! What would be the way for manually uploading a model? I have intermediate .pt files which I don't want to upload. Is there a way to turn off clearml capt...
3 years ago
0 Votes
2 Answers
899 Views
0 Votes 2 Answers 899 Views
Hi! Regarding the artifact.get_local_copy() method, since there is no way to specify the path where the artifact will be downloaded, I wanted to confirm that...
4 years ago
0 Votes
2 Answers
952 Views
0 Votes 2 Answers 952 Views
Hi
Hi AgitatedDove14 ! Regarding the Hydra integration, which pattern should be used? Call the task inside the decorated function? Will this store the parameter...
3 years ago
0 Votes
10 Answers
963 Views
0 Votes 10 Answers 963 Views
Hi! I am getting the following error on an agent: /usr/local/bin/python3.8: No module named virtualenv clearml_agent: ERROR: Command '['python3.8', '-m', 'vi...
2 years ago
0 Votes
8 Answers
833 Views
0 Votes 8 Answers 833 Views
Hello πŸ‘‹ I am using a self hosted clearml setup using the requirments file of the project. When I run the task it is failing and I get: Collecting torch==2.0...
one year ago
0 Votes
4 Answers
969 Views
0 Votes 4 Answers 969 Views
Hi! If I have a folder with multiple ckpt files would the manual way to upload them be the following: output_model = OutputModel(task) output_model.update_we...
3 years ago
0 Votes
2 Answers
965 Views
0 Votes 2 Answers 965 Views
Hi ! While restarting the server I got ERROR: for agent-services removal of container 8f1d8539340d6d073eb5b51294f5f5d802048a3614d459b5c4fb1d38a05ce538 is alr...
3 years ago
0 Votes
17 Answers
909 Views
0 Votes 17 Answers 909 Views
3 years ago
0 Votes
30 Answers
979 Views
0 Votes 30 Answers 979 Views
Hi! Is there something happening with the ModelCheckpoint callback on tensorflow==2.4.0 ? Using 2.2.0 gave me an input model on the artifacts tab in the GUI 😒
3 years ago
0 Votes
11 Answers
1K Views
0 Votes 11 Answers 1K Views
Hi! Is there a way to run a task without reporting to the server? For example if I want to debug a script by running it locally without it appearing on the s...
3 years ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
Hi! I am currently using Hydra+ClearML and wanted to know if there are still some updates coming. At the moment, if I change the defaults hydra uses from the...
3 years ago
0 Votes
15 Answers
995 Views
0 Votes 15 Answers 995 Views
Hi
Hi πŸ‘‹ I am trying to set up a trains server on GCP. I followed all the steps listed here https://allegro.ai/docs/deploying_trains/trains_server_gcp/ . I also...
4 years ago
0 Votes
3 Answers
972 Views
0 Votes 3 Answers 972 Views
Hi! I have some ClearML agents on GCP and sometimes the instance seems to reboot making the experiment fail and all the progress is lost. What is the best wa...
2 years ago
0 Votes
21 Answers
980 Views
0 Votes 21 Answers 980 Views
Hi! Any idea why clearml fails to detect iteration reporting? ClearML Monitor: Could not detect iteration reporting, falling back to iterations as seconds-fr...
3 years ago
0 Votes
2 Answers
900 Views
0 Votes 2 Answers 900 Views
Hi! I have the previous trains server configured with multiple experiments; I created it using the gcloud images provided. If I want to update the server to ...
3 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
Hi, is there a way to force the requirements.txt? I have a package I installed directly from github but the version is always wrong. Any other way to do this?
3 years ago
0 Votes
2 Answers
1K Views
0 Votes 2 Answers 1K Views
Hi! I changed from trains to clearml and ran some experiments using keras but it seems the metrics are not being tracked automagically, has anyone ran into t...
3 years ago
0 Votes
2 Answers
966 Views
0 Votes 2 Answers 966 Views
I am trying to upgrade from clearml server 0.16 to the newest version but I am getting some errors when spinning up the new containers: WiredTiger error (-31...
3 years ago
0 Votes
10 Answers
951 Views
0 Votes 10 Answers 951 Views
I am also experiencing a weird behaviour when running a script using the module flag. For example I run: python -m module.script arg1 arg 2And after the scri...
4 years ago
0 Votes
3 Answers
1K Views
0 Votes 3 Answers 1K Views
Hi! I am trying to run some experiments on an agent I have configured to use the requirements.txt the problem is it only shows Cython on the list of installe...
3 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
Hi! I am saving some intermediate .pt files on the experiments and clearml automatically detects them as models, this makes the clearml.model - INFO message ...
3 years ago
0 Votes
9 Answers
1K Views
0 Votes 9 Answers 1K Views
Hi! Does ClearML have a way to turn on/off virtual machines depending if there are experiments on queue?
3 years ago
0 Votes
1 Answers
948 Views
0 Votes 1 Answers 948 Views
Quick question on the clearml-data package, Can I add files to a dataset from google storage instead of having to download them?
3 years ago
0 Votes
2 Answers
941 Views
0 Votes 2 Answers 941 Views
Hi all! Currently I am trying to create a tool that can perform certain operations on dataset ids, this is a skeleton of what I have in mind (based on the ex...
3 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
Hi
Hi πŸ‘‹ I am logging some figures on pytorch lightning using the example here. The figures are correctly saved on Tensorboard's images tab but unfortunately ar...
3 years ago
0 Votes
13 Answers
941 Views
0 Votes 13 Answers 941 Views
4 years ago
0 Votes
4 Answers
973 Views
0 Votes 4 Answers 973 Views
Hi! I am having some problems with a loss after a good amount of training, what would be the best way to log a value to have a better idea of what is happening?
2 years ago
0 Votes
12 Answers
938 Views
0 Votes 12 Answers 938 Views
Hi all! Is there a way for trains to recognize the CLI arguments when using https://github.com/google/python-fire instead of argparse?
4 years ago
0 Votes
5 Answers
953 Views
0 Votes 5 Answers 953 Views
Hi, with the upcoming version of Hydra it seems the binding breaks. Specifically in the run_job function the argument order changed from https://github.com/f...
3 years ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
Hi! If I have a pipeline on gitlab that uses ClearML for some tests is there some way to setup the credentials so that it doesn’t fail?
3 years ago
Show more results questions
0 Hi! I Have Some Clearml Agents On Gcp And Sometimes The Instance Seems To Reboot Making The Experiment Fail And All The Progress Is Lost. What Is The Best Way To Resume An Experiment?

Hey CostlyOstrich36 sorry to ping you! Let's say I enqueue multiple experiments on a couple of agents and one of them fails. Is it possible to restart the experiment from the UI using the latest checkpoint? What if the experiment gets assigned to the other agent? I am not sure how the continue_last_task flag would help in this case.

2 years ago
0 Hello

Yes, I configured it that way πŸ‘Œ Thanks! I'll use the flag!

one year ago
0 Hello

What additional context do you need?

one year ago
0 I Am Also Experiencing A Weird Behaviour When Running A Script Using The Module Flag. For Example I Run:

So should I set them all with a default value? The working dir is the project one, the one that contains the module package

4 years ago
0 Hello

@<1523701070390366208:profile|CostlyOstrich36> Thanks for the help! It ended being a mistake on my side. Misconfigured the VM's memory and it had only 3.75 G. Failed when installing torch.

one year ago
0 Hi! Is There Something Happening With The

Hey AgitatedDove14 does this work for you?
` from argparse import ArgumentParser
from tensorflow.keras import utils as np_utils
from tensorflow.keras.datasets import mnist
from tensorflow.keras.layers import Dense
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.callbacks import ModelCheckpoint

import tensorflow as tf
from clearml import Task

class Linear(tf.keras.Model):
def init(self, in_shape=(784,), num_classes=10):
super().init()
self.l...

3 years ago
0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

I feel it’s easier not to report than cleaning after but please correct me if I am overthinking it. I’ll check if I could wrap the code in something that calls the Task.delete if debugging

3 years ago
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

I set it to 200000 ! But the problem stems from when the first plot is the clearml cpu and gpu monitoring, were you able to reproduce it? Even if I set the number fairly large when the monitoring plot was reported the message appeared.

3 years ago
0 Hi! I Am Saving Some Intermediate

So I would have to disconnect pytorch? And then upload the model at the end

3 years ago
0 Hi ! While Restarting The Server

Thanks SuccessfulKoala55 i’ll try this out!

3 years ago
0 Hi! Is There Something Happening With The

Hi AgitatedDove14 ! Do you have any updates on this?

3 years ago
4 years ago
0 Hi! Regarding The

Thanks for the info AgitatedDove14 !

4 years ago
0 Hi! Is There Something Happening With The

Thanks AgitatedDove14 πŸ™Œ

3 years ago
0 Hi! Is There Something Happening With The

It works perfectly! AgitatedDove14 There is something weird on my side 😒

3 years ago
0 Hello

Managed to get:

clearml_agent: ERROR: Command '['/home/ramon/.clearml/venvs-builds/3.9/bin/python', '-m', 'pip', '--disable-pip-version-check', 'install', '-r', '/var/tmp/requirements_tb0x2i3j.txt', '--extra-index-url', '
 died with <Signals.SIGKILL: 9>.

while building the task with the id on the agent

one year ago
0 Hello

Sure! For torch I have:

torch==2.0.1
    # via
    #   monai
    #   pytorch-lightning
    #   torchio
    #   torchmetrics
one year ago
0 Hi! Is There Something Happening With The

Basically one points to an hdf5 and the other one has no extensiion

3 years ago
0 Hi! Is There Something Happening With The

Thanks Martin! I’ll keep checking πŸ‘Œ

3 years ago
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

CostlyOstrich36 That seemed to do the job! No message after the first epoch, with the caveat of losing resource monitoring. Any idea of what could be causing this? If the resource monitor is the first plot then the iteration detection will fail? Are there any hacks to keep the resource monitoring? Thanks a lot! πŸ™Œ

3 years ago
0 Hi All! Is There A Way For Trains To Recognize The Cli Arguments When Using

Yes, exactly! Unfortunately I am not so familiar with the internals of the library but I could take a look and figure that out.

4 years ago
0 Hi All! Is There A Way For Trains To Recognize The Cli Arguments When Using

I am about to try everything AgitatedDove14 but ran into a gitlab error from the agent, I added the username and password to the configuration file but still get a Host key verification failed . Is it common that the cloning message shows the SSH link instead of the HTTPS when username and password are provided?

4 years ago
0 Hi All! Is There A Way For Trains To Recognize The Cli Arguments When Using

Yes AgitatedDove14 , I added git user name and password on the trains.conf file. On the results tab of the UI the logs clone command shows the SSH command instead of the HTTPS :
Repository cloning failed: Command ['clone', mailto:'git@gitlab.com : ...

4 years ago
Show more results compactanswers