Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
GrievingTurkey78
Moderator
34 Questions, 125 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

119 × Eureka!
0 Votes
2 Answers
942 Views
0 Votes 2 Answers 942 Views
Hi ! While restarting the server I got ERROR: for agent-services removal of container 8f1d8539340d6d073eb5b51294f5f5d802048a3614d459b5c4fb1d38a05ce538 is alr...
3 years ago
0 Votes
3 Answers
955 Views
0 Votes 3 Answers 955 Views
Hi! I have some ClearML agents on GCP and sometimes the instance seems to reboot making the experiment fail and all the progress is lost. What is the best wa...
2 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
Hi! I was taking a look at the https://pytorch-lightning.readthedocs.io/en/latest/common/lightning_cli.html and wanted to know if anyone has used clearml wit...
3 years ago
0 Votes
9 Answers
1K Views
0 Votes 9 Answers 1K Views
Hi! Does ClearML have a way to turn on/off virtual machines depending if there are experiments on queue?
3 years ago
0 Votes
2 Answers
886 Views
0 Votes 2 Answers 886 Views
Hi! I have the previous trains server configured with multiple experiments; I created it using the gcloud images provided. If I want to update the server to ...
3 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
Hi
Hi πŸ‘‹ I am logging some figures on pytorch lightning using the example here. The figures are correctly saved on Tensorboard's images tab but unfortunately ar...
2 years ago
0 Votes
1 Answers
931 Views
0 Votes 1 Answers 931 Views
Quick question on the clearml-data package, Can I add files to a dataset from google storage instead of having to download them?
3 years ago
0 Votes
2 Answers
916 Views
0 Votes 2 Answers 916 Views
Hi all! Currently I am trying to create a tool that can perform certain operations on dataset ids, this is a skeleton of what I have in mind (based on the ex...
3 years ago
0 Votes
2 Answers
883 Views
0 Votes 2 Answers 883 Views
Hi! Regarding the artifact.get_local_copy() method, since there is no way to specify the path where the artifact will be downloaded, I wanted to confirm that...
4 years ago
0 Votes
7 Answers
970 Views
0 Votes 7 Answers 970 Views
Hi! I am trying to download data from GS using StorageManager.get_local_copy() . It works fine when I point it to a file i.e gs://bucket/dataset/image.png bu...
4 years ago
0 Votes
17 Answers
890 Views
0 Votes 17 Answers 890 Views
3 years ago
0 Votes
11 Answers
991 Views
0 Votes 11 Answers 991 Views
Hi! Is there a way to run a task without reporting to the server? For example if I want to debug a script by running it locally without it appearing on the s...
3 years ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
Hi! I am currently using Hydra+ClearML and wanted to know if there are still some updates coming. At the moment, if I change the defaults hydra uses from the...
3 years ago
0 Votes
5 Answers
935 Views
0 Votes 5 Answers 935 Views
Hi, with the upcoming version of Hydra it seems the binding breaks. Specifically in the run_job function the argument order changed from https://github.com/f...
3 years ago
0 Votes
12 Answers
915 Views
0 Votes 12 Answers 915 Views
Hi all! Is there a way for trains to recognize the CLI arguments when using https://github.com/google/python-fire instead of argparse?
4 years ago
0 Votes
4 Answers
953 Views
0 Votes 4 Answers 953 Views
Hi! If I have a folder with multiple ckpt files would the manual way to upload them be the following: output_model = OutputModel(task) output_model.update_we...
2 years ago
0 Votes
10 Answers
950 Views
0 Votes 10 Answers 950 Views
Hi! I am getting the following error on an agent: /usr/local/bin/python3.8: No module named virtualenv clearml_agent: ERROR: Command '['python3.8', '-m', 'vi...
2 years ago
0 Votes
13 Answers
921 Views
0 Votes 13 Answers 921 Views
4 years ago
0 Votes
2 Answers
939 Views
0 Votes 2 Answers 939 Views
Hi
Hi AgitatedDove14 ! Regarding the Hydra integration, which pattern should be used? Call the task inside the decorated function? Will this store the parameter...
3 years ago
0 Votes
15 Answers
977 Views
0 Votes 15 Answers 977 Views
Hi
Hi πŸ‘‹ I am trying to set up a trains server on GCP. I followed all the steps listed here https://allegro.ai/docs/deploying_trains/trains_server_gcp/ . I also...
4 years ago
0 Votes
30 Answers
961 Views
0 Votes 30 Answers 961 Views
Hi! Is there something happening with the ModelCheckpoint callback on tensorflow==2.4.0 ? Using 2.2.0 gave me an input model on the artifacts tab in the GUI 😒
3 years ago
0 Votes
3 Answers
918 Views
0 Votes 3 Answers 918 Views
Hi! I recently updated my server and my clearml version, now when I set a task to be executed remotely its default state is aborted hence I have to reset and...
3 years ago
0 Votes
2 Answers
992 Views
0 Votes 2 Answers 992 Views
Hi! I changed from trains to clearml and ran some experiments using keras but it seems the metrics are not being tracked automagically, has anyone ran into t...
3 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
Hi! I am saving some intermediate .pt files on the experiments and clearml automatically detects them as models, this makes the clearml.model - INFO message ...
3 years ago
0 Votes
2 Answers
953 Views
0 Votes 2 Answers 953 Views
I am trying to upgrade from clearml server 0.16 to the newest version but I am getting some errors when spinning up the new containers: WiredTiger error (-31...
3 years ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
Hi! If I have a pipeline on gitlab that uses ClearML for some tests is there some way to setup the credentials so that it doesn’t fail?
3 years ago
0 Votes
3 Answers
994 Views
0 Votes 3 Answers 994 Views
Hi! I am trying to run some experiments on an agent I have configured to use the requirements.txt the problem is it only shows Cython on the list of installe...
3 years ago
0 Votes
4 Answers
945 Views
0 Votes 4 Answers 945 Views
Hi! I am having some problems with a loss after a good amount of training, what would be the best way to log a value to have a better idea of what is happening?
2 years ago
0 Votes
10 Answers
930 Views
0 Votes 10 Answers 930 Views
I am also experiencing a weird behaviour when running a script using the module flag. For example I run: python -m module.script arg1 arg 2And after the scri...
4 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
Hi, is there a way to force the requirements.txt? I have a package I installed directly from github but the version is always wrong. Any other way to do this?
3 years ago
Show more results questions
2 years ago
0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

AgitatedDove14 Downloading a dataset would not be possible using this right? I want to be able to access the data just avoid reporting the experiment results

3 years ago
0 Hi! Is There Something Happening With The

Thanks AgitatedDove14 πŸ™Œ

3 years ago
0 Hi

Hi AgitatedDove14 thanks for your reply, with the dashboard I meant the Web-App (UI) . I am trying to access http://<External IP>:8080 but unfortunately nothing shows up.

4 years ago
0 Hi! Is There Something Happening With The

Hey AgitatedDove14 after playing around seems that if the callback filepath points to an hdf5 file it is not uploaded.

3 years ago
0 Hi All! Is There A Way For Trains To Recognize The Cli Arguments When Using

AgitatedDove14 I filed an issue of fire for them to point us to the argument parsing method https://github.com/google/python-fire/issues/291

4 years ago
0 Hi! Is There Something Happening With The

I changed it to point to a folder and it shows up

3 years ago
0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

I feel it’s easier not to report than cleaning after but please correct me if I am overthinking it. I’ll check if I could wrap the code in something that calls the Task.delete if debugging

3 years ago
0 Hi! Is There Something Happening With The

Basically one points to an hdf5 and the other one has no extensiion

3 years ago
0 Hi All! Is There A Way For Trains To Recognize The Cli Arguments When Using

I am about to try everything AgitatedDove14 but ran into a gitlab error from the agent, I added the username and password to the configuration file but still get a Host key verification failed . Is it common that the cloning message shows the SSH link instead of the HTTPS when username and password are provided?

4 years ago
0 Hi! If I Have A Folder With Multiple

Yes, the files are inside the folder

2 years ago
0 Hi! Is There Something Happening With The

Hi AgitatedDove14 ! Do you have any updates on this?

3 years ago
0 I Am Also Experiencing A Weird Behaviour When Running A Script Using The Module Flag. For Example I Run:

So should I set them all with a default value? The working dir is the project one, the one that contains the module package

4 years ago
0 Hi All! Is There A Way For Trains To Recognize The Cli Arguments When Using

Yes, it’s similar; somewhat more automatic since it detects the classes of functions arguments and generates the CLI. What do you mean by that AgitatedDove14 get all the parameters and use task.connect ?

4 years ago
0 Hi, I Was Getting A Really Weird Error Due To Mismatch On The Versions Between The Installed Libraries In My Environment And The Ones Ran In The Node (I Manually Changed The Installed Packages And Everything Worked). How Can I Force Trains To Use Exactly

No, I have all the packages with a version. I just want to know if there is a way to override the requirements versions detected by Pigar when using detect_with_pip_freeze: false . I have locally cloudpickle==1.4.1 but when running the code and sending the task to the node the environment uses cloudpickle==1.6.0 . I have to manually change the version on the UI. Is there a way to force this single package to have a version? Maybe on the requirments.txt or something similar

4 years ago
0 Hi! I Am Getting The Following Error On An Agent:

Not yet AgitatedDove14 , does the agent use by default the python version the command is run with? I installed conda and tried using package_manager.type=conda but then get an error:
clearml_agent: ERROR: 'NoneType' object has no attribute 'lower'

2 years ago
0 Hi! I Am Getting The Following Error On An Agent:

Give me a couple of minutes πŸ™Œ

2 years ago
0 Hi

Thanks SuccessfulKoala55 I’ll give it a try!

4 years ago
0 Hi! If I Have A Folder With Multiple

My bad :man-facepalming: It was just specifying weights_path=dirpath since the first argument is weights_filename

2 years ago
0 Hello

It is failing exactly when the download finishes. Not sure if it is something but on the ~/.clearml/pip-download-cache only a cu120 empty folder appears. Should the torch wheel be saved there?

one year ago
0 Hi! I Have Some Agents On Gcp. Lately I Have Been Getting Some Experiments That Simply Stop Running (No Signs That The Experiment Crashed). Here Is A Plot That Shows The Resource Monitoring. Any Ideas On What Could Be Causing This?

Hey CostlyOstrich36 ! I am using clearml==1.1.2 and clearml-agent==1.1.0 . Stopped is not the right word, more like frozen, it just froze at an epoch. The console on the agent shows epoch 33 first batch and the one at the server epoch 32 last batch. The experiment was running for ~6 hours.

3 years ago
0 Hello

@<1523701070390366208:profile|CostlyOstrich36> Thanks for the help! It ended being a mistake on my side. Misconfigured the VM's memory and it had only 3.75 G. Failed when installing torch.

one year ago
0 Hello

What additional context do you need?

one year ago
0 Hello

Yes, I configured it that way πŸ‘Œ Thanks! I'll use the flag!

one year ago
Show more results compactanswers