Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
DilapidatedParrot58
Moderator
42 Questions, 205 Answers
  Active since 10 January 2023
  Last activity 2 years ago

Reputation

0

Badges 1

186 × Eureka!
0 Votes
6 Answers
2K Views
0 Votes 6 Answers 2K Views
we just had a slight problem - there was a double space in S3 checkpoint name, but ClearML UI prints them as one in the model description. if you copy and pa...
2 years ago
0 Votes
5 Answers
2K Views
0 Votes 5 Answers 2K Views
is there any way to post Slack alerts for the frozen experiments? (eg, after server restart they sometimes get stuck in Running mode, or https://github.com/p...
4 years ago
0 Votes
7 Answers
2K Views
0 Votes 7 Answers 2K Views
4 years ago
0 Votes
0 Answers
2K Views
0 Votes 0 Answers 2K Views
hey guys, do you have any tutorials or examples of intergration with dvc?
5 years ago
0 Votes
16 Answers
2K Views
0 Votes 16 Answers 2K Views
3 years ago
0 Votes
3 Answers
2K Views
0 Votes 3 Answers 2K Views
here I am again... can't find how to create a custom queue
5 years ago
0 Votes
16 Answers
2K Views
0 Votes 16 Answers 2K Views
yo guys, I'm getting Retrying (Retry(total=2, connect=2, read=5, redirect=5, status=None)) after connection broken by 'ConnectTimeoutError(, 'Connection to O...
5 years ago
0 Votes
7 Answers
2K Views
0 Votes 7 Answers 2K Views
there is something weird going on with console log after latest updates of ClearML Server. it doesn't show the latest updates, instead it often jumps to the ...
2 years ago
0 Votes
11 Answers
2K Views
0 Votes 11 Answers 2K Views
4 years ago
0 Votes
27 Answers
2K Views
0 Votes 27 Answers 2K Views
hey guys, I keep getting trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?...
4 years ago
0 Votes
5 Answers
2K Views
0 Votes 5 Answers 2K Views
4 years ago
0 Votes
7 Answers
2K Views
0 Votes 7 Answers 2K Views
any chance StorageManager could re-download files only if their size is different from file in cache (as an option)?
4 years ago
0 Votes
2 Answers
2K Views
0 Votes 2 Answers 2K Views
is there any way to export CSV with max metrics and hyperparameters for selected experiments?
4 years ago
0 Votes
8 Answers
2K Views
0 Votes 8 Answers 2K Views
downloading output artifacts from S3 by clicking on the download button next to Model URL was great, but since we moved from AWS to Yandex.Cloud, this featur...
2 years ago
0 Votes
18 Answers
2K Views
0 Votes 18 Answers 2K Views
I updated trains-server today, and now it's very unstable, Web interface randomly stops working. anyone had the same problem? I've never had any problems wit...
5 years ago
0 Votes
6 Answers
2K Views
0 Votes 6 Answers 2K Views
hey guys, here I am again with another question 😃 after the latest update, I’m getting this error when I’m trying to compare scalars for more than 10 experi...
5 years ago
0 Votes
8 Answers
2K Views
0 Votes 8 Answers 2K Views
4 years ago
0 Votes
5 Answers
2K Views
0 Votes 5 Answers 2K Views
Step 3 Task ( https://github.com/allegroai/trains/blob/master/examples/pipeline/step3_train_model.py ) - Loads the processed data (from Step 2) and clearml a...
4 years ago
0 Votes
2 Answers
2K Views
0 Votes 2 Answers 2K Views
2 years ago
0 Votes
11 Answers
2K Views
0 Votes 11 Answers 2K Views
hey guys, do you have any plans to add functionality to export training config with all hyperparameters to the different formats, such as training command li...
5 years ago
0 Votes
10 Answers
2K Views
0 Votes 10 Answers 2K Views
what is the right way to increase number of retries when using StorageManager.get_local_copy?
3 years ago
0 Votes
8 Answers
2K Views
0 Votes 8 Answers 2K Views
5 years ago
0 Votes
5 Answers
2K Views
0 Votes 5 Answers 2K Views
hey guys, a question about monthly worker_stats indices each of them takes up about 1gb for us. do we really need to keep all of them? is there any way to fr...
5 years ago
0 Votes
11 Answers
2K Views
0 Votes 11 Answers 2K Views
hey guys, is there a ready script that can delete all models from S3 (or other storage) that are related to deleted or archived experiments?
4 years ago
0 Votes
14 Answers
2K Views
0 Votes 14 Answers 2K Views
hey guys the first time I'm seeing this behavior I'm adding a new user to /opt/trains/config/apiserver.conf and restarting the containers. all old users are ...
5 years ago
0 Votes
4 Answers
2K Views
0 Votes 4 Answers 2K Views
feature request: ClearML prints GitHub token in the log, when there is "repository not found" error. it would be nice if could hide it
4 years ago
0 Votes
30 Answers
2K Views
0 Votes 30 Answers 2K Views
is is possible to pass custom https://clear.ml/docs/latest/docs/configs/env_vars/ to ClearML agents?
3 years ago
0 Votes
7 Answers
2K Views
0 Votes 7 Answers 2K Views
I'm getting A LOT of errors when running cleanup service Failed deleting the following URIs - script fails to delete image and text files ERROR - Failed dele...
3 years ago
0 Votes
6 Answers
2K Views
0 Votes 6 Answers 2K Views
hey guys, I keep getting "Failed parsing task parameter" warning for the arguments such as this one: parser.add_argument( "--dataset_mean", type = float, nar...
4 years ago
0 Votes
3 Answers
2K Views
0 Votes 3 Answers 2K Views
2 years ago
Show more results questions
0 Hey Guys! I'Ve Got The Latest Version Of Trains 0.16.0 And Now I Have A Problem. In Previous Versions I Could Easily Override Default Arguments On Hyperparameters  Tab And Now After Editing The Arguments Values With The New Ones And Executing The Experime

same here, changing arguments in the Args section of Hyperparameters doesn’t work, training script starts with the default values.

trains 0.16.0
trains-agent 0.16.0
trains-server 0.16.0

5 years ago
0 Hey Guys, I Keep Getting "Failed Parsing Task Parameter" Warning For The Arguments Such As This One:

on the side note, is there any way to automatically give more meaningful names to the running docker containers?

4 years ago
0 We Just Had A Slight Problem - There Was A Double Space In S3 Checkpoint Name, But Clearml Ui Prints Them As One In The Model Description. If You Copy And Paste It, The Address Will Be Wrong

thanks! we copy S3 URLs quite often. I know that it’s better to avoid double spaces in task names, but shit happens 😄

2 years ago
0 Hey Guys, A Question About Monthly Worker_Stats Indices Each Of Them Takes Up About 1Gb For Us. Do We Really Need To Keep All Of Them? Is There Any Way To Free Up The Space?

yeah, backups take much longer, and we had to increase our EC2 instance volume size twice because of these indices

got it, thanks, will try to delete older ones

5 years ago
0 Hey Guys, Do You Have Any Plans To Add Functionality To Export Training Config With All Hyperparameters To The Different Formats, Such As Training Command Line Command, Yaml, Etc.?

yeah, I am aware of trains-agent, we are planning to start using it soon, but still, copying original training command would be useful

5 years ago
0 When We Train The Models, We Often Choose Checkpoint Based On The Validation Accuracy, But Test Set Accuracy (Or Specific Class Validation Accuracy) Is Not Necessarily The Best For This Checkpoint. Right Now There Are Options To Add Columns With Max And L

I guess, this could overcomplicate ui, I don't see a good solution yet.

as a quick hack, we can just use separate name (eg "best_val_roc_auc") for all metric values for the current best checkpoint. then we can just add columns with the last value of this metric

4 years ago
0 Hey Guys, I'M Trying To Run An Experiment Using Trains-Agent. I Have A Custom Docker Image With Nightly Versions Of Pytorch And Our Own Library Installed From A Private Repo. I Was Assuming That These Packages Will Be Automatically Available To Trains Dur

I added the link just in case anyway 😃

also, is there any way to install a repo that we clone as a package. we often use absolute imports and do "pip install -e ." to utilize it
sorry there are so many questions, we just really want to migrate to trains-agent)

5 years ago
0 Hey Guys, Here I Am Again With Another Question

fantastic, everything is working perfectly
thanks guys

5 years ago
0 Hey Guys, Is There A Ready Script That Can Delete All Models From S3 (Or Other Storage) That Are Related To Deleted Or Archived Experiments?

we're using the latest version of clearml, clearml agent and clearml server, but we've been using trains/clearml for 2.5 years, so there are some old tasks left, I guess 😃

4 years ago
0 Hey Guys, I Keep Getting

WARNING: You are using pip version 20.1.1; however, version 20.3.3 is available.
You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.

trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?

http://OUR_IP:8081 http://OUR_IP:8080 http://apiserver:8008
WARNING: You are using pip version 20.1.1; however, version 20.3.3 is available.
`...

4 years ago
0 Hey Guys, I Keep Getting

nope, old clenup task fails with trains_agent: ERROR: Could not find task id=e7725856e9a04271aab846d77d6f7d66 (for host: )
Exception: 'Tasks' object has no attribute 'id

weirdly enough, curl http://apiserver:8008 from inside the container works

4 years ago
0 Hey Guys, I Keep Getting

problem is solved. I had to replace /opt/trains/data/fileserver to /opt/clearml/data/fileserver in Agent configuration, and replace trains to clearml in Requirements

4 years ago
0 Hey Guys, I Keep Getting

new version worked

4 years ago
0 Hey Guys, I Keep Getting

do you have any idea why cleanup task keeps failing then (it used to work before the update)

4 years ago
Show more results compactanswers