Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
DilapidatedParrot58
Moderator
42 Questions, 205 Answers
  Active since 10 January 2023
  Last activity 2 years ago

Reputation

0

Badges 1

186 × Eureka!
0 Votes
30 Answers
2K Views
0 Votes 30 Answers 2K Views
is is possible to pass custom https://clear.ml/docs/latest/docs/configs/env_vars/ to ClearML agents?
3 years ago
0 Votes
10 Answers
2K Views
0 Votes 10 Answers 2K Views
what is the right way to increase number of retries when using StorageManager.get_local_copy?
3 years ago
0 Votes
2 Answers
2K Views
0 Votes 2 Answers 2K Views
two annoying visual bugs in ClearML Server UI after latest update: experiment status is still shown as “Aborted” after successful resetting until you refresh...
2 years ago
0 Votes
5 Answers
2K Views
0 Votes 5 Answers 2K Views
4 years ago
0 Votes
3 Answers
2K Views
0 Votes 3 Answers 2K Views
hey guys, thanks for creating Slack workspace, that's really cool. question - are we missing smth or is currently not possible to pass S3 credentials via env...
5 years ago
0 Votes
9 Answers
2K Views
0 Votes 9 Answers 2K Views
3 years ago
0 Votes
5 Answers
2K Views
0 Votes 5 Answers 2K Views
hey guys, a question about monthly worker_stats indices each of them takes up about 1gb for us. do we really need to keep all of them? is there any way to fr...
5 years ago
0 Votes
3 Answers
2K Views
0 Votes 3 Answers 2K Views
5 years ago
0 Votes
3 Answers
2K Views
0 Votes 3 Answers 2K Views
2 years ago
0 Votes
11 Answers
2K Views
0 Votes 11 Answers 2K Views
4 years ago
0 Votes
3 Answers
2K Views
0 Votes 3 Answers 2K Views
here I am again... can't find how to create a custom queue
5 years ago
0 Votes
8 Answers
2K Views
0 Votes 8 Answers 2K Views
5 years ago
0 Votes
0 Answers
2K Views
0 Votes 0 Answers 2K Views
hey guys, do you have any tutorials or examples of intergration with dvc?
5 years ago
0 Votes
11 Answers
2K Views
0 Votes 11 Answers 2K Views
hey guys, do you have any plans to add functionality to export training config with all hyperparameters to the different formats, such as training command li...
5 years ago
0 Votes
6 Answers
2K Views
0 Votes 6 Answers 2K Views
hey guys, here I am again with another question πŸ˜ƒ after the latest update, I’m getting this error when I’m trying to compare scalars for more than 10 experi...
5 years ago
0 Votes
29 Answers
2K Views
0 Votes 29 Answers 2K Views
4 years ago
0 Votes
5 Answers
2K Views
0 Votes 5 Answers 2K Views
Step 3 Task ( https://github.com/allegroai/trains/blob/master/examples/pipeline/step3_train_model.py ) - Loads the processed data (from Step 2) and clearml a...
4 years ago
0 Votes
4 Answers
2K Views
0 Votes 4 Answers 2K Views
feature request: ClearML prints GitHub token in the log, when there is "repository not found" error. it would be nice if could hide it
4 years ago
0 Votes
18 Answers
2K Views
0 Votes 18 Answers 2K Views
I updated trains-server today, and now it's very unstable, Web interface randomly stops working. anyone had the same problem? I've never had any problems wit...
5 years ago
0 Votes
2 Answers
2K Views
0 Votes 2 Answers 2K Views
4 years ago
0 Votes
7 Answers
2K Views
0 Votes 7 Answers 2K Views
4 years ago
0 Votes
8 Answers
2K Views
0 Votes 8 Answers 2K Views
4 years ago
0 Votes
16 Answers
2K Views
0 Votes 16 Answers 2K Views
yo guys, I'm getting Retrying (Retry(total=2, connect=2, read=5, redirect=5, status=None)) after connection broken by 'ConnectTimeoutError(, 'Connection to O...
5 years ago
0 Votes
7 Answers
2K Views
0 Votes 7 Answers 2K Views
any chance StorageManager could re-download files only if their size is different from file in cache (as an option)?
4 years ago
0 Votes
30 Answers
2K Views
0 Votes 30 Answers 2K Views
5 years ago
0 Votes
16 Answers
2K Views
0 Votes 16 Answers 2K Views
3 years ago
0 Votes
20 Answers
2K Views
0 Votes 20 Answers 2K Views
5 years ago
0 Votes
5 Answers
2K Views
0 Votes 5 Answers 2K Views
is there any way to post Slack alerts for the frozen experiments? (eg, after server restart they sometimes get stuck in Running mode, or https://github.com/p...
4 years ago
0 Votes
14 Answers
2K Views
0 Votes 14 Answers 2K Views
hey guys the first time I'm seeing this behavior I'm adding a new user to /opt/trains/config/apiserver.conf and restarting the containers. all old users are ...
5 years ago
0 Votes
6 Answers
2K Views
0 Votes 6 Answers 2K Views
hey guys, I keep getting "Failed parsing task parameter" warning for the arguments such as this one: parser.add_argument( "--dataset_mean", type = float, nar...
4 years ago
Show more results questions
0 Yo Guys, I'M Getting

works like a charm! you guys are the best, as always =)

5 years ago
0 Hey Guys, Is There A Ready Script That Can Delete All Models From S3 (Or Other Storage) That Are Related To Deleted Or Archived Experiments?

we're using the latest version of clearml, clearml agent and clearml server, but we've been using trains/clearml for 2.5 years, so there are some old tasks left, I guess πŸ˜ƒ

4 years ago
0 I'M Probably Stupid, But How Do I Specify Worker Name? Usecase - I Want To Create Two Workers Using The Same Gpu, And New Worker Just Overwrites The Old One

another stupid question - what is the proper way to delete a worker? so far I've been using pgrep to find the relevant PID πŸ˜ƒ

5 years ago
0 It Would Be Nice To Group Experiments Within Projects Use Cases:

tags are somewhat fine for this, I guess, but there will be too many of them eventually, and they do not reflect sequential nature of the experiments

3 years ago
0 It Would Be Nice To Group Experiments Within Projects Use Cases:

that's right
for example, there are tasks A, B, C
we run multiple experiments for A, finetune some of them in separate tasks, then choose one or more best checkpoints, run some experiments for task B, choose the best experiment, and finally run task C

so we get a chain of tasks: A - A-ft - B- C

ClearML pipeline doesn't quite work here because we would like to analyze results of each step before starting next task

but it would be great to see predecessors of each experiment in the chain

3 years ago
0 I'M Probably Stupid, But How Do I Specify Worker Name? Usecase - I Want To Create Two Workers Using The Same Gpu, And New Worker Just Overwrites The Old One

the weird part is that the old job continues running when I recreate the worker and enqueue the new job

5 years ago
0 What Is The Right Way To Increase Number Of Retries When Using

isn't this parameter related to communication with ClearML Server? I'm trying to make sure that checkpoint will be downloaded from AWS S3 even if there are temporary connection problems

there's https://boto3.amazonaws.com/v1/documentation/api/latest/reference/customizations/s3.html#boto3.s3.transfer.TransferConfig parameter in boto3, but I'm not sure if there's an easy way to pass this parameter to StorageManager

3 years ago
4 years ago
0 Yo Guys, I'M Getting

we're using EC2

5 years ago
0 Hi

wow, thanks, just updated our server!
can't seem to find these metrics snapshot plots =) how do I plot one?

4 years ago
0 Hey Guys, I Keep Getting "Failed Parsing Task Parameter" Warning For The Arguments Such As This One:

on the side note, is there any way to automatically give more meaningful names to the running docker containers?

4 years ago
0 Hey Guys, I Keep Getting

WARNING: You are using pip version 20.1.1; however, version 20.3.3 is available.
You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.

trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?

http://OUR_IP:8081 http://OUR_IP:8080 http://apiserver:8008
WARNING: You are using pip version 20.1.1; however, version 20.3.3 is available.
`...

4 years ago
0 Is Is Possible To Pass Custom

we're using os.getenv in the script to get a value for these secrets

3 years ago
0 I'M Probably Stupid, But How Do I Specify Worker Name? Usecase - I Want To Create Two Workers Using The Same Gpu, And New Worker Just Overwrites The Old One

our GPUs are 48GB, so it's quite wasteful to only run one job per GPU
yeah, I'm aware of that, I would have to make sure they don't fail to infamous CUDA out of memory, but still

5 years ago
0 Hey Guys, Is There A Ready Script That Can Delete All Models From S3 (Or Other Storage) That Are Related To Deleted Or Archived Experiments?

what if cleanup service is launched using ClearML-Agent Services container (part of the ClearML server)? adding clearml.conf to the home directory doesn't help

4 years ago
0 What Is The Right Way To Increase Number Of Retries When Using

I'm not sure since names of these parameters do not match with boto3 names, and num_download_attempt is passed https://github.com/allegroai/clearml/blob/3d3a835435cc2f01ff19fe0a58a8d7db10fd2de2/clearml/storage/helper.py#L1439 as container.config.retries

3 years ago
0 Is Is Possible To Pass Custom

works like a charm!

3 years ago
0 Yo Guys, I'M Getting

no, we've only been using metric and model logging so far

5 years ago
0 Hi

AnxiousSeal95 yeah, got it! thanks!

4 years ago
0 Hey Guys, A Question About Monthly Worker_Stats Indices Each Of Them Takes Up About 1Gb For Us. Do We Really Need To Keep All Of Them? Is There Any Way To Free Up The Space?

yeah, backups take much longer, and we had to increase our EC2 instance volume size twice because of these indices

got it, thanks, will try to delete older ones

5 years ago
Show more results compactanswers