Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
DilapidatedParrot58
Moderator
42 Questions, 205 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

186 × Eureka!
0 Votes
27 Answers
1K Views
0 Votes 27 Answers 1K Views
hey guys, I keep getting trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?...
3 years ago
0 Votes
18 Answers
1K Views
0 Votes 18 Answers 1K Views
I updated trains-server today, and now it's very unstable, Web interface randomly stops working. anyone had the same problem? I've never had any problems wit...
4 years ago
0 Votes
16 Answers
1K Views
0 Votes 16 Answers 1K Views
yo guys, I'm getting Retrying (Retry(total=2, connect=2, read=5, redirect=5, status=None)) after connection broken by 'ConnectTimeoutError(, 'Connection to O...
4 years ago
0 Votes
2 Answers
959 Views
0 Votes 2 Answers 959 Views
one year ago
0 Votes
16 Answers
1K Views
0 Votes 16 Answers 1K Views
2 years ago
0 Votes
8 Answers
1K Views
0 Votes 8 Answers 1K Views
4 years ago
0 Votes
10 Answers
1K Views
0 Votes 10 Answers 1K Views
what is the right way to increase number of retries when using StorageManager.get_local_copy?
2 years ago
0 Votes
13 Answers
1K Views
0 Votes 13 Answers 1K Views
it would be nice to group experiments within projects use cases: hyperparameter sweep (10 experiments with different learning rate) finetuning models (for ex...
2 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
I’m interested in learning more about internals of ClearML Server - for example, how ElasticSearch, MongoDB, and Redis are used internally. are there any mat...
2 years ago
0 Votes
3 Answers
1K Views
0 Votes 3 Answers 1K Views
here I am again... can't find how to create a custom queue
4 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
hey guys, I keep getting "Failed parsing task parameter" warning for the arguments such as this one: parser.add_argument( "--dataset_mean", type = float, nar...
3 years ago
0 Votes
3 Answers
1K Views
0 Votes 3 Answers 1K Views
4 years ago
0 Votes
9 Answers
1K Views
0 Votes 9 Answers 1K Views
2 years ago
0 Votes
7 Answers
971 Views
0 Votes 7 Answers 971 Views
3 years ago
0 Votes
30 Answers
1K Views
0 Votes 30 Answers 1K Views
is is possible to pass custom https://clear.ml/docs/latest/docs/configs/env_vars/ to ClearML agents?
2 years ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
I'm getting A LOT of errors when running cleanup service Failed deleting the following URIs - script fails to delete image and text files ERROR - Failed dele...
2 years ago
0 Votes
14 Answers
1K Views
0 Votes 14 Answers 1K Views
hey guys the first time I'm seeing this behavior I'm adding a new user to /opt/trains/config/apiserver.conf and restarting the containers. all old users are ...
4 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
feature request: ClearML prints GitHub token in the log, when there is "repository not found" error. it would be nice if could hide it
3 years ago
0 Votes
20 Answers
1K Views
0 Votes 20 Answers 1K Views
4 years ago
0 Votes
7 Answers
966 Views
0 Votes 7 Answers 966 Views
there is something weird going on with console log after latest updates of ClearML Server. it doesn't show the latest updates, instead it often jumps to the ...
one year ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
any chance StorageManager could re-download files only if their size is different from file in cache (as an option)?
3 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
2 years ago
0 Votes
2 Answers
1K Views
0 Votes 2 Answers 1K Views
3 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
hey guys, a question about monthly worker_stats indices each of them takes up about 1gb for us. do we really need to keep all of them? is there any way to fr...
4 years ago
0 Votes
8 Answers
1K Views
0 Votes 8 Answers 1K Views
downloading output artifacts from S3 by clicking on the download button next to Model URL was great, but since we moved from AWS to Yandex.Cloud, this featur...
2 years ago
0 Votes
25 Answers
1K Views
0 Votes 25 Answers 1K Views
I'm probably stupid, but how do I specify worker name? usecase - I want to create two workers using the same GPU, and new worker just overwrites the old one
4 years ago
0 Votes
11 Answers
1K Views
0 Votes 11 Answers 1K Views
hey guys, do you have any plans to add functionality to export training config with all hyperparameters to the different formats, such as training command li...
4 years ago
0 Votes
8 Answers
1K Views
0 Votes 8 Answers 1K Views
3 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
Step 3 Task ( https://github.com/allegroai/trains/blob/master/examples/pipeline/step3_train_model.py ) - Loads the processed data (from Step 2) and clearml a...
3 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
hey guys, here I am again with another question πŸ˜ƒ after the latest update, I’m getting this error when I’m trying to compare scalars for more than 10 experi...
4 years ago
Show more results questions
0 We Just Had A Slight Problem - There Was A Double Space In S3 Checkpoint Name, But Clearml Ui Prints Them As One In The Model Description. If You Copy And Paste It, The Address Will Be Wrong

original task name contains double space -> saved checkpoint also contains double space -> MODEL URL field in model description of this checkpoint in ClearML converts double space into single space. so when you copy & paste it somewhere, it'll be incorrect

2 years ago
0 When We Train The Models, We Often Choose Checkpoint Based On The Validation Accuracy, But Test Set Accuracy (Or Specific Class Validation Accuracy) Is Not Necessarily The Best For This Checkpoint. Right Now There Are Options To Add Columns With Max And L

I guess, this could overcomplicate ui, I don't see a good solution yet.

as a quick hack, we can just use separate name (eg "best_val_roc_auc") for all metric values for the current best checkpoint. then we can just add columns with the last value of this metric

3 years ago
0 Is Is Possible To Pass Custom

it works, but it's not very helpful since everybody can see a secret in logs:
Executing: ['docker', 'run', '-t', '--gpus', '"device=0"', '-e', 'DB_PASSWORD=password']

2 years ago
2 years ago
0 Is Is Possible To Pass Custom

we're using os.getenv in the script to get a value for these secrets

2 years ago
0 Hey Guys, I Keep Getting

new version worked

3 years ago
0 Hey Guys, I Keep Getting

thank you πŸ˜ƒ

3 years ago
0 Hey Guys, I Keep Getting

nice, thanks! I'll check if it solves the issue first thing tomorrow in the morning

3 years ago
0 Hey Guys, I Keep Getting

SuccessfulKoala55 grrrrr it keeps happening, I have no idea what's wrong

3 years ago
0 I'M Getting A Lot Of Errors When Running Cleanup Service

self-hosted ClearML server 1.2.0
SDK version 1.1.6

2 years ago
0 Hey Guys, Here I Am Again With Another Question

we often do ablation studies with more than 50 experiments, and it was very convenient to compare their dynamics at the different epochs

4 years ago
0 Hey Guys, Here I Am Again With Another Question

fantastic, everything is working perfectly
thanks guys

4 years ago
3 years ago
0 Hey Guys, Is There A Ready Script That Can Delete All Models From S3 (Or Other Storage) That Are Related To Deleted Or Archived Experiments?

what if cleanup service is launched using ClearML-Agent Services container (part of the ClearML server)? adding clearml.conf to the home directory doesn't help

3 years ago
0 Hey Guys, Is There A Ready Script That Can Delete All Models From S3 (Or Other Storage) That Are Related To Deleted Or Archived Experiments?

two more questions about cleanup if you don't mind:
what if for some old tasks I get WARNING:root:Could not delete Task ID=a0908784a2a942c3812f947ec1f32c9f, 'Task' object has no attribute 'delete'? What's the best way of cleaning them? What is the recommended way of providing S3 credentials to cleanup task?

3 years ago
0 Hey Guys, Is There A Ready Script That Can Delete All Models From S3 (Or Other Storage) That Are Related To Deleted Or Archived Experiments?

oh wow, I didn't see delete_artifacts_and_models option

I guess we'll have to manually find old artifacts that are related to already deleted tasks

3 years ago
0 Is Is Possible To Pass Custom

agent.hide_docker_command_env_vars.extra_keys: ["DB_PASSWORD=password"]

like this? or ["DB_PASSWORD", "password"]

2 years ago
0 Any Chance Storagemanager Could Re-Download Files Only If Their Size Is Different From File In Cache (As An Option)?

yeah, I was thinking mainly about AWS. we use force to make sure we are using the correct latest checkpoint, but this increases costs when we are running a lot of experiments

3 years ago
Show more results compactanswers