DilapidatedParrot58
Moderator
42 Questions, 205 Answers
  Active since 10 January 2023
  Last activity 2 years ago

Reputation

0

Badges 1

186 × Eureka!
0 Votes
16 Answers
2K Views
yo guys, I'm getting Retrying (Retry(total=2, connect=2, read=5, redirect=5, status=None)) after connection broken by 'ConnectTimeoutError(, 'Connection to O...
5 years ago
0 Votes
16 Answers
2K Views
3 years ago
0 Votes
18 Answers
2K Views
I updated trains-server today, and now it's very unstable, Web interface randomly stops working. anyone had the same problem? I've never had any problems wit...
5 years ago
0 Votes
2 Answers
2K Views
two annoying visual bugs in ClearML Server UI after latest update: experiment status is still shown as “Aborted” after successful resetting until you refresh...
3 years ago
0 Votes
3 Answers
2K Views
3 years ago
0 Votes
3 Answers
2K Views
5 years ago
0 Votes
10 Answers
2K Views
what is the right way to increase number of retries when using StorageManager.get_local_copy?
3 years ago
0 Votes
25 Answers
2K Views
I'm probably stupid, but how do I specify worker name? usecase - I want to create two workers using the same GPU, and new worker just overwrites the old one
5 years ago
0 Votes
27 Answers
2K Views
hey guys, I keep getting trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?...
4 years ago
0 Votes
7 Answers
2K Views
4 years ago
0 Votes
9 Answers
2K Views
3 years ago
0 Votes
0 Answers
2K Views
hey guys, do you have any tutorials or examples of integration with dvc?
5 years ago
0 Votes
8 Answers
2K Views
4 years ago
0 Votes
6 Answers
2K Views
hey guys, I keep getting "Failed parsing task parameter" warning for the arguments such as this one: parser.add_argument( "--dataset_mean", type = float, nar...
4 years ago
0 Votes
11 Answers
2K Views
hey guys, is there a ready script that can delete all models from S3 (or other storage) that are related to deleted or archived experiments?
4 years ago
0 Votes
13 Answers
2K Views
it would be nice to group experiments within projects use cases: hyperparameter sweep (10 experiments with different learning rate) finetuning models (for ex...
3 years ago
0 Votes
7 Answers
2K Views
there is something weird going on with console log after latest updates of ClearML Server. it doesn't show the latest updates, instead it often jumps to the ...
2 years ago
0 Votes
11 Answers
2K Views
hey guys, do you have any plans to add functionality to export training config with all hyperparameters to the different formats, such as training command li...
5 years ago
0 Votes
2 Answers
2K Views
is there any way to export CSV with max metrics and hyperparameters for selected experiments?
4 years ago
0 Votes
7 Answers
2K Views
I'm getting A LOT of errors when running cleanup service Failed deleting the following URIs - script fails to delete image and text files ERROR - Failed dele...
3 years ago
0 Votes
4 Answers
2K Views
feature request: ClearML prints GitHub token in the log when there is a "repository not found" error. it would be nice if it could hide it
4 years ago
0 Votes
7 Answers
2K Views
any chance StorageManager could re-download files only if their size is different from file in cache (as an option)?
4 years ago
0 Votes
14 Answers
2K Views
hey guys the first time I'm seeing this behavior I'm adding a new user to /opt/trains/config/apiserver.conf and restarting the containers. all old users are ...
5 years ago
0 Votes
3 Answers
2K Views
here I am again... can't find how to create a custom queue
5 years ago
0 Votes
5 Answers
2K Views
is there any way to post Slack alerts for the frozen experiments? (eg, after server restart they sometimes get stuck in Running mode, or https://github.com/p...
4 years ago
0 Votes
6 Answers
2K Views
we just had a slight problem - there was a double space in S3 checkpoint name, but ClearML UI prints them as one in the model description. if you copy and pa...
2 years ago
0 Votes
2 Answers
2K Views
4 years ago
0 Votes
5 Answers
2K Views
4 years ago
0 Votes
20 Answers
2K Views
5 years ago
0 Votes
30 Answers
2K Views
is it possible to pass custom https://clear.ml/docs/latest/docs/configs/env_vars/ to ClearML agents?
3 years ago
0 Hi

all our workers went down after starting the slack bot, is it expected?)

5 years ago
0 I'm Probably Stupid, But How Do I Specify Worker Name? Usecase - I Want To Create Two Workers Using The Same GPU, And New Worker Just Overwrites The Old One

another stupid question - what is the proper way to delete a worker? so far I've been using pgrep to find the relevant PID 😃

5 years ago
0 It Would Be Nice To Group Experiments Within Projects Use Cases:

hard to say, maybe just “related experiments” in experiment info would be enough. I’ll think about it

3 years ago
0 I Updated Trains-Server Today, And Now It's Very Unstable, Web Interface Randomly Stops Working. Anyone Had The Same Problem? I've Never Had Any Problems With Updating The Server Before

as a sidenote, I am not able to pull the newest release, looks like it's not pushed?
"Error response from daemon: manifest for allegroai/trains:0.14.2 not found"

5 years ago
0 It Would Be Nice To Group Experiments Within Projects Use Cases:

tags are somewhat fine for this, I guess, but there will be too many of them eventually, and they do not reflect sequential nature of the experiments

3 years ago
0 Downloading Output Artifacts From S3 By Clicking On The Download Button Next To Model URL Was Great, But Since We Moved From AWS To Yandex.Cloud, This Feature Doesn't Work. Any Chance You Could Support Other Cloud Providers?

yes. we upload artifacts to Yandex.Cloud S3 using ClearML. we set "s3://storage.yandexcloud.net/clearml-models" as output uri parameter and add this section to the config:
{
    host: "http://storage.yandexcloud.net"
    key: "KEY"
    secret: "SECRET_KEY"
    secure: true
}

this works like a charm. but download button in UI is not working
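
for anyone wondering why the config section is keyed by host: with a non-AWS endpoint, the first part of the output uri is the storage host and the next part is the bucket. a rough Python sketch of that split, assuming the uri has the s3://host/bucket/path shape shown above (split_s3_uri is a hypothetical helper for illustration, not part of ClearML):

```python
from typing import Tuple
from urllib.parse import urlparse


def split_s3_uri(uri: str) -> Tuple[str, str, str]:
    """Split an s3://host/bucket/key style URI into (host, bucket, key).

    Hypothetical helper: it assumes the endpoint host is embedded in the URI,
    as in the output uri quoted in this answer.
    """
    parsed = urlparse(uri.strip())
    if parsed.scheme != "s3":
        raise ValueError(f"expected an s3:// URI, got: {uri!r}")
    # The first path segment is the bucket; whatever follows is the object key.
    bucket, _, key = parsed.path.lstrip("/").partition("/")
    return parsed.netloc, bucket, key


host, bucket, key = split_s3_uri("s3://storage.yandexcloud.net/clearml-models")
print(host, bucket, key)
```

the host that comes out of this is the one that has to line up with the host entry in the credentials section.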

3 years ago
0 I Keep Getting Errors When Trying To Compare A Lot Of Experiments At The Same Time (>10). What's Even Worse Is That Trains Start Working Much Slower In General After These Attempts, The Only Way To Fix It Is To Restart The Whole Thing. Would Getting Bett

running docker network prune before starting the containers kind of helped. I still see an error when I'm comparing > 20 experiments, but at least trains works okay after that, and there are no connection pool limit errors in the logs

5 years ago
0 Is It Possible To Pass Custom

this is probably what I need, thanks. I'll check if it works

3 years ago
0 Hey Guys, I'm Trying To Run An Experiment Using Trains-Agent. I Have A Custom Docker Image With Nightly Versions Of Pytorch And Our Own Library Installed From A Private Repo. I Was Assuming That These Packages Will Be Automatically Available To Trains Dur

I added the link just in case anyway 😃

also, is there any way to install a repo that we clone as a package? we often use absolute imports and do "pip install -e ." to utilize it
sorry there are so many questions, we just really want to migrate to trains-agent)

5 years ago
0 Hey Guys, I Keep Getting

thank you 😃

4 years ago
0 Here I Am Again... Can't Find How To Create A Custom Queue

LOL
wow 😃
I was trying to find how to create a queue using CLI 😃

5 years ago
0 Hey Guys, I Keep Getting

new version worked

4 years ago
0 It Would Be Nice To Group Experiments Within Projects Use Cases:

nope, that's the point, quite often we run experiments separately, but they are related to each other. currently there's no way to see that one experiment is using checkpoint from the previous experiment since we need to manually insert S3 link as a hyperparameter. it would be useful to see these connections. maybe instead of grouping we could see which experiments are using artifacts of this experiment

3 years ago
0 Yo Guys, I'm Getting

yeah, that's exactly what I'm looking to right now 😃

5 years ago