DilapidatedParrot58
Moderator
42 Questions, 205 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation: 0
Badges: 1 (186 × Eureka!)
0 Votes 7 Answers 665 Views
there is something weird going on with console log after latest updates of ClearML Server. it doesn't show the latest updates, instead it often jumps to the ...
one year ago
0 Votes 6 Answers 770 Views
hey guys, here I am again with another question 😃 after the latest update, I’m getting this error when I’m trying to compare scalars for more than 10 experi...
3 years ago
0 Votes 7 Answers 899 Views
any chance StorageManager could re-download files only if their size is different from file in cache (as an option)?
2 years ago
0 Votes 2 Answers 823 Views
2 years ago
0 Votes 16 Answers 749 Views
yo guys, I'm getting Retrying (Retry(total=2, connect=2, read=5, redirect=5, status=None)) after connection broken by 'ConnectTimeoutError(, 'Connection to O...
3 years ago
0 Votes 5 Answers 797 Views
2 years ago
0 Votes 6 Answers 731 Views
one year ago
0 Votes 3 Answers 965 Views
hey guys, thanks for creating Slack workspace, that's really cool. question - are we missing smth or is it currently not possible to pass S3 credentials via env...
4 years ago
0 Votes 9 Answers 753 Views
2 years ago
0 Votes 6 Answers 716 Views
hey guys, I keep getting "Failed parsing task parameter" warning for the arguments such as this one: parser.add_argument( "--dataset_mean", type = float, nar...
2 years ago
0 Votes 6 Answers 735 Views
we just had a slight problem - there was a double space in S3 checkpoint name, but ClearML UI prints them as one in the model description. if you copy and pa...
one year ago
0 Votes 7 Answers 758 Views
I'm getting A LOT of errors when running cleanup service Failed deleting the following URIs - script fails to delete image and text files ERROR - Failed dele...
2 years ago
0 Votes 20 Answers 779 Views
4 years ago
0 Votes 6 Answers 856 Views
I’m interested in learning more about internals of ClearML Server - for example, how ElasticSearch, MongoDB, and Redis are used internally. are there any mat...
one year ago
0 Votes 10 Answers 683 Views
what is the right way to increase number of retries when using StorageManager.get_local_copy?
2 years ago
0 Votes 7 Answers 648 Views
3 years ago
0 Votes 5 Answers 702 Views
Step 3 Task ( https://github.com/allegroai/trains/blob/master/examples/pipeline/step3_train_model.py ) - Loads the processed data (from Step 2) and clearml a...
3 years ago
0 Votes 8 Answers 781 Views
3 years ago
0 Votes 0 Answers 749 Views
hey guys, do you have any tutorials or examples of integration with dvc?
4 years ago
0 Votes 27 Answers 702 Views
hey guys, I keep getting trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?...
3 years ago
0 Votes 13 Answers 735 Views
it would be nice to group experiments within projects. use cases: hyperparameter sweep (10 experiments with different learning rate), finetuning models (for ex...
2 years ago
0 Votes 2 Answers 726 Views
two annoying visual bugs in ClearML Server UI after latest update: experiment status is still shown as “Aborted” after successful resetting until you refresh...
one year ago
0 Votes 3 Answers 792 Views
4 years ago
0 Votes 2 Answers 666 Views
one year ago
0 Votes 4 Answers 710 Views
feature request: ClearML prints the GitHub token in the log when there is a "repository not found" error. it would be nice if it could be hidden
2 years ago
0 Votes 25 Answers 881 Views
I'm probably stupid, but how do I specify worker name? usecase - I want to create two workers using the same GPU, and new worker just overwrites the old one
4 years ago
0 Votes 3 Answers 834 Views
here I am again... can't find how to create a custom queue
4 years ago
0 Votes 30 Answers 759 Views
3 years ago
0 Votes 30 Answers 704 Views
is it possible to pass custom https://clear.ml/docs/latest/docs/configs/env_vars/ to ClearML agents?
2 years ago
0 Votes 29 Answers 675 Views
2 years ago
2 years ago
0 Hey Guys, I'm Trying To Run An Experiment Using Trains-Agent. I Have A Custom Docker Image With Nightly Versions Of Pytorch And Our Own Library Installed From A Private Repo. I Was Assuming That These Packages Will Be Automatically Available To Trains Dur

great, this helped, thanks! I simply added https://download.pytorch.org/whl/nightly/cu101/torch_nightly.html to trains.conf, and it seems to be working

I now have another problem, my code is looking for some additional files in the root folder of the project. I tried adding a Docker layer:
ADD file.pkl /root/.trains/venvs-builds/3.6/task_repository/project.git/extra_data/

but trains probably rewrites the folder when cloning the repo. is there any workaround?

4 years ago
0 Hey Guys, Is There A Ready Script That Can Delete All Models From S3 (Or Other Storage) That Are Related To Deleted Or Archived Experiments?

oh wow, I didn't see delete_artifacts_and_models option

I guess we'll have to manually find old artifacts that are related to already deleted tasks
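
A rough sketch of how such a manual search could look, in plain Python. It assumes every storage key contains a folder named `<task name>.<task id>` with a 32-character hex task ID, and that you already have the list of storage keys and the set of task IDs that still exist. `find_orphaned_keys` and both of its inputs are hypothetical names made up for illustration, not part of ClearML.

```python
import re
from typing import Iterable, List, Set

# Assumption: the task ID is a 32-character lowercase hex string that follows
# a dot in one of the folder names, e.g. "project/train.<task id>/models/x.pt".
TASK_ID_RE = re.compile(r"\.([0-9a-f]{32})(?:/|$)")


def find_orphaned_keys(keys: Iterable[str], live_task_ids: Set[str]) -> List[str]:
    """Return the keys whose embedded task ID is not among the live task IDs."""
    orphaned = []
    for key in keys:
        match = TASK_ID_RE.search(key)
        # Keys without a recognizable task ID are skipped rather than flagged.
        if match and match.group(1) not in live_task_ids:
            orphaned.append(key)
    return orphaned
```

Keys that do not match the assumed layout are left alone on purpose, so nothing gets flagged for deletion just because its path looks unusual.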

2 years ago
0 I Keep Getting Errors When Trying To Compare A Lot Of Experiments At The Same Time (>10). What's Even Worse Is That Trains Start Working Much Slower In General After These Attempts, The Only Way To Fix It Is To Restart The Whole Thing. Would Getting Bett

btw, there are "[2020-09-02 15:15:40,331] [9] [WARNING] [urllib3.connectionpool] Connection pool is full, discarding connection: elasticsearch" in the apiserver logs again

3 years ago
0 I Updated Trains-Server Today, And Now It's Very Unstable, Web Interface Randomly Stops Working. Anyone Had The Same Problem? I've Never Had Any Problems With Updating The Server Before

I've already pulled new images from trains-server, let's see if the initial issue occurs again. thanks for the fast response, guys!

4 years ago
0 I'm Using Tensorboard Summarywriter To Add Scalar Metrics For The Experiment. If Experiment Crashed, And I Want To Continue It From Checkpoint, For Some Reason It Plots Metrics In A Really Weird Way. Even Though I Pass Global_Step=Epoch To The Summarywrit

not sure what you mean. I used to do task.set_initial_iteration(task.get_last_iteration()) in the task resuming script, but in the training code I explicitly pass global_step=epoch to the TensorBoard writer

2 years ago
0 Hey Guys, I Keep Getting

well, the server wouldn't work without them?

3 years ago
0 Hey Guys, I Keep Getting

default docker-compose

3 years ago
0 Hey Guys, I Keep Getting

do you have any idea why the cleanup task keeps failing then? (it used to work before the update)

3 years ago
0 Downloading Output Artifacts From S3 By Clicking On The Download Button Next To Model Url Was Great, But Since We Moved From Aws To Yandex.Cloud, This Feature Doesn't Work. Any Chance You Could Support Other Cloud Providers?

yes. we upload artifacts to Yandex.Cloud S3 using ClearML. we set "s3://storage.yandexcloud.net/clearml-models" as the output URI parameter and add this section to the config:
{
    host: "http://storage.yandexcloud.net"
    key: "KEY"
    secret: "SECRET_KEY"
    secure: true
}

this works like a charm. but download button in UI is not working

one year ago
0 I'm Using Tensorboard Summarywriter To Add Scalar Metrics For The Experiment. If Experiment Crashed, And I Want To Continue It From Checkpoint, For Some Reason It Plots Metrics In A Really Weird Way. Even Though I Pass Global_Step=Epoch To The Summarywrit

overwriting this value is not ideal though, because for :monitor:gpu and :monitor:machine values I would like to continue from the latest iteration

but for the metrics, I explicitly pass the number of the epoch that my training is currently on. it's kind of weird that it adds an offset to the values that are explicitly reported, no?
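
To make the effect concrete, here is a minimal sketch in plain Python (not ClearML code). It assumes the offset set through task.set_initial_iteration() is simply added to every reported step, including the ones passed explicitly as global_step. `displayed_iteration` is a hypothetical helper made up for illustration.

```python
def displayed_iteration(initial_offset: int, reported_step: int) -> int:
    """Assumed model of the plotted step: initial offset plus the reported step."""
    return initial_offset + reported_step


# Training crashed after epoch 10 and is resumed from the checkpoint.
last_iteration = 10

# The resumed run explicitly reports global_step=11 for the next epoch.
with_offset = displayed_iteration(last_iteration, 11)  # offset = last iteration
without_offset = displayed_iteration(0, 11)            # offset reset to zero
```

Under this assumed model, epoch 11 lands at step 21 when the offset equals the last iteration, and stays at 11 when the offset is zero. Resetting the offset would apply to every reported value, though, which is why it does not help when the :monitor:gpu and :monitor:machine values should continue from the latest iteration.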

2 years ago
2 years ago
0 Is It Possible To Pass Custom

ah, I see, I still keep it in agent.extra_docker_arguments

2 years ago