JitteryCoyote63
Moderator
215 Questions, 1023 Answers
  Active since 10 January 2023
  Last activity one month ago

Reputation

0

Badges 1

981 × Eureka!
0 Votes
16 Answers
2K Views
Got some errors while running migration script from ES5 to ES7: 2020-08-11 15:21:50,130 Running on: Linux 2020-08-11 15:21:50,227 Docker allocated memory: 16...
5 years ago
0 Votes
3 Answers
1K Views
Hey! Would it be possible to tag the RC releases in the different repos? So that one knows what is inside?
5 years ago
0 Votes
2 Answers
1K Views
Hi, in the AWS AutoScaler, I am getting the following warning: Warning! exception occurred: APIError: code 400/1004: Worker is not registered: worker=aws:A10...
4 years ago
0 Votes
3 Answers
2K Views
Hi, I see that there is a new parameter in aws autoscaler: max_spin_up_time_min - What is the difference with max_idle_time_min ?
aws
4 years ago
0 Votes
6 Answers
2K Views
Hi, is there a way to stop a clearml-agent from within an experiment? Or block it to prevent it from running any other task?
4 years ago
0 Votes
1 Answers
2K Views
Hi, would it be possible to parse torch requirement when it’s part of the extras_require dict? In my code, I have the following: train_task._update_requireme...
4 years ago
0 Votes
23 Answers
2K Views
Hi, I would like to bring awareness to this issue, which impacts my work: I cannot install the older version of torch (1.11.0)
2 years ago
0 Votes
3 Answers
1K Views
Hello there, is there a parameter to configure the number of columns rendered in the preview area of the CSV artifacts? (some of them are truncated with “…”)
4 years ago
0 Votes
3 Answers
2K Views
Hi guys, since I am done with implementing the AWS autoscaler, I would like to share some pain points that I encountered in the process with the hope that th...
aws
4 years ago
0 Votes
3 Answers
2K Views
Hey guys, quick question: is there a tool function to know if a task id is valid? Not verifying that the task itself exists, just that the task id is the cor...
4 years ago
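On the question above about checking only the format of a task ID, here is a minimal sketch. It assumes task IDs are 32-character lowercase hexadecimal strings (a UUID without dashes); that format is an assumption, not something confirmed in this thread, and looks_like_task_id is a hypothetical helper, not a ClearML function.

```python
import re

# Assumption: a task ID is a UUID rendered as 32 lowercase hex characters.
_TASK_ID_RE = re.compile(r"[0-9a-f]{32}")


def looks_like_task_id(value: object) -> bool:
    """Check the assumed task-ID shape only; the task itself may not exist."""
    return isinstance(value, str) and _TASK_ID_RE.fullmatch(value) is not None
```

This only rejects obviously malformed strings; confirming that a task exists would still need a call to the server.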
0 Votes
6 Answers
2K Views
4 years ago
0 Votes
3 Answers
1K Views
Hi, in the context of multi-gpu training, is Model.get_local_copy() multi-process safe? Or should I make sure only the first process calls it first, then the others?
3 years ago
0 Votes
5 Answers
1K Views
Hey there, since which version does clearml stop connecting to the demo server by default?
3 years ago
0 Votes
30 Answers
2K Views
4 years ago
0 Votes
13 Answers
2K Views
Hey there, Is it possible for a clearml pipeline step to log a folder instead of numpy/pickle objects? Looking at the docs, monitor_artifacts could be what I...
3 years ago
0 Votes
2 Answers
1K Views
Looks like trains-agent 0.16 doesn't support the documented --install-globally parameter -> it is only available for the trains-agent build command. Would it be possible t...
5 years ago
0 Votes
0 Answers
2K Views
Hi, I encountered a bug on clearml-server 1.0.1: I tried to add in a project page a custom column in +HYPER PARAMETERS > Args > queue and got an error pop up...
4 years ago
0 Votes
2 Answers
1K Views
Hi, how can I search an old experiment based on its commit hash?
2 years ago
0 Votes
17 Answers
2K Views
3 years ago
0 Votes
6 Answers
2K Views
Hi, I cannot manage to start trains-server 0.16 with the docker-compose file, the trains-elastic container fails with the following error:
5 years ago
0 Votes
5 Answers
2K Views
Does trains 0.16 support pip >=20.2?
5 years ago
0 Votes
1 Answers
1K Views
Hi there, would it be possible to add some Neural Architecture Search example, as for the HyperParameter Optimizer examples?
4 years ago
0 Votes
1 Answers
1K Views
Hi, I think I found a small bug: clone an experiment; enqueue it on a queue with no workers; delete the queue; try to dequeue the experiment. The last operation w...
4 years ago
0 Votes
5 Answers
2K Views
4 years ago
0 Votes
22 Answers
2K Views
Hi, I would like to switch from the elastic-search service in the docker-compose of the clearml-server to an externally managed, scalable elastic-search clus...
4 years ago
0 Votes
18 Answers
2K Views
Hi guys, I have now had the following error popping up several times in agents while executing a task: trains_agent: ERROR: Failed applying git diff: I attached the ...
4 years ago
0 Votes
6 Answers
1K Views
Hi there, is it possible to configure the clearml-agent to run some commands before running each experiment it launches? Eg. echo "test" > "test.txt" && <-- ...
3 years ago
0 Votes
4 Answers
2K Views
The “Manage queue” option in the right tab on a queued experiment is broken in v1.0 (it does nothing)
4 years ago
0 Votes
0 Answers
2K Views
Hi all, Would it be possible to make the aws autoscaler log each scale in/out operation in the console to help debugging/understanding the course of events?
4 years ago
0 Votes
30 Answers
2K Views
Hi guys, with the new venv caching available in clearml, I have the following problem: I force my pip requirements to be: torch==1.7.1 pytorch-ignite clearml...
4 years ago
0 Hey, Would It Possible To Add An Option To Make

awesome 🎉
Maybe then we can extend task.upload_artifact ?
def upload_artifact(..., wait_for_upload: bool = False):
    ...
    if wait_for_upload:
        self.flush(wait_for_uploads=True)

5 years ago
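Until such an option exists, the same effect can be approximated from user code. A minimal sketch, assuming the task object exposes upload_artifact(name, artifact_object=...) and flush(wait_for_uploads=True) as in the message above; upload_artifact_and_wait is a hypothetical helper name, not part of the library.

```python
from typing import Any


def upload_artifact_and_wait(task: Any, name: str, artifact_object: Any) -> bool:
    """Upload an artifact, then block until pending uploads are flushed."""
    ok = task.upload_artifact(name, artifact_object=artifact_object)
    # flush(wait_for_uploads=True) is the call suggested in the message above.
    task.flush(wait_for_uploads=True)
    return bool(ok)
```

Keeping the wait in a separate helper leaves the default non-blocking behavior untouched for callers that do not need it.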
0 Hey, Would It Possible To Add An Option To Make

not really, because it is in the middle of the controller task, there are other things to be done afterwards (retrieving results, logging new artifacts, creating new tasks, etc)

5 years ago
0 Hi, I Have Another Problem

thanks, I will do that

5 years ago
0 Hi There

Here is the minimal reproducible example.
Run test_task_a.py: it will register a dummy artifact, create a new task, set a parameter in that task and enqueue it. test_task_b will then try to retrieve the parameter from the parent task and fail.

5 years ago
0 Hi Guys, I Got A Very Unexpected Error Today On In One Of My Agents:

Unfortunately this is difficult to reproduce... Nevertheless, it is important for me to be robust against it, because if this error happens in a task in the middle of my pipeline, the whole process fails.

This ties into a wider topic, I think: how to "skip" tasks that have already run (a mechanism similar to what Luigi, https://luigi.readthedocs.io/en/stable/ , offers). That would allow restarting the pipeline and skipping tasks up to the point where it failed

5 years ago
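The "skip tasks that have already run" idea can be sketched independently of any pipeline library. This is a minimal illustration, loosely in the spirit of Luigi's check for existing outputs; run_pipeline, the marker files and the state directory are all hypothetical names introduced here, not ClearML or Luigi APIs.

```python
import json
from pathlib import Path
from typing import Callable, Dict, List, Tuple


def run_pipeline(
    steps: List[Tuple[str, Callable[[], object]]], state_dir: Path
) -> Dict[str, str]:
    """Run steps in order, skipping any step whose marker file already exists."""
    state_dir.mkdir(parents=True, exist_ok=True)
    report: Dict[str, str] = {}
    for name, func in steps:
        marker = state_dir / f"{name}.done"
        if marker.exists():
            report[name] = "skipped"
            continue
        # An exception here stops the run; markers of earlier steps remain,
        # so a restart resumes from the failed step.
        result = func()
        marker.write_text(json.dumps({"result": repr(result)}))
        report[name] = "ran"
    return report
```

Calling run_pipeline again with the same state_dir after a failure skips the completed steps and re-runs only from the step that failed.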
0 Hi There

AgitatedDove14 I cannot confirm 100%, the context is different (see previous messages), but it could be the same bug behind the scenes...

5 years ago
0 Hi Guys, Coming This Time To Share An Idea Of A Killer Feature For Clearml

AnxiousSeal95 Any update on this topic? I am very excited to see where this can go 🤩

4 years ago
0 Got Some Errors While Running Migration Script From Es5 To Es7:

I should also rename /opt/trains/data/elastic_migrated_2020-08-11_15-27-05 folder to /opt/trains/data/elastic before running the migration tool right?

5 years ago
0 Hi, I Have A Configuration File That I Read And Connect To My Training Tasks. I Cannot Use

Hi SuccessfulKoala55, super, that’s what I was looking for

3 years ago
0 Hey Again

Awesome!

4 years ago
4 years ago
0 Hi, How Does

Also, enable_git_ask_pass is not dumped into the logs when an experiment starts, btw

2 years ago
0 Could You Please Explain A Bit More How Trains Adapt The Torch Version Depending On The Installed Cuda Version? Here Is My Setup:

I now have a different question: when installing torch from wheels files, I am guaranteed to have the corresponding cuda library and cudnn together right?

4 years ago
0 Hi, I Started A Trains-Agent (0.15) In Services Mode (Full Command:

I will try to isolate the bug; if I can, I will open an issue in trains-agent 🙂

5 years ago
0 Hey There, Since Which Version, Clearml Stops Connecting To The Demo Server By Default?

Thanks SuccessfulKoala55 ! So CLEARML_NO_DEFAULT_SERVER=1 by default, right?

3 years ago
0 Got Some Errors While Running Migration Script From Es5 To Es7:

sure, will be happy to debug that 🙂

5 years ago
0 Hey There, Since Which Version, Clearml Stops Connecting To The Demo Server By Default?

Will from clearml import Task raise an error if no clearml.conf exists? Or only when features that require a configured server (such as Task.init) are called?

3 years ago
0 Hi, Together With

Exactly

5 years ago
5 years ago
0 Hi, Together With

Which commit corresponds to RC version? So far we tested with latest commit on master (9a7850b23d2b0e1f2098ab051de58ce806143fff)

5 years ago
0 Hi, I Have Another Problem

ho, that might be it then, thanks!

5 years ago
0 Hi, Is There A Way To Stop A Clearml-Agent From Within An Experiment? Or Block It To Prevent It Running Any Other Task?

My use case is: in a spot instance marked for termination after 2 mins by aws, I want to close a task and prevent the clearml-agent from picking up a new task afterwards.

4 years ago
0 Hi, Together With

Not using pytorch distributed, all models are uploaded to s3 yes

5 years ago
0 Hi There, Congrats For Releasing V1

I am using 0.17.5, it could be either a bug in ignite or indeed a delay on the send. I will try to build a simple reproducible example to understand the cause

4 years ago
0 Hello There, I Would Like To Do Run Cleanup Code In Case The User Aborts One Task From The Dashboard (The Agent Is Not Using The Task In Docker). What Signal Should I Listen For In The Task?

Also maybe we are not on the same page - by clean up, I mean kill a detached subprocess on the machine executing the agent

4 years ago
0 Hi, I Would Like To Bring Awareness

I think we should switch back, and have a configuration to control which mechanism the agent uses, wdyt?

That sounds great!

2 years ago
0 Could You Please Explain A Bit More How Trains Adapt The Torch Version Depending On The Installed Cuda Version? Here Is My Setup:

From the answers I saw on the internet, it is most likely related to the mismatch of cuda/cudnn version

4 years ago
0 Hi, I Started A Trains-Agent (0.15) In Services Mode (Full Command:

The weird thing is that the second experiment started immediately, correctly in a docker container, but failed with User aborted: stopping task (3) at some point (while installing the packages). The error message is surprising since I did not do anything. And then all following experiments are queued to the services queue and get stuck there.

5 years ago