Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
ExcitedFish86
Moderator
8 Questions, 55 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

43 × Eureka!
0 Votes
3 Answers
670 Views
0 Votes 3 Answers 670 Views
Hi all, Is there a way to filter a experiments in a hyperparameter sweep based on a given range of a parameter/metric in the UI (similar to wandb )? Also, is...
2 years ago
0 Votes
9 Answers
621 Views
0 Votes 9 Answers 621 Views
Hi all, I'm trying to upgrade clearml-server but I keep getting permission errors from the elastic search container: clearml-elastic | ElasticsearchException...
3 years ago
0 Votes
13 Answers
684 Views
0 Votes 13 Answers 684 Views
Hi folks! I'm using SummaryWriter from PyTorch's tensorboard utils to log pr_curve , and I get the attached curve. Looks like the X axis is reversed, and I c...
3 years ago
0 Votes
3 Answers
578 Views
0 Votes 3 Answers 578 Views
Hi all, I see there is an option for running a bash script / commands inside a container started by an agent. Is it possible to have this set differently per...
2 years ago
0 Votes
18 Answers
618 Views
0 Votes 18 Answers 618 Views
2 years ago
0 Votes
30 Answers
555 Views
0 Votes 30 Answers 555 Views
Hi guys! Is there a way to tell an agent to run a task in an existing venv (without creating a new one)?
2 years ago
0 Votes
11 Answers
643 Views
0 Votes 11 Answers 643 Views
Hi all, I have a question regarding multi-node training using the clearml-agent. What is the recommended setup in this case? Say I have 3 nodes with 3 agents...
2 years ago
0 Votes
2 Answers
588 Views
0 Votes 2 Answers 588 Views
Hi guys, just wanted to let you know that many links in the ClearML github page are broken (i.e., https://github.com/allegroai/clearml/blob/master )
3 years ago
0 Hi, In One Of My Agents With Cuda Version: 11.1 (From Nvidia-Smi), Clearml Agent 0.17.1 Detects Version 100 (I Can See From Experiments Logs:

cudnn isn't cuda, it's a separate library.
are you running on docker on bare metal? you should have cuda installed at /usr/local/cuda-<>

3 years ago
0 Hi, In One Of My Agents With Cuda Version: 11.1 (From Nvidia-Smi), Clearml Agent 0.17.1 Detects Version 100 (I Can See From Experiments Logs:

just to be clear, multiple CUDA runtime version can coexist on a single machine, and the only thing that points to which one you are using when running an application are the library search paths (which can be set either with LD_LIBRARY_PATH , or, preferably, by creating a file under /etc/ld.so.conf.d/ which contains the path to your cuda directory and executing ldconfig )

3 years ago
0 Hi All, I'M Trying To Upgrade

just docker-compose up with the latest compose file from the repo

3 years ago
0 Hi All, I'M Trying To Upgrade

The legacy version worked just before I mv ed the folder but now (after reverting to the old name) that doesn't work also 😢

3 years ago
0 Hi All, I'M Trying To Upgrade

same thing 😞

3 years ago
0 Hi All, I'M Trying To Upgrade

Ok working now 🙂 had a mistake in the folder name

3 years ago
0 Hi All, I Have A Question Regarding Multi-Node Training Using The Clearml-Agent. What Is The Recommended Setup In This Case? Say I Have 3 Nodes With 3 Agents Running On Them. How Do I Make Sure They All Run The Same Job?

I see what you mean. So in a simple "all-or-nothing" solution I have to choose between potentially starving either the single node tasks (high priority + wait) or multi-node tasks (wait for a time when there are enough available agents and only then allocate the resource).

I actually meant NCCL. nvcc is the CUDA compiler 😅
NCCL communication can be both inter- and intra- node

2 years ago
0 Hi All, I Have A Question Regarding Multi-Node Training Using The Clearml-Agent. What Is The Recommended Setup In This Case? Say I Have 3 Nodes With 3 Agents Running On Them. How Do I Make Sure They All Run The Same Job?

I thought of some sort of gang-scheduling scheme should be implemented on top of the job.
Maybe the agents should somehow go through a barrier with a counter and wait there until enough agents arrived

2 years ago
Show more results compactanswers