ZanySealion18
Moderator
13 Questions, 48 Answers
  Active since 03 November 2023
  Last activity 12 hours ago

Reputation

0

Badges 1

45 × Eureka!
0 Votes
0 Answers
695 Views
Hey folks, could anyone please take a look at this thread? Thanks!
one year ago
0 Votes
7 Answers
442 Views
6 months ago
0 Votes
6 Answers
769 Views
My agent (running on GCP in docker mode) is having trouble with git fetch --all. I'm using SSH for authentication, however, known_hosts doesn't seem to be pa...
12 months ago
0 Votes
8 Answers
762 Views
11 months ago
0 Votes
3 Answers
395 Views
After publishing task, one of the two models gets deleted from artifacts.
5 months ago
0 Votes
14 Answers
557 Views
6 months ago
0 Votes
11 Answers
994 Views
ClearML doesn't pick up model checkpoints automatically. Any idea what might be wrong? (code attached in the thread). Thanks
one year ago
0 Votes
3 Answers
698 Views
11 months ago
0 Votes
11 Answers
785 Views
I've been using ClearML on on-premise machines and would now like to deploy everything in GCP (deploy everything new from scratch, don't care about migrating...
12 months ago
0 Votes
8 Answers
558 Views
"clearml-data sync --folder ." doesn't work
6 months ago
0 Votes
1 Answers
8 Views
21 hours ago
0 Votes
1 Answers
42 Views
5 days ago
0 Votes
6 Answers
244 Views
Reporting NoneType scalars. Is it possible to report None somehow, to skip the point on a plot? If I just skip it, it will interpolate between the previous a...
4 months ago
0 ClearML (remote execution) sometimes doesn't "pick-up" GPU. After I rerun the task it picks it up. Seems random, doesn't happen too often (maybe once in 30-40 times) and I cannot seem to detect any pattern. Did anyone else notice this? Agents are VMs on G

It seems that task.set_base_docker must be called with docker_image as well (otherwise docker_arguments don't propagate), not sure if it's a bug or not, but I have a workaround now, thanks!

4 months ago
0 ClearML (remote execution) sometimes doesn't "pick-up" GPU. After I rerun the task it picks it up. Seems random, doesn't happen too often (maybe once in 30-40 times) and I cannot seem to detect any pattern. Did anyone else notice this? Agents are VMs on G

I know about clearml.conf but wanted to avoid ssh-ing through 50 instances to edit it.
task.set_base_docker does the job, but docker_arguments doesn't propagate if I leave docker_image as None (it just uses both image and arguments from clearml.conf of the agent). If I explicitly state docker_image and docker_arguments in task.set_base_docker it works fine.

4 months ago
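The workaround from the two posts above (always passing an explicit docker_image together with docker_arguments) could be sketched roughly as follows. Only task.set_base_docker, docker_image, docker_arguments and --privileged come from the thread; the image name and the helper names docker_settings and apply_docker_settings are illustrative assumptions.

```python
# Sketch only: the image name below is a placeholder, not from the thread.
ASSUMED_IMAGE = "nvidia/cuda:12.2.0-runtime-ubuntu22.04"


def docker_settings(image=ASSUMED_IMAGE, arguments="--privileged"):
    """Build keyword arguments for Task.set_base_docker.

    The image is always set explicitly because, per the thread, leaving
    docker_image as None made the agent fall back to its own clearml.conf
    values for both the image and the arguments.
    """
    if not image:
        raise ValueError("docker_image must be set explicitly")
    return {"docker_image": image, "docker_arguments": arguments}


def apply_docker_settings(task, **overrides):
    """Call set_base_docker on a ClearML Task (or any object with that method)."""
    settings = docker_settings(**overrides)
    task.set_base_docker(**settings)
    return settings
```

With a real task this would presumably be called as apply_docker_settings(task) right after Task.init, before the task is enqueued for remote execution.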
0 "clearml-data sync --folder ." doesn't work

Once I used clearml-data add --folder *, everything worked correctly (though all files, recursively, ended up in the root; luckily they were all named differently).

6 months ago
0 I have the following setup: instance-1 with ClearML server hosted in GCP, visible only through internal network, instance-2 with ClearML agent hosted in GCP, visible only through internal network, reverse-nginx-proxy hosted in GCP acting as interface betw

Additional info:
-Public URL uses HTTPS, internal traffic doesn't.
-clearml.storage fails while trying to fetch None ...

Meaning it just replaced the internal IP with the URL at some point for some reason, it doesn't exist in that form anywhere in any configs (http and public URL).

11 months ago
4 months ago
0 "clearml-data sync --folder ." doesn't work

OSX 12.5.1
Python 3.8.1.
Clearml 1.13.1

"clearml-data add --folder ./*" always flattens everything, I have that reproducible 100%.

5 months ago
0 "clearml-data sync --folder ." doesn't work

I have a dataset of ~24GB and I've tried multiple times uploading it with the sync function.

  • The cache doesn't work, it attempts to download the dataset every time.
  • It "misses" some files somehow. So once the job runs it fails due to missing files.
  • I've run verify afterwards (from the machine I used to upload the data) and it says it's all good. However, once I inspect the zip files on the server (look for the files in the specific zip the state json says they're in) the files are indee...
6 months ago
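Since verify reports success while files are missing after download, one independent check is to compare the source folder with a freshly downloaded copy of the dataset. The sketch below is a rough illustration, not a diagnosis of the underlying issue; the helper names are hypothetical, and it assumes Dataset.get(...).get_local_copy() returns the path of the extracted dataset folder.

```python
from pathlib import Path


def relative_files(root):
    """Return the set of file paths under root, relative to root (POSIX style)."""
    root = Path(root)
    return {p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file()}


def missing_files(source_root, downloaded_root):
    """Files present in the source folder but absent from the downloaded copy."""
    return sorted(relative_files(source_root) - relative_files(downloaded_root))


def check_against_clearml(source_root, dataset_id):
    """Download a dataset and list source files that did not make it into it."""
    # Imported lazily so the helpers above work without clearml installed.
    from clearml import Dataset

    local_copy = Dataset.get(dataset_id=dataset_id).get_local_copy()
    return missing_files(source_root, local_copy)
```

An empty list from check_against_clearml would mean every source file is present in the downloaded copy; it only compares paths and does not compare file contents.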
0 "clearml-data sync --folder ." doesn't work

Single version. The issue seems to be the creation. If I use "clearml-data sync --folder ." it says it uploaded all the files. Running "clearml-data verify --folder ." says it's all good. Metadata on the WebUI reports the expected number of files. However, once I extract the zips (or download the dataset through Python API or CLI) not all the files are there.

"clearml-data add --folder ./*" seems to fix this issue though it doesn't preserve my directory structure so I'd have to write a scrip...

5 months ago
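A script of the kind mentioned above might add each top-level entry separately while keeping directory names, instead of letting the shell expand ./* into a flat list. This is only a sketch under assumptions: the helper names are hypothetical, and it assumes that Dataset.add_files places a folder's contents under the given dataset_path, keeping their relative layout. It has not been verified against the behaviour reported in the thread.

```python
from pathlib import Path


def plan_uploads(root):
    """List (local_path, dataset_path) pairs for the top-level entries of root.

    Directories keep their own name as dataset_path so their contents are not
    flattened into the dataset root; top-level files go to the root (None).
    """
    plan = []
    for entry in sorted(Path(root).iterdir()):
        dataset_path = entry.name if entry.is_dir() else None
        plan.append((str(entry), dataset_path))
    return plan


def add_preserving_structure(dataset, root):
    """Add every top-level entry of root to a ClearML Dataset-like object."""
    for local_path, dataset_path in plan_uploads(root):
        dataset.add_files(path=local_path, dataset_path=dataset_path)
```

With the real SDK, dataset would be the object returned by Dataset.create(...), followed by the usual upload and finalize calls.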
0 After publishing task, one of the two models gets deleted from artifacts.

I'll try to reproduce it and will get back to you. The HPO task (parent of this task) was indeed deleted, but that shouldn't matter? One of the models was deleted but the other one wasn't.

5 months ago
0 ClearML doesn't pick up model checkpoints automatically. Any idea what might be wrong? (code attached in the thread). Thanks

No worries, sorry for pinging, was just making sure you (or anyone else who might help) don't miss it 🙂
I use Task.add_requirements("requirements.txt") right before the Task.init.
In main, I parse command-line arguments, call add_requirements, initialize the Task and call execute_remotely. After that it's all pretty much the usual workflow: initialize the model, set up dataloaders and the optimizer, and run the training. I'm using pytorch-ignite and have model checkpoint made on validation evaluator COMPL...

one year ago
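The call order described in the post above (add_requirements, then Task.init, then execute_remotely) can be written down as a small sketch. The function name bootstrap and all project, task and queue names are hypothetical; the Task class is passed in as a parameter so the ordering can be exercised without a ClearML server.

```python
def bootstrap(task_cls, project_name, task_name, queue_name,
              requirements_file="requirements.txt"):
    """Run the setup steps in the order described in the post."""
    # 1. Register the requirements file before the task is created.
    task_cls.add_requirements(requirements_file)
    # 2. Create (or attach to) the task.
    task = task_cls.init(project_name=project_name, task_name=task_name)
    # 3. Hand execution over to an agent listening on the given queue.
    #    With the real SDK the local process exits here by default.
    task.execute_remotely(queue_name=queue_name)
    return task
```

With the real SDK this would be invoked as bootstrap(Task, ...) after "from clearml import Task"; model, dataloader and checkpoint setup would follow the call, as in the workflow the post describes.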
0 ClearML (remote execution) sometimes doesn't "pick-up" GPU. After I rerun the task it picks it up. Seems random, doesn't happen too often (maybe once in 30-40 times) and I cannot seem to detect any pattern. Did anyone else notice this? Agents are VMs on G

Found this, seems to be exactly this: None
It appears that running docker as --privileged resolves the issue which is easier for me than to edit all of the instances I've already created. Is there an easy way to add a docker argument in the python script?
I've tried task.set_base_docker(docker_arguments="--privileged") right after Task.init but it doesn't seem to work.
Thanks!

4 months ago
0 I've been using ClearML on on-premise machines and would now like to deploy everything in GCP (deploy everything new from scratch, don't care about migrating the data). I've followed tutorial

Having a bit of trouble with this one (sorry for possibly dumb questions).
Are there any docs on how to add certs to the docker image? I see this ( None ) which is where letsencrypt points me to, but I'm not sure what's the proper way to do this on the webapp docker (I'd assume there's a non-hacky way to do it as others are using the same setup I'm trying to make work I guess)

12 months ago
0 I've been using ClearML on on-premise machines and would now like to deploy everything in GCP (deploy everything new from scratch, don't care about migrating the data). I've followed tutorial

Probably not, I'm trying to access it via external IP. Could you point me to instructions for that in the docs, I don't remember seeing it anywhere? Thanks!

12 months ago
0 I've been using ClearML on on-premise machines and would now like to deploy everything in GCP (deploy everything new from scratch, don't care about migrating the data). I've followed tutorial

To make sure I understand: I need to set up a domain with a cert and it should work, with no additional ClearML config required?

12 months ago
0 Reporting NoneType scalars.

Got it, thanks!

4 months ago
0 Reporting NoneType scalars.

How would that be reflected on the plot?

4 months ago
0 Reporting NoneType scalars.

Got it. Is there any way to skip a point at some iteration? If I just don't report it at iteration t I'll get interpolation from t-1 to t+1.

4 months ago
0 ClearML (remote execution) sometimes doesn't "pick-up" GPU. After I rerun the task it picks it up. Seems random, doesn't happen too often (maybe once in 30-40 times) and I cannot seem to detect any pattern. Did anyone else notice this? Agents are VMs on G

I'll check the docker command next time this happens, thanks! For the machines, all of them have GPUs (and are in fact identical/cloned VMs) and if I rerun it and get the same exact machine again it works so it's some part of "GPU detection" or something, we'll know more hopefully once it happens again, thanks.

5 months ago