Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
AdventurousButterfly15
Moderator
17 Questions, 77 Answers
  Active since 10 January 2023
  Last activity 7 months ago

Reputation

0

Badges 1

75 × Eureka!
0 Votes
2 Answers
563 Views
0 Votes 2 Answers 563 Views
How can I extend a dataset? On community edition I have an existing dataset, I want to add some files and make a new version. I tried just doing a Dataset()....
10 months ago
0 Votes
2 Answers
471 Views
0 Votes 2 Answers 471 Views
Is there a way to clear ClearML caches, maybe some command? My server ran out of space and I lost a whole weekend of training. My venvs-cache folder was over...
8 months ago
0 Votes
3 Answers
619 Views
0 Votes 3 Answers 619 Views
I trained a model, saved it. Now I am trying to access it from another machine, but the model url is a local path. How can I download models from Clearml?
10 months ago
0 Votes
22 Answers
701 Views
0 Votes 22 Answers 701 Views
clearml-session fails ssh tunneling. It does not use key auth, instead sets up some weird password and then fails to auth: Remote machine is ready Setting up...
one year ago
0 Votes
2 Answers
532 Views
0 Votes 2 Answers 532 Views
When I run a task with Dataset.get the agent requests the dataset from a weird url. adamastor.gaiavf.local in this case. 2022-10-03 17:50:17,556 - clearml.st...
one year ago
0 Votes
6 Answers
404 Views
0 Votes 6 Answers 404 Views
I am getting CLEARML Monitor: Could not detect iteration reporting, falling back to iterations as seconds-from-start . How do fix it? The docs are not helpfu...
7 months ago
0 Votes
2 Answers
738 Views
0 Votes 2 Answers 738 Views
Hey, loving ClearML so far. I create an agent with 1 gpu and I am sending a task to it. But it says that it couldn’t create a docker with gpu access. How can...
one year ago
0 Votes
2 Answers
469 Views
0 Votes 2 Answers 469 Views
When I try to run any task the agent tries to mount something vscode related: 683637074988 adamastor:gpuall INFO Executing: ['docker', 'run', '-t', '--gpus',...
10 months ago
0 Votes
5 Answers
458 Views
0 Votes 5 Answers 458 Views
Has anyone had success using clearml with huggingface models? I create my HF Trainer with the ClearML callback, but the only thing I get in the logs is this ...
8 months ago
0 Votes
11 Answers
720 Views
0 Votes 11 Answers 720 Views
I am trying to do a remote execution of a test task, but it fails during env setup due to trying to install an obscure version of pytorch. Been trying to sol...
one year ago
0 Votes
3 Answers
571 Views
0 Votes 3 Answers 571 Views
When I set agent management to conda it tries to create envs with python 3.1 and fails. Executing Conda: /home/adamastor/anaconda3/bin/conda create --yes --m...
one year ago
0 Votes
1 Answers
598 Views
0 Votes 1 Answers 598 Views
I am saving a model with pickle, but it doesn’t show up as an artifact. Why? Task.init(..., output_uri=True) model = SklearnPipeline() ... pickle.dump(model,...
10 months ago
0 Votes
4 Answers
696 Views
0 Votes 4 Answers 696 Views
How can I stop clearml from uploading temporary models? I am running cross_validation, training a bunch of models in a loop like this: models = [] for X_trai...
10 months ago
0 Votes
5 Answers
483 Views
0 Votes 5 Answers 483 Views
I am training a model with Pytorch lightning. I save .ckpt checkpoints. But they never get uploaded to clearml! How do I make clearml detect the checkpoints?...
9 months ago
0 Votes
22 Answers
677 Views
0 Votes 22 Answers 677 Views
Why does my task execution freeze after pip installation (running agent in foreground mode)? The task is: from clearml import Task task = Task.init(project_n...
one year ago
0 Votes
14 Answers
658 Views
0 Votes 14 Answers 658 Views
ClearML task execution fails trying to pull data from Gitlab. The credentials are correct (username + access token), but I get this error: remote: HTTP Basic...
one year ago
0 Votes
21 Answers
638 Views
0 Votes 21 Answers 638 Views
one year ago
0 When I Do

I guess I am out of ideas. The config is wrong somewhere. Maybe double check all the configs? It’s taking the value from somewhere!

one year ago
0 I Am Getting

Version: 1.11.1

7 months ago
0 I Am Getting

I dont have a short version.
I am using community clearml. How do I find out my version?

7 months ago
0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

Also manually installing this torch version succeeds:
` (base) boris@adamastor:~$ python3.10 -m pip install /home/boris/.clearml/pip-download-cache/cu117/torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl
Processing ./.clearml/pip-download-cache/cu117/torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl
Requirement already satisfied: typing-extensions in ./miniconda3/lib/python3.10/site-packages (from torch==1.12.1+cu116) (4.3.0)
Installing collected packages: torch
Attempting uninstall: torch
...

one year ago
0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

Despite having manually installed this torch version, during task execution agent still tries to install it somehow and fails:
INFO:clearml_agent.commands.worker:Downloading " " to pip cache Collecting torch==1.12.1+cu116 File was already downloaded /home/boris/.clearml/pip-download-cache/cu117/torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl Successfully downloaded torch INFO:clearml_agent.commands.worker:Downloading " ` " to pip cache
Collecting torchvision==0.13.1+cu116
File was...

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

This issue was resolved by setting the correct clearml.conf (replacing localhost with a public hostname for the server) 🙂

one year ago
9 months ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

AgitatedDove14 With --debug I see that after installing packages there is an endless stream of this:
` Retrying (Retry(total=239, connect=239, read=240, redirect=240, status=240)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7fac842e8be0>: Failed to establish a new connection: [Errno 111] Connection refused',)': /auth.login
Retrying (Retry(total=238, connect=238, read=240, redirect=240, status=240)) after connection broken by 'NewConnec...

one year ago
0 I Am Training A Model With Pytorch Lightning. I Save

Looking through history I found this link: None

Tldr: ClearML doesn’t support lightning , but supports pytorch_lightning . Downgrading from the new interface to the old one fixed my issue

9 months ago
0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

CostlyOstrich36 in installed packages it has:
` # Python 3.10.6 | packaged by conda-forge | (main, Aug 22 2022, 20:41:22) [Clang 13.0.1 ]

Pillow == 9.2.0
clearml == 1.7.1
minio == 7.1.12
numpy == 1.23.1
pandas == 1.5.0
scikit_learn == 1.1.2
tensorboard == 2.10.1
torch == 1.12.1
torchvision == 0.13.1
tqdm == 4.64.1 `Which is the same as I have locally and on the server that runs clearml-agent

one year ago
0 When I Try To Run Any Task The Agent Tries To Mount Something Vscode Related:

The issue disappeared after I switched from docker mode to pip mode

10 months ago
0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

I don’t understand. The current cuda version is 11.7. Installed pytorch version is 1.12.1. Torch can access GPUs, all is fine.
Why does it try to install a different torch version?
` (base) boris@adamastor:~$ nvidia-smi
Fri Oct 7 14:16:24 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 515.65.01 Driver Version: 515.65.01 CUDA Version: 11.7 |
|-------------------------------+----------------------+----------------------+
| GPU Name ...

one year ago
0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

Here’s the error I get:
https://justpaste.it/7aom5

It’s trying to downgrade pytorch to 1.12.1 for some reason (why?) using a version for an outdated CUDA (I have 11.7, it tries to use pytorch for CUDA 11.6). Finally crashes

one year ago
0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

I understand the idea, it makes sense. But it does not seem to work as intended. Why does it try to install a different pytorch? And why does it fail if it works if I do it manually? The env that’s executing the task has the same pytorch

Anyways, what should I do? So far my workers have not executed a single task, it always breaks with these env errors

one year ago
0 I Am Getting

I have not tired unfortunately

7 months ago
Show more results compactanswers