AdventurousButterfly15
Moderator
17 Questions, 77 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

75 × Eureka!
0 Votes
6 Answers
804 Views
I am getting CLEARML Monitor: Could not detect iteration reporting, falling back to iterations as seconds-from-start. How do I fix it? The docs are not helpfu...
one year ago
0 Votes
4 Answers
1K Views
How can I stop clearml from uploading temporary models? I am running cross_validation, training a bunch of models in a loop like this: models = [] for X_trai...
one year ago
0 Votes
2 Answers
856 Views
When I try to run any task the agent tries to mount something vscode related: 683637074988 adamastor:gpuall INFO Executing: ['docker', 'run', '-t', '--gpus',...
one year ago
0 Votes
2 Answers
913 Views
Is there a way to clear ClearML caches, maybe some command? My server ran out of space and I lost a whole weekend of training. My venvs-cache folder was over...
one year ago
0 Votes
21 Answers
1K Views
2 years ago
0 Votes
14 Answers
1K Views
ClearML task execution fails trying to pull data from Gitlab. The credentials are correct (username + access token), but I get this error: remote: HTTP Basic...
one year ago
0 Votes
22 Answers
1K Views
clearml-session fails ssh tunneling. It does not use key auth, instead sets up some weird password and then fails to auth: Remote machine is ready Setting up...
2 years ago
0 Votes
1 Answer
1K Views
I am saving a model with pickle, but it doesn’t show up as an artifact. Why? Task.init(..., output_uri=True) model = SklearnPipeline() ... pickle.dump(model,...
one year ago
0 Votes
3 Answers
1K Views
I trained a model, saved it. Now I am trying to access it from another machine, but the model url is a local path. How can I download models from Clearml?
one year ago
0 Votes
5 Answers
857 Views
Has anyone had success using clearml with huggingface models? I create my HF Trainer with the ClearML callback, but the only thing I get in the logs is this ...
one year ago
0 Votes
22 Answers
1K Views
Why does my task execution freeze after pip installation (running agent in foreground mode)? The task is: from clearml import Task task = Task.init(project_n...
2 years ago
0 Votes
11 Answers
1K Views
I am trying to do a remote execution of a test task, but it fails during env setup due to trying to install an obscure version of pytorch. Been trying to sol...
2 years ago
0 Votes
2 Answers
1K Views
How can I extend a dataset? On community edition I have an existing dataset, I want to add some files and make a new version. I tried just doing a Dataset()....
one year ago
0 Votes
5 Answers
854 Views
I am training a model with Pytorch lightning. I save .ckpt checkpoints. But they never get uploaded to clearml! How do I make clearml detect the checkpoints?...
one year ago
0 Votes
2 Answers
1K Views
Hey, loving ClearML so far. I create an agent with 1 gpu and I am sending a task to it. But it says that it couldn’t create a docker with gpu access. How can...
2 years ago
0 Votes
2 Answers
923 Views
When I run a task with Dataset.get the agent requests the dataset from a weird url. adamastor.gaiavf.local in this case. 2022-10-03 17:50:17,556 - clearml.st...
2 years ago
0 Votes
3 Answers
957 Views
When I set agent management to conda it tries to create envs with python 3.1 and fails. Executing Conda: /home/adamastor/anaconda3/bin/conda create --yes --m...
2 years ago
0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’t Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

CostlyOstrich36, under installed packages it has:
` # Python 3.10.6 | packaged by conda-forge | (main, Aug 22 2022, 20:41:22) [Clang 13.0.1 ]

Pillow == 9.2.0
clearml == 1.7.1
minio == 7.1.12
numpy == 1.23.1
pandas == 1.5.0
scikit_learn == 1.1.2
tensorboard == 2.10.1
torch == 1.12.1
torchvision == 0.13.1
tqdm == 4.64.1 `
This is the same as what I have locally and on the server that runs clearml-agent.

2 years ago
0 Is There A Way To Get A Task's Docker Container Id/Name? I'm Generally Interested In Resource Profiling Of Each Container, So I Noticed I Can Use

As a hacky workaround, you can run docker ps and look at the docker run command. I believe it contains the task ID, so you can grep for the task ID.
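The grep-by-task-ID idea can be sketched in Python roughly as follows. This is only a sketch: it assumes, as the answer does without confirming it, that the task ID appears somewhere in the container's command line. The helper names `find_containers` and `running_containers_for_task` are hypothetical, not part of ClearML or Docker.

```python
import subprocess

# Tab-separated container ID, name, and full command, one container per line.
PS_FORMAT = "{{.ID}}\t{{.Names}}\t{{.Command}}"


def find_containers(ps_output: str, task_id: str) -> list:
    """Return (container_id, name) pairs whose command mentions task_id."""
    matches = []
    for line in ps_output.splitlines():
        parts = line.split("\t", 2)
        # Skip malformed lines; keep only commands that contain the task ID.
        if len(parts) == 3 and task_id in parts[2]:
            matches.append((parts[0], parts[1]))
    return matches


def running_containers_for_task(task_id: str) -> list:
    """Query the local docker daemon; needs the docker CLI on PATH."""
    result = subprocess.run(
        ["docker", "ps", "--no-trunc", "--format", PS_FORMAT],
        capture_output=True,
        text=True,
        check=True,
    )
    return find_containers(result.stdout, task_id)
```

Once the container name is known, it can be passed to whatever profiling tool is in use. If the task ID turns out not to be in the command line, the list simply comes back empty.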

2 years ago
0 I Am Getting

Version: 1.11.1

one year ago
0 When I Try To Run Any Task The Agent Tries To Mount Something Vscode Related:

The issue disappeared after I switched from docker mode to pip mode

one year ago
0 I Am Training A Model With Pytorch Lightning. I Save

Looking through history I found this link: None

TL;DR: ClearML doesn’t support lightning, but supports pytorch_lightning. Downgrading from the new interface to the old one fixed my issue.
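To see which of the two import namespaces is present in an environment, something like the following can help. It is a minimal sketch using only the standard library; the function name is hypothetical, and it does not check anything about ClearML itself, only which packages are importable.

```python
import importlib.util


def installed_lightning_namespaces() -> list:
    """List which of the two Lightning import names can be found."""
    names = ("pytorch_lightning", "lightning")
    # find_spec returns None for a top-level module that is not installed.
    return [name for name in names if importlib.util.find_spec(name) is not None]


if __name__ == "__main__":
    print(installed_lightning_namespaces())
```

If both names show up, the training script's own import line decides which interface is actually in use.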

one year ago
0 I Am Getting

I don’t have a short version.
I am using community clearml. How do I find out my version?

one year ago
0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

Here’s the error I get:
https://justpaste.it/7aom5

It’s trying to downgrade pytorch to 1.12.1 for some reason (why?), using a build for an outdated CUDA version (I have 11.7, it tries to use pytorch for CUDA 11.6). Finally, it crashes.
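The version mismatch described here can be made explicit by comparing the CUDA tag in a wheel filename with the locally reported CUDA version. This is a small sketch with hypothetical helper names; it only parses strings and says nothing about why the agent picked that wheel.

```python
import re
from typing import Optional


def wheel_cuda_tag(wheel_name: str) -> Optional[str]:
    """Extract the CUDA local-version tag (e.g. 'cu116') from a wheel filename."""
    match = re.search(r"\+(cu\d+)", wheel_name)
    return match.group(1) if match else None


def cuda_version_to_tag(version: str) -> str:
    """Turn a CUDA version string such as '11.7' into a tag such as 'cu117'."""
    major, minor = version.split(".")[:2]
    return f"cu{major}{minor}"


if __name__ == "__main__":
    wheel = "torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl"
    print(wheel_cuda_tag(wheel), cuda_version_to_tag("11.7"))
```

For the wheel name that appears elsewhere in this thread and a local CUDA version of 11.7, the two tags differ, which matches what the post reports.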

2 years ago
0 When I Do

I guess I am out of ideas. The config is wrong somewhere. Maybe double check all the configs? It’s taking the value from somewhere!

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

The image I am using is pytorch/pytorch:1.7.0-cuda11.0-cudnn8-devel

2 years ago
0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

I mean if I enter my host machine ssh password it works. But we will disable password auth in future, so it’s not an option

2 years ago
0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

The agent is started with this command:
clearml-agent --debug daemon --queue gpu --gpus 0 --foreground --docker <gitlab org registry>/project-precog/clearml_config

2 years ago
0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

I can telnet the port from my mac:
(base) *[main][~/Documents/plant_age]$ telnet 192.168.1.55 10022
Trying 192.168.1.55...
Connected to 192.168.1.55.
Escape character is '^]'.
SSH-2.0-OpenSSH_8.4p1 Debian-5+deb11u1
^C

2 years ago
0 I Am Getting

I have not tried, unfortunately.

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

I tried it.
This time the agent was run with the python docker image ( https://hub.docker.com/_/python )

Gets stuck on
Installing collected packages: six, python-dateutil, pathlib2, psutil, attrs, pyrsistent, jsonschema, idna, chardet, certifi, urllib3, requests, PyYAML, pyparsing, pyjwt, pyhocon, orderedmultidict, furl, future, platformdirs, filelock, distlib, virtualenv, clearml-agent
ps aux inside the container reads
` (base) boris@adamastor:~$ docker exec -it angry_edison bash
root@041c0736c...

2 years ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

I guess this pip package installation happens as part of docker build

2 years ago
0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

Also manually installing this torch version succeeds:
` (base) boris@adamastor:~$ python3.10 -m pip install /home/boris/.clearml/pip-download-cache/cu117/torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl
Processing ./.clearml/pip-download-cache/cu117/torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl
Requirement already satisfied: typing-extensions in ./miniconda3/lib/python3.10/site-packages (from torch==1.12.1+cu116) (4.3.0)
Installing collected packages: torch
Attempting uninstall: torch
...

one year ago
2 years ago
0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

On the agent side it’s trying to install different pytorch versions (even though the env already has it all configured), then fails with torch_<something>.whl is not a valid wheel for this system

2 years ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

Is there some minimal example of a docker env agent I can run, just to see that it works?

2 years ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

This issue was resolved by setting the correct clearml.conf (replacing localhost with a public hostname for the server) 🙂
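A quick way to spot this kind of misconfiguration is to scan the config text for server URLs that point at localhost. The following is a rough sketch, not a ClearML tool: the function name is hypothetical, and it treats the file as plain text rather than parsing it properly.

```python
import re

# Hostnames that only resolve on the machine running the server itself.
LOCAL_HOSTS = ("localhost", "127.0.0.1")

# Capture the host part of any http(s) URL, stopping at port, path, or quote.
URL_PATTERN = re.compile(r"""https?://([^\s:/"']+)""")


def local_only_urls(conf_text: str) -> list:
    """Return the scheme-and-host prefix of every URL that points at localhost."""
    found = []
    for match in URL_PATTERN.finditer(conf_text):
        if match.group(1) in LOCAL_HOSTS:
            found.append(match.group(0))
    return found
```

Feeding it the contents of the agent machine's clearml.conf should return an empty list when every server URL uses a hostname reachable from inside the container.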

2 years ago
0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

I don’t understand. The current cuda version is 11.7. Installed pytorch version is 1.12.1. Torch can access GPUs, all is fine.
Why does it try to install a different torch version?
` (base) boris@adamastor:~$ nvidia-smi
Fri Oct 7 14:16:24 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 515.65.01 Driver Version: 515.65.01 CUDA Version: 11.7 |
|-------------------------------+----------------------+----------------------+
| GPU Name ...

one year ago