Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
AdventurousButterfly15
Moderator
17 Questions, 77 Answers
  Active since 10 January 2023
  Last activity 8 months ago

Reputation

0

Badges 1

75 × Eureka!
0 Votes
2 Answers
617 Views
0 Votes 2 Answers 617 Views
How can I extend a dataset? On community edition I have an existing dataset, I want to add some files and make a new version. I tried just doing a Dataset()....
11 months ago
0 Votes
3 Answers
623 Views
0 Votes 3 Answers 623 Views
When I set agent management to conda it tries to create envs with python 3.1 and fails. Executing Conda: /home/adamastor/anaconda3/bin/conda create --yes --m...
one year ago
0 Votes
1 Answers
685 Views
0 Votes 1 Answers 685 Views
I am saving a model with pickle, but it doesn’t show up as an artifact. Why? Task.init(..., output_uri=True) model = SklearnPipeline() ... pickle.dump(model,...
11 months ago
0 Votes
2 Answers
799 Views
0 Votes 2 Answers 799 Views
Hey, loving ClearML so far. I create an agent with 1 gpu and I am sending a task to it. But it says that it couldn’t create a docker with gpu access. How can...
one year ago
0 Votes
2 Answers
519 Views
0 Votes 2 Answers 519 Views
Is there a way to clear ClearML caches, maybe some command? My server ran out of space and I lost a whole weekend of training. My venvs-cache folder was over...
9 months ago
0 Votes
22 Answers
741 Views
0 Votes 22 Answers 741 Views
Why does my task execution freeze after pip installation (running agent in foreground mode)? The task is: from clearml import Task task = Task.init(project_n...
one year ago
0 Votes
4 Answers
754 Views
0 Votes 4 Answers 754 Views
How can I stop clearml from uploading temporary models? I am running cross_validation, training a bunch of models in a loop like this: models = [] for X_trai...
11 months ago
0 Votes
14 Answers
723 Views
0 Votes 14 Answers 723 Views
ClearML task execution fails trying to pull data from Gitlab. The credentials are correct (username + access token), but I get this error: remote: HTTP Basic...
one year ago
0 Votes
2 Answers
582 Views
0 Votes 2 Answers 582 Views
When I run a task with Dataset.get the agent requests the dataset from a weird url. adamastor.gaiavf.local in this case. 2022-10-03 17:50:17,556 - clearml.st...
one year ago
0 Votes
5 Answers
504 Views
0 Votes 5 Answers 504 Views
Has anyone had success using clearml with huggingface models? I create my HF Trainer with the ClearML callback, but the only thing I get in the logs is this ...
9 months ago
0 Votes
6 Answers
450 Views
0 Votes 6 Answers 450 Views
I am getting CLEARML Monitor: Could not detect iteration reporting, falling back to iterations as seconds-from-start . How do fix it? The docs are not helpfu...
8 months ago
0 Votes
22 Answers
760 Views
0 Votes 22 Answers 760 Views
clearml-session fails ssh tunneling. It does not use key auth, instead sets up some weird password and then fails to auth: Remote machine is ready Setting up...
one year ago
0 Votes
3 Answers
678 Views
0 Votes 3 Answers 678 Views
I trained a model, saved it. Now I am trying to access it from another machine, but the model url is a local path. How can I download models from Clearml?
11 months ago
0 Votes
5 Answers
525 Views
0 Votes 5 Answers 525 Views
I am training a model with Pytorch lightning. I save .ckpt checkpoints. But they never get uploaded to clearml! How do I make clearml detect the checkpoints?...
10 months ago
0 Votes
2 Answers
520 Views
0 Votes 2 Answers 520 Views
When I try to run any task the agent tries to mount something vscode related: 683637074988 adamastor:gpuall INFO Executing: ['docker', 'run', '-t', '--gpus',...
11 months ago
0 Votes
11 Answers
783 Views
0 Votes 11 Answers 783 Views
I am trying to do a remote execution of a test task, but it fails during env setup due to trying to install an obscure version of pytorch. Been trying to sol...
one year ago
0 Votes
21 Answers
696 Views
0 Votes 21 Answers 696 Views
one year ago
0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

I can telnet the port from my mac:
(base) *[main][~/Documents/plant_age]$ telnet 192.168.1.55 10022 Trying 192.168.1.55... Connected to 192.168.1.55. Escape character is '^]'. SSH-2.0-OpenSSH_8.4p1 Debian-5+deb11u1 ^C

one year ago
one year ago
0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

Locally I have a conda env with some packages and a basic requirements file.
I am running this thing:
` from clearml import Task, Dataset
task = Task.init(project_name='Adhoc', task_name='Dataset test')
task.execute_remotely(queue_name="gpu")

from config import DATASET_NAME, CLEARML_PROJECT
print('Getting dataset')

dataset_path = Dataset.get(
dataset_name=DATASET_NAME,
dataset_project=CLEARML_PROJECT,
).get_local_copy()#.get_mutable_local_copy(DATASET_NAME)

print('Dataset path', d...

one year ago
0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

CostlyOstrich36 in installed packages it has:
` # Python 3.10.6 | packaged by conda-forge | (main, Aug 22 2022, 20:41:22) [Clang 13.0.1 ]

Pillow == 9.2.0
clearml == 1.7.1
minio == 7.1.12
numpy == 1.23.1
pandas == 1.5.0
scikit_learn == 1.1.2
tensorboard == 2.10.1
torch == 1.12.1
torchvision == 0.13.1
tqdm == 4.64.1 `Which is the same as I have locally and on the server that runs clearml-agent

one year ago
0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

On the agent side it’s trying to install different pytorch versions (even though the env already has it all configured), then fails with torch_<something>.whl is not a valid wheel for this system

one year ago
0 Is There A Way To Get A Task'S Docker Container Id/Name? I'M Generally Interested In Resource Profiling Of Each Container, So I Noticed I Can Use

For a hacky way you can do docker ps and see the docker run command. I believe it contains the task id, so you can grep by task id

one year ago
one year ago
one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

Freezing means that after the pip packages installation, pictured on screenshot, nothing happens. This screen hangs forever. No other output anywhere, including the web UI

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

I guess this pip package installation happens as part of docker build

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

The image I am using is pytorch/pytorch:1.7.0-cuda11.0-cudnn8-devel

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

Definitely not, the machine has 5 TB and is a recent clear install

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

I tried it.
This time agent was run with docker image python ( https://hub.docker.com/_/python )

Gets stuck on
Installing collected packages: six, python-dateutil, pathlib2, psutil, attrs, pyrsistent, jsonschema, idna, chardet, certifi, urllib3, requests, PyYAML, pyparsing, pyjwt, pyhocon, orderedmultidict, furl, future, platformdirs, filelock, distlib, virtualenv, clearml-agent
ps aux inside the container reads
` (base) boris@adamastor:~$ docker exec -it angry_edison bash
root@041c0736c...

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

So the only process is something called /usr/local/bin/python3.10 -u -m clearml_agent execute .
So I guess pip install finished working
But the task is evidently not being executed.

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

Good idea. I can just ssh into the container of task execution, right?

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

Is there some minimal example of a docker env agent I can run, just to see that it works?

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

AgitatedDove14 With --debug I see that after installing packages there is an endless stream of this:
` Retrying (Retry(total=239, connect=239, read=240, redirect=240, status=240)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7fac842e8be0>: Failed to establish a new connection: [Errno 111] Connection refused',)': /auth.login
Retrying (Retry(total=238, connect=238, read=240, redirect=240, status=240)) after connection broken by 'NewConnec...

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

This issue was resolved by setting the correct clearml.conf (replacing localhost with a public hostname for the server) 🙂

one year ago
0 Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

AgitatedDove14 This example does not specify how to start a clearml-agent with docker such that it actually executes the task

one year ago
Show more results compactanswers