Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
ShinyRabbit94
Moderator
6 Questions, 34 Answers
  Active since 10 January 2023
  Last activity 19 days ago

Reputation

0

Badges 1

34 × Eureka!
0 Votes
4 Answers
532 Views
0 Votes 4 Answers 532 Views
2 years ago
0 Votes
16 Answers
537 Views
0 Votes 16 Answers 537 Views
Hello, I am trying to run the clearml-agent in docker mode. I use this command to start it : sudo clearml-agent daemon --cpu-only --queue training_queue --do...
2 years ago
0 Votes
10 Answers
633 Views
0 Votes 10 Answers 633 Views
Hello, I am running my own instance of the clearml-server. All works as expected, but sometimes my training tasks get stuck for 40+ minutes (while usually ta...
2 years ago
0 Votes
21 Answers
656 Views
0 Votes 21 Answers 656 Views
Hello! When I delete Tasks, Models or Datasets from the fileserver's UI, the associated artifacts (in /opt/clearml/data/fileserver ) are not deleted. Any ide...
2 years ago
0 Votes
7 Answers
589 Views
0 Votes 7 Answers 589 Views
Hello! The agent-services present in ClearML server's docker-compose is only for cleanup tasks, right ? For training I would need to run another instance of ...
2 years ago
0 Votes
16 Answers
513 Views
0 Votes 16 Answers 513 Views
Hello ! When running Dataset.get the wrong file_server api is being used. This is the content of my clearml.conf api { # Notice: 'host' is the api server (de...
2 years ago
0 Hello ! When Running

of course, I am checking using the env command

2 years ago
0 Hello, I Am Trying To Run The

I tried to fix the python binary in the config as well :
agent.python_binary = /opt/venv/bin/python3where :
/opt/venv/bin/python3is the output of which python ran inside a docker container using my image.
In the clearml-agent logs I see this :
/root/.clearml/venvs-builds/3.8/bin/python -u /root/.clearml/venvs-builds/3.8/code/train.pySo I don't know if it's using the same python version or not.

2 years ago
0 Hello ! When Running

I am very confused now, I tried switch to my local machine and change the clearml.conf.
It only partly worked :
Dataset.list_datasets() returns the correct list (from the remote server).
But Dataset.get(dataset_id="ce2abe847e004ac282cc435bfa9c4bd5")
gives me :
2021-12-20 13:46:39,404 - clearml.storage - ERROR - Could not download ` , err: Failed getting object localhost:8081/annotation_dataset/annotation.ce2abe847e004ac282cc435bfa9c4bd5/artifacts/state/state.json (404): <!DO...

2 years ago
0 Hello ! When Running

Ah! That's it, thank you very much ! I did not know this was an issue. I though the dataset was only linked to the fileserver and not to the specific url used to upload it.

2 years ago
0 Hello ! When Running

I was looking at the code of the Dataset class, but I could not find where the files_server is retrieved.

2 years ago
0 Hello ! When Running

What is the proper way to change a clearml.conf ?

2 years ago
0 Hello ! When Running

here is the command I am using :
sudo docker run -it -v /home/ubuntu/app/:/app/ -v /home/ubuntu/folder/clearml.conf:/root/clearml.conf --network "clearml_backend" my_image bash

2 years ago
0 Hello ! When Running

there is nothing in the env

2 years ago
0 Hello ! When Running

I did, I copy pasted the config from within the docker

2 years ago
0 Hello! The Agent-Services Present In Clearml Server'S Docker-Compose Is Only For Cleanup Tasks, Right ? For Training I Would Need To Run Another Instance Of Clearml-Agent Alongside The Docker-Compose ?

Thank you! Is there a way to test the agent on a machine without GPU ?
When running this little script, I can see my agent installing the requirements, but it does not seem to ever start running the task.
task = Task.create( project_name="train", task_name="train", requirements_file="./requirements.txt", repo="") task.set_script(entry_point="./test.py") Task.enqueue(task, queue_name="training_queue")The logs are as follows :
` Starting Task ...

2 years ago
0 Hello! The Agent-Services Present In Clearml Server'S Docker-Compose Is Only For Cleanup Tasks, Right ? For Training I Would Need To Run Another Instance Of Clearml-Agent Alongside The Docker-Compose ?

It seems the agent does not like working with scripts located inside a git repository, I moved the requirements and the script in a folder without a .git and it works now, thank you!

2 years ago
0 Hello! The Agent-Services Present In Clearml Server'S Docker-Compose Is Only For Cleanup Tasks, Right ? For Training I Would Need To Run Another Instance Of Clearml-Agent Alongside The Docker-Compose ?

Maybe it is some sort of misunderstanding from my side ? I thought :
Task.enqueue(task, queue_name="training_queue")is what starts the execution of the task. Do I need another function ?

2 years ago
0 Hello, I Am Running My Own Instance Of The Clearml-Server. All Works As Expected, But Sometimes My Training Tasks Get Stuck For 40+ Minutes (While Usually Taking About 5 Minutes) With The Following Log :

The fileserver is remote, but the bandwidth is not an issue.
Is the automatic artifact storage of clearml async ? (meaning even if the task is finished it could still be uploading associated artifacts ?)

2 years ago
0 Hello, I Am Trying To Run The

Sorry for the late reply. It is indeed a venv, I though it would not be an issue since the PYTHONPATH and the PATH are both set to prioritize the venv. I'll try to create a more classic image.

2 years ago
0 Hello, I Am Trying To Run The

I noticed logs start as follows :
/usr/bin/python3.9 /usr/bin/python3.9: No module named pip /usr/local/bin/python3.8

2 years ago
0 Hello, I Am Trying To Run The

even thought when starting the worker I see this :
agent.python_binary = /opt/venv/bin/python3

2 years ago
0 Hello, I Am Trying To Run The

and agent.python_binary is empty.

2 years ago
2 years ago
0 Hello, I Am Trying To Run The

The logs continue like this :
` Summary - installed python packages:
pip:

  • attrs==20.3.0
  • backports.entry-points-selectable==1.1.1
  • certifi==2021.10.8
  • chardet==4.0.0
  • clearml==1.1.4
  • Cython==0.29.26
  • distlib==0.3.4
  • filelock==3.4.0
  • furl==2.1.3
  • future==0.18.2
  • idna==2.10
  • jsonschema==3.2.0
  • numpy==1.21.5
  • orderedmultidict==1.0.1
  • pathlib2==2.3.6
  • Pillow==8.4.0
  • platformdirs==2.4.0
  • psutil==5.8.0
  • pyhocon==0.3.59
  • PyJWT==2.0.1
  • pyparsing==2.4.7
  • pyrsistent==0.18.0
  • pyt...
2 years ago
0 Hello! When I Delete Tasks, Models Or Datasets From The Fileserver'S Ui, The Associated Artifacts (In

CostlyOstrich36 Yes, I am getting the exact same error as Malcolm (thanks for the link!) except I can see the URLs of my artifacts instead of undefined .
SuccessfulKoala55 I am running a self-hosted server. I installed it about 3 months ago, so I would assume my current version is v1.1.1 , how can I check for sure ?

2 years ago
2 years ago
0 Hello! When I Delete Tasks, Models Or Datasets From The Fileserver'S Ui, The Associated Artifacts (In

For example to create a dataset, I use this :
from clearml import Dataset ds = Dataset.create(dataset_project='XX', dataset_name='XX') ds.add_files( path='/tmp/tmpbk2g6c3h' ) ds.upload() ds.finalize()

2 years ago
0 Hello! When I Delete Tasks, Models Or Datasets From The Fileserver'S Ui, The Associated Artifacts (In

I can provide a screenshot, but I'd need to hide the urls 😅 and if do so it would look just like Malcolm's screenshot.

2 years ago
0 Hello! When I Delete Tasks, Models Or Datasets From The Fileserver'S Ui, The Associated Artifacts (In

The URLs are correct, I can use them to download the dataset zip.

2 years ago
Show more results compactanswers