Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
JitteryCoyote63
Moderator
214 Questions, 1021 Answers
  Active since 10 January 2023
  Last activity 6 months ago

Reputation

0

Badges 1

979 × Eureka!
0 Votes
7 Answers
966 Views
0 Votes 7 Answers 966 Views
Hi, one more question: When creating a task with Task.init(), we can specify the task_type . Now when using Task.clone(), we cannot specify the task_type (is...
4 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
Hi, how does agent.enable_git_ask_pass works? I am using the clearml-agent in docker mode and my experiment is stuck at downloading a private dependency: Clo...
one year ago
0 Votes
2 Answers
902 Views
0 Votes 2 Answers 902 Views
3 years ago
0 Votes
5 Answers
887 Views
0 Votes 5 Answers 887 Views
Hey again ๐Ÿ˜ I am migrating my trains-server to AWS and I would like now to have secure accounts (with password). But I don't want to loose the current users...
4 years ago
0 Votes
5 Answers
939 Views
0 Votes 5 Answers 939 Views
Hi, It seems that the package_manager.pip_version has been removed from the https://allegro.ai/docs/references/trains_ref/#agent , although still being shown...
4 years ago
0 Votes
3 Answers
934 Views
0 Votes 3 Answers 934 Views
Hi, I have several long running experiments failing with Process failed, exit code -9 and no other error with clearml 1.0.4 and clearml-agent 1.0.0, what cou...
3 years ago
0 Votes
12 Answers
1K Views
0 Votes 12 Answers 1K Views
Hi there! Is there an easy way to retrieve the site-package directory that was created by an agent from inside a task? Eg. task = Task.init(...) task.add_req...
2 years ago
0 Votes
1 Answers
572 Views
0 Votes 1 Answers 572 Views
Quick question: Why does clearml-server 1.15.0 api-server python package require ES 8.12.0 but the docker-compose references ES 7.17.18?
7 months ago
0 Votes
5 Answers
978 Views
0 Votes 5 Answers 978 Views
Hi, from within an experiment, how can I intercept the signal that the experiment was aborted and execute a cleanup function? I tried to intercept SIGINT and...
2 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
The “Manage queue” option in the right tab on a queued experiment is broken in v1.0 (it does nothing)
3 years ago
0 Votes
30 Answers
968 Views
0 Votes 30 Answers 968 Views
Hi there ๐Ÿ™‚ Task.get_parameters() returns an empty dict from within a trains-agent task being executed. When I execute it outside, it works properly. Is it i...
4 years ago
0 Votes
4 Answers
963 Views
0 Votes 4 Answers 963 Views
Hey again ๐Ÿ˜ Is it possible to run multiple agents on the same machine? And with some in services mode?
4 years ago
0 Votes
22 Answers
1K Views
0 Votes 22 Answers 1K Views
Hi, I would like to switch from the elastic-search service in the docker-compose of the clearml-server to an externally managed, scalable elastic-search clus...
3 years ago
0 Votes
1 Answers
1K Views
0 Votes 1 Answers 1K Views
Hi, how can I easily start a shell script from within an experiment and have its logs (stdin/err) logged in clearml?
2 years ago
0 Votes
2 Answers
961 Views
0 Votes 2 Answers 961 Views
Are the env variables passed to trains-agent available in experiments run by this trains-agent?
4 years ago
0 Votes
6 Answers
961 Views
0 Votes 6 Answers 961 Views
Hi, Is there a way to stop a clearml-agent from within an experiment? Or block it to prevent it running any other task?
3 years ago
0 Votes
23 Answers
1K Views
0 Votes 23 Answers 1K Views
Hi, I started a trains-agent (0.15) in services mode (full command: trains-agent daemon --services-mode --detached --queue services --create-queue --docker u...
4 years ago
0 Votes
1 Answers
1K Views
0 Votes 1 Answers 1K Views
Hi there, any plan/benefit to support virtualenv= 20 ?
4 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
Hi, I am using the aws autoscaler and getting the following error while trying to spin up spot instances: 2021-08-16 17:18:48 Spinning new instance type=v100...
3 years ago
0 Votes
15 Answers
976 Views
0 Votes 15 Answers 976 Views
Hi, how can I get the logs from the pytorch ignite early stopping handler to be logged in clearml?
3 years ago
0 Votes
5 Answers
932 Views
0 Votes 5 Answers 932 Views
Hi, is it possible to disable some of the system metrics monitored? and also downsample the rate of logging?
3 years ago
0 Votes
0 Answers
902 Views
0 Votes 0 Answers 902 Views
(sorry I pinned the message accidentally ๐Ÿ˜… )
4 years ago
0 Votes
5 Answers
981 Views
0 Votes 5 Answers 981 Views
3 years ago
0 Votes
2 Answers
884 Views
0 Votes 2 Answers 884 Views
Hi, in the Metric Snapshot graph, is it possible to scale the Y axis to [y_min *0.9, y_max * 1,1] ? currently all my values are flat at the same ~y and it is...
3 years ago
0 Votes
27 Answers
967 Views
0 Votes 27 Answers 967 Views
Hi there, I found a memory leak in Logger.report_matplotlib_figure . I was constantly running out of memory when training my models so I decided to spend som...
one year ago
0 Votes
5 Answers
907 Views
0 Votes 5 Answers 907 Views
How can I do the following? (basically, filtering by task type) Task.get_tasks(project_name="my-project", task_name="my-task", task_filter=dict(type="trainin...
4 years ago
0 Votes
3 Answers
947 Views
0 Votes 3 Answers 947 Views
โš ๏ธ Hi there, I recently updated clearml server to 1.7.0, and found the following critical regression: When I reset an experiment, it is actually deleted ๐Ÿ˜ต ,...
one year ago
0 Votes
8 Answers
944 Views
0 Votes 8 Answers 944 Views
Hi, is it possible to pass temporary IAM role to the web app could access?
3 years ago
0 Votes
3 Answers
978 Views
0 Votes 3 Answers 978 Views
Hi, in a subproject, would it be possible to hide the parent project if it is empty?
3 years ago
0 Votes
7 Answers
922 Views
0 Votes 7 Answers 922 Views
Hi, is there a way to get some stats about the use of workers? I would like to know, over the past 3 months: Number of training hours per user Number of trai...
3 years ago
Show more results questions
0 Hi, I Just Updated Clearml Server 1.0 Using

This is what I get, when I am connected and when I am logged out (by clearing cache/cookies)

3 years ago
0 Hi, I Just Updated Clearml Server 1.0 Using

It worked with clearml server 0.17

3 years ago
0 Hi, I Just Updated Clearml Server 1.0 Using

I added the pass_hashed and restarted the server, still get the same problem

3 years ago
4 years ago
0 Hello, I Am Getting `Valueerror: Could Not Get Access Credentials For '

region is empty, I never entered it and it worked

4 years ago
0 Hello, I Am Getting `Valueerror: Could Not Get Access Credentials For '

So most likely trains was masking the original error, it might be worth investigating to help other users in the future

4 years ago
0 Hi Guys, With The New Venv Caching Available In Clearml, I Have The Following Problem: I Force My Pip Requirements To Be:

AgitatedDove14 The first time it installs and create the cache for the env, the second time it fails with:
Applying uncommitted changes ERROR: Directory '.' is not installable. Neither 'setup.py' nor 'pyproject.toml' found. clearml_agent: ERROR: Command '['/home/user/.clearml/venvs-builds.1/3.6/bin/python', '-m', 'pip', '--disable-pip-version-check', 'install', '-r', '/tmp/cached-reqsmncaxx45.txt']' returned non-zero exit status 1.

3 years ago
0 Hi, I Deleted Some Archived Experiments In Clearml Server 1.0 And The Popup In The Dashboard Showed “The Following Artifacts Were Not Deleted”, With A List Of Files That Are Under

These images are actually stored there and I can access them via the url shared above (the one written in the pop up message saying that these files could not be deleted)

3 years ago
3 years ago
0 Hi Again, I Am Trying To Make The Aws Autoscaler Work With Ec2 Instances, But It Fails To Setup The Agent In The Machine: The Logs Of The User-Data Script Show That It Fails Updating The Machine (See Below)

so what worked for me was the following startup userscript:
` #!/bin/bash
sleep 120
while sudo fuser /var/{lib/{dpkg,apt/lists},cache/apt/archives}/lock >/dev/null 2>&1; do echo 'Waiting for other instances of apt to complete...'; sleep 5; done
sudo apt-get update
while sudo fuser /var/{lib/{dpkg,apt/lists},cache/apt/archives}/lock >/dev/null 2>&1; do echo 'Waiting for other instances of apt to complete...'; sleep 5; done
sudo apt-get install -y python3-dev python3-pip gcc git build-essential...

3 years ago
0 Hi Again, I Am Trying To Make The Aws Autoscaler Work With Ec2 Instances, But It Fails To Setup The Agent In The Machine: The Logs Of The User-Data Script Show That It Fails Updating The Machine (See Below)

there is no error from this side, I think the aws autoscaler just waits for the agent to connect, which will never happen since the agent wonโ€™t start because the userdata script fails

3 years ago
0 Hi There, I Am Running A Clearml-Agent In Services Mode (With Docker) On A Machine With Two Disks: One With The Os (8Go, 91% Space Used) And One For The Data (100Go, 40% Space Used). When Executing The Auto-Scaler Task In This Agent, I Get The Following E

/data/shared/miniconda3/bin/python /data/shared/miniconda3/bin/clearml-agent daemon --services-mode --detached --queue services --create-queue --docker ubuntu:18.04 --cpu-only

3 years ago
3 years ago
0 Hi, Is There A Way To Stop A Clearml-Agent From Within An Experiment? Or Block It To Prevent It Running Any Other Task?

My use case it: in a spot instance marked for termination after 2 mins by aws, I want to close a task and prevent the clearml-agent to pick up a new task after.

3 years ago
0 Hi Guys, Coming This Time To Share An Idea Of A Killer Feature For Clearml

I also discovered https://h2oai.github.io/wave/ last week, would be awesome to be able to deploy it in the same manner

3 years ago
0 Hi Guys For The Aws Auto-Scaler I Need To Access Aws Ssm Or Create .Env File Locally When Using The Init Script. Has Anyone Done This?

I would probably leave it to the ClearML team to answer you, I am not using the UI app and for me it worked just well with different regions. Maybe check permissions of the key/secrets?

3 years ago
0 Hi, If I Am Starting My Training With The Following Command:

btw I see in the pytorch_distributed_example I see that you average_gradients , but from pytorch https://pytorch.org/tutorials/beginner/dist_overview.html it says:
DDP takes care of gradient communication to keep model replicas synchronized and overlaps it with the gradient computations to speed up training.

3 years ago
3 years ago
0 Hi, I Deleted All Archived Experiments In A Project And I Just Realized All Experiments Of All Projects Were Deleted (Clearml Server V1.0.0)

Restarting the server ( docker-compose down then docker-compose up ) solved the problem ๐Ÿ˜Œ All experiments are back

3 years ago
0 Is It Possible To Run An Agent, Listen To The Services Queue Without Using Docker?

That gave me

Running in Docker mode (v19.03 and above) - using default docker image: nvidia/cuda running python3

Building Task 94jfk2479851047c18f1fa60c1364b871 inside docker: ubuntu:18.04

Starting docker build
docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]].
ERRO[0000] error waiting for container: context canceled

4 years ago
0 Hi Guys, Following Up On This

AgitatedDove14 This looks awesome! Unfortunately this would require a lot of changes in my current code, for that project I found a workaround ๐Ÿ™‚ But I will surely use it for the next pipelines I will build!

4 years ago
Show more results compactanswers