Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
JitteryCoyote63
Moderator
214 Questions, 1021 Answers
  Active since 10 January 2023
  Last activity 7 months ago

Reputation

0

Badges 1

979 × Eureka!
0 Votes
2 Answers
972 Views
0 Votes 2 Answers 972 Views
Hi, how can I search an old experiment based on its commit hash?
one year ago
0 Votes
3 Answers
959 Views
0 Votes 3 Answers 959 Views
Hi, I have several long running experiments failing with Process failed, exit code -9 and no other error with clearml 1.0.4 and clearml-agent 1.0.0, what cou...
3 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
Hey, I have one question regarding the cleanup_service task in the DevOps project: Does it assume that the agent in services mode is in the trains-server mac...
4 years ago
0 Votes
1 Answers
606 Views
0 Votes 1 Answers 606 Views
Quick question: Why does clearml-server 1.15.0 api-server python package require ES 8.12.0 but the docker-compose references ES 7.17.18?
8 months ago
0 Votes
0 Answers
1K Views
0 Votes 0 Answers 1K Views
Hi, I encountered a bug on clearml-server 1.0.1: I tried to add in a project page a custom column in +HYPER PARAMETERS > Args > queue and got an error pop up...
3 years ago
0 Votes
5 Answers
916 Views
0 Votes 5 Answers 916 Views
Hey again 😁 I am migrating my trains-server to AWS and I would like now to have secure accounts (with password). But I don't want to loose the current users...
4 years ago
0 Votes
1 Answers
965 Views
0 Votes 1 Answers 965 Views
Hi there, is it safe to use ClearML (trains >= 0.17) with the trains ignite handler? Should we wait for the update on their side?
3 years ago
0 Votes
12 Answers
926 Views
0 Votes 12 Answers 926 Views
Hey, would it possible to add an option to make task.upload_artifact() blocking? (Not running in background)
4 years ago
0 Votes
2 Answers
923 Views
0 Votes 2 Answers 923 Views
Hi, is it possible to get an artifact from a Task and force not using local cache? The task itself updated the artifact in the meantime and I cannot get the ...
3 years ago
0 Votes
2 Answers
926 Views
0 Votes 2 Answers 926 Views
3 years ago
0 Votes
3 Answers
961 Views
0 Votes 3 Answers 961 Views
Hi ClearML team members! Is there any progress made on the clearml-serving repo? I’d love to start using it but I lack a straightforward get started example....
3 years ago
0 Votes
5 Answers
979 Views
0 Votes 5 Answers 979 Views
Hi there, I would like to report a bug with the resizing of the columns in the projects view: it doesn’t work as expected. Please look at the behavior of the...
3 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
Hi, I would like to report something else weird in the clearml-agent 1.5.1 running in docker mode: In the logs, when it dumps its config, it writes: docker_c...
one year ago
0 Votes
15 Answers
1K Views
0 Votes 15 Answers 1K Views
Hi, I restarted my clearml-server (1.1.0) and the login page always redirects me to the login page. I am using fixed users in config files. In the logs of th...
3 years ago
0 Votes
11 Answers
1K Views
0 Votes 11 Answers 1K Views
Hi, I have a question regarding the aws-autoscaler: am I understanding correctly that: max_idle_time_min=5 max_spin_up_time_min=10 polling_interval_time_min=...
3 years ago
0 Votes
1 Answers
1K Views
0 Votes 1 Answers 1K Views
3 years ago
0 Votes
4 Answers
950 Views
0 Votes 4 Answers 950 Views
Hey, I would like my experiment to call at some point a CLI program installed as a dependency of the experiment. Here is what I do: myTask = Task.init(...) i...
4 years ago
0 Votes
3 Answers
1K Views
0 Votes 3 Answers 1K Views
Hi, I am trying to update the aws_autoscaler to the latest version on the master branch. I simply changed the commit id in the experiment and run it, this ga...
3 years ago
0 Votes
5 Answers
941 Views
0 Votes 5 Answers 941 Views
Hi guys, I would like to start using the AWS autoscaler shipped in trains. I need to create a IAM user to get and I would like to know what are the minimal p...
4 years ago
0 Votes
30 Answers
978 Views
0 Votes 30 Answers 978 Views
Could you please explain a bit more how trains adapt the torch version depending on the installed cuda version? Here is my setup: cuda 102 installed and corr...
4 years ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
2 years ago
0 Votes
1 Answers
976 Views
0 Votes 1 Answers 976 Views
Hi there, I moved my ClearML server from US to EU and now I am trying to setup the AWS autoscaler with the different architecture that I have now. So far I u...
3 years ago
0 Votes
30 Answers
1K Views
0 Votes 30 Answers 1K Views
Hi, is it possible to pass environment variables to agents created by the AWS AutoScaler service?
3 years ago
0 Votes
3 Answers
977 Views
0 Votes 3 Answers 977 Views
hi guys, is it possible to spin up two agents on one GPU? Something like trains-agent daemon --gpus 0 --queue default & trains-agent daemon --gpus 0 --queue ...
3 years ago
0 Votes
1 Answers
966 Views
0 Votes 1 Answers 966 Views
Hi, I have a clearml-agent (1.1.2) in a g4dn.4xlarge AWS instance (with one T4 GPU), that reports agent.cuda_version = 0 agent.cudnn_version = 0and does not ...
2 years ago
0 Votes
14 Answers
1K Views
0 Votes 14 Answers 1K Views
3 years ago
0 Votes
2 Answers
1K Views
0 Votes 2 Answers 1K Views
4 years ago
0 Votes
1 Answers
900 Views
0 Votes 1 Answers 900 Views
Hey there 🙂 Would in the WebUI, on an experiment CONFIGURATION tab, for a specific parameter, would it be possible not show its value as a single string whe...
2 years ago
0 Votes
28 Answers
1K Views
0 Votes 28 Answers 1K Views
Hi, I am trying to use omegaconf with task.connect_configuration and I get the following error: >>> OmegaConf.create(task.connect_configuration(config_dict))...
2 years ago
0 Votes
10 Answers
1K Views
0 Votes 10 Answers 1K Views
Hi, on clearml-server 1.5.0, in scalar graphs, the new default value is “Show closest data on hover”. Would it be possible to make it automatically set to “C...
2 years ago
Show more results questions
0 Hey There, Does Trains Support

No worries! I asked more to be informed, I don't have a real use-case behind. This means that you guys internally catch the argparser object somehow right? Because you could also simply use sys argv to find the parameters, right?

4 years ago
0 Hi, I Have An Agent That Is Running Two Experiments At The Same Time: One That Was Running For A Long Time (11H) And One That The Agent Picked Up Afterwards, While The First One Was Still Running. Context: I Have 3 Agents Up (Not In Docker Mode) And All O

Some more context: the second experiment finished and now, in the UI, in workers&queues tab, I see randomly
trains-agent-1 | - | - | - | ... (refresh page) trains-agent-1 | long-experiment | 12h | 72000 |

4 years ago
0 Hey There, I Would Like To Increase The

by replacing the pid with $PID ?

3 years ago
0 Hey There, I Would Like To Increase The

it actually looks like I don’t need such a high number of files opened at the same time

3 years ago
0 Hey There, I Would Like To Increase The

because at some point it introduces too much overhead I guess

3 years ago
0 Hey There, I Would Like To Increase The

now how to adapt to do it from extra_vm_bash_script ?

3 years ago
0 Hey There, I Would Like To Increase The

that works from within the ssh session

3 years ago
0 Hey There, I Would Like To Increase The

mmmh it fails, but if I connect to the instance and execute ulimit -n , I do see
65535while the tasks I send to this agent fail with:
OSError: [Errno 24] Too many open files: '/root/.commons/images/aserfgh.png'and from the task itself, I run:
import subprocess print(subprocess.check_output("ulimit -n", shell=True))Which gives me in the logs of the task:
b'1024'So nnofiles is still 1024, the default value, but not when I ssh, damn. Maybe rebooting would work

3 years ago
0 Hey There, I Would Like To Increase The

I will try adding
sudo sh -c "echo '\n* soft nofile 65535\n* hard nofile 65535' >> /etc/security/limits.conf"to the extra_vm_bash_script , maybe that’s enough actually

3 years ago
0 Hey There, I Would Like To Increase The

So actually I don’t need to play with this limit, I am OK with the default for now

3 years ago
0 Hi, What Happens Exactly When I Execute The Following Command:

Thanks AgitatedDove14 !
What would be the exact content of NVIDIA_VISIBLE_DEVICES if I run the following command?
trains-agent daemon --gpus 0,1 --queue default &

4 years ago
0 Hi, Although

Add carriage return flush support using the sdk.development.worker.console_cr_flush_period configuration setting (GitHub trains Issue 181)

3 years ago
0 Hi, How Can I Search An Old Experiment Based On Its Commit Hash?

I checked the commit date anch and went to all experiments, and scrolled until finding the experiment

one year ago
0 Hi, Another Bug To Report With The Aws_Auto_Scaler Using 1.1.2:

Nevermind, i was able to make it work, but no idea how

3 years ago
0 Hi, Another Bug To Report With The Aws_Auto_Scaler Using 1.1.2:

with 1.1.1 I get
User aborted: stopping task (3)

3 years ago
0 Hi, I Have An Agent That Is Running Two Experiments At The Same Time: One That Was Running For A Long Time (11H) And One That The Agent Picked Up Afterwards, While The First One Was Still Running. Context: I Have 3 Agents Up (Not In Docker Mode) And All O

no, one worker (trains-agent-1) "forget from time to time" the current experiment he is running and picks another experiment on top of the one he is currently running

4 years ago
0 Hi, I Am Giving Another Try To Clearml-Session And I Am Blocked At The Current Error Shown When The Cli Try To Establish The Tunneling:

Does the agent install the nvidia-container toolkit, so that GPUs of the instance can be accessed from inside the docker running jupyterlab?

2 years ago
4 years ago
0 Hi, I Attached An Iam Role To An Ec2 Instance To Grant Access To An S3 Bucket. The Ec2 Instance Is Running A Clearml-Agent (V1.1.0). I Didn’T Specify Any Key/Secret For Clearml. The Tasks Fail With The Following Error:

There is no need to add creds on the machine, since the EC2 instance has an attached IAM profile that grants access to s3. Boto3 is able retrieve the files from the s3 bucket

3 years ago
0 Hi, I Would Like To Follow-Up In This

Hi AppetizingMouse58 , I sent you the files in PM 🙂

2 years ago
Show more results compactanswers