Yep, I will add this as an issue. Btw: Should I rather post the kind of questions I am asking as an issue, or do they fit better here?
Or does MinIO delay deletion somehow? Deleting a task via the web interface also does not delete the debug samples on MinIO.
Thanks, I will look into it. For me the weird thing is that saving works and only deletion fails somehow.
That seems to be the case. After parsing the args I run task = Task.init(...) and then task.execute_remotely(queue_name=args.enqueue, clone=False, exit_process=True).
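For context, a minimal sketch of that flow (the argparse setup and the project/task names are placeholders, not my actual script):
import argparse
from clearml import Task

parser = argparse.ArgumentParser()
parser.add_argument("--enqueue", default="default")   # queue name here is a placeholder
args = parser.parse_args()

# register the run, then stop local execution and enqueue it for an agent
task = Task.init(project_name="examples", task_name="remote-run")   # names are placeholders
task.execute_remotely(queue_name=args.enqueue, clone=False, exit_process=True)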
But here is the funny thing:
channels:
- pytorch
- conda-forge
- defaults
dependencies:
- cudatoolkit=11.1.1
- pytorch=1.8.0
This installs the GPU version.
Is there a way to see the contents of /tmp/conda_envaz1ne897.yml? It seems to be deleted after the task is finished.
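A workaround sketch one could try while the agent is still resolving the environment: keep copying the temporary file away before it is cleaned up (the path pattern is assumed from the file name above):
mkdir -p /tmp/conda_env_backup
# keep a copy of any conda_env*.yml the agent writes to /tmp before it gets removed
while true; do
  cp /tmp/conda_env*.yml /tmp/conda_env_backup/ 2>/dev/null
  sleep 1
done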
Perfect, will try it. fyi: The conda_channels that I used are from clearml-agent init
Is this working in the latest version? clearml-agent falls back to /usr/bin/python3.8 no matter how I configure clearml.conf.
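Roughly what I mean by configuring it (a sketch; I am assuming agent.python_binary is the relevant key, and the miniconda path is just a placeholder):
agent {
  # interpreter the agent should use when building venvs, instead of /usr/bin/python3.8
  python_binary: "/home/tim/miniconda3/bin/python3.8"
}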
Just want to make sure, so I can investigate what's wrong with my machine if it is working for you.
Thank you very much. I tested it on a different machine now and it works as intended. So there must be something misconfigured with this one machine.
CostlyOstrich36 Actually, no container exits, so I guess if it is because of OOM, like SuccessfulKoala55 implies, then maybe a process inside the container gets killed and the container hangs? Is this possible?
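To check the OOM theory one could look at Docker's own container state and the kernel log (the container name is an assumption):
# true here means Docker saw the container get OOM-killed
docker inspect --format '{{.State.OOMKilled}} {{.State.ExitCode}}' clearml-elastic
# OOM kills of processes inside a container usually show up here as well
dmesg -T | grep -i 'out of memory'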
SuccessfulKoala55 I did not observe Elastic using much RAM (at least right after starting). Doesn't this line in the docker-compose control the RAM usage? ES_JAVA_OPTS: -Xms2g -Xmx2g -Dlog4j2.formatMsgNoLookups=true
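For context, that line sits in the elasticsearch service of the docker-compose roughly like this (a sketch; the image tag and surrounding keys are assumptions):
elasticsearch:
  image: docker.elastic.co/elasticsearch/elasticsearch:7.6.2   # tag is an assumption
  environment:
    ES_JAVA_OPTS: -Xms2g -Xmx2g -Dlog4j2.formatMsgNoLookups=true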
For now I can tell you that with conda_freeze: true it fails, but with conda_freeze: false it works!
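For reference, this is the kind of place the flag sits in clearml.conf (a sketch; I am assuming agent.package_manager is the right scope):
agent {
  package_manager {
    type: conda
    # true is the setting that fails for me, false works
    conda_freeze: false
  }
}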
Do you know how I can make sure I do not have CUDA installed, or a broken CUDA installation?
Type "help", "copyright", "credits" or "license" for more information.
>>> from clearml_agent.helper.gpu.gpustat import get_driver_cuda_version
>>> get_driver_cuda_version()
'110'
I do not have a global cuda install on this machine. Everything except for the driver is installed via conda.
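Checks one could run to rule out a stray system-wide CUDA toolkit (plain CLI calls, nothing clearml-specific):
which nvcc || echo "no nvcc on PATH"   # the toolkit compiler should be absent
ls -d /usr/local/cuda* 2>/dev/null     # no system-wide toolkit directories either
nvidia-smi                             # the driver itself, and the CUDA version it reports
conda list cudatoolkit                 # the only cudatoolkit should live inside conda envs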
I tried to run the task with detect_with_conda_freeze: false instead of true and got:
Executing Conda: /home/tim/miniconda3/condabin/conda install -p /home/tim/.clearml/venvs-builds/3.8 -c defaults -c conda-forge -c pytorch 'pip<20.2' --quiet --json
Pass
Conda: Trying to install requirements:
['pytorch~=1.8.0']
Executing Conda: /home/tim/miniconda3/condabin/conda env update -p /home/tim/.clearml/venvs-builds/3.8 --file /tmp/conda_envh7rq4qmc.yml --quiet --json
Conda error: Unsati...
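For reference, the flag mentioned above sits in clearml.conf roughly like this (a sketch; I am assuming it belongs under sdk.development):
sdk {
  development {
    # when true, clearml logs the full conda environment as the task requirements
    detect_with_conda_freeze: false   # switched from true for this test
  }
}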
ca-certificates 2021.1.19 h06a4308_1
certifi 2020.12.5 py38h06a4308_0
cudatoolkit 11.0.221 h6bb024c_0
ld_impl_linux-64 2.33.1 h53a641e_7
libedit 3.1.20191231 h14c3975_1
libffi 3.3 he6710b0_2
libgcc-ng 9.1.0 hdf63c60_0
libstdcxx-ng 9.1.0 hdf63c60_0
ncurses ...
clearml will register preinstalled conda packages as requirements.
So it seems to be definitely a problem with docker and not with clearml. However, I do not get why it works for you but on none of my machines (all Ubuntu 20.04 with docker 20.10).
In the beginning my config file was not empty 😕