ReassuredTiger98

95 Questions, 639 Answers

Active since 10 January 2023

Last activity 8 months ago

Reputation

Badges 1

606 × Eureka!

0 Votes

17 Answers

1K Views

I Am Trying Pytorch Nightly Again With Python 3.10. Works Fine Locally, But Fails On Clearml-Agent In Docker Mode.

I am trying PyTorch Nightly again with Python 3.10. Works fine locally, but fails on clearml-agent in docker mode. ERROR: Ignored the following versions that...

clearml

one year ago

0 Votes

30 Answers

1K Views

Hi Everyone, How Can I Add A New Local Storage Location? Clearml Is In /Opt/Clearml And I Have Now Added A Second Hard Drive To My Server. How Can I Make Clearml Use The Additional Storage Space?

Hi everyone, how can I add a new local storage location? clearml is in /opt/clearml and I have now added a second hard drive to my server. How can I make cl...

clearml

2 years ago

0 Votes

5 Answers

1K Views

Second Question: I Want To Run Clearml-Server Without Fileserver. However, It Seems Like The Webserver Has Some Dependency On It (I Get Some Nginx Error). Am I Assuming Correctly That It Is Currently Not Possible To Run Clearml Without Fileservers?

Second question: I want to run clearml-server without fileserver. However, it seems like the webserver has some dependency on it (I get some nginx error). Am...

clearml

2 years ago

0 Votes

9 Answers

1K Views

Another Question: Is It Possible To Specify In Which Directory To Save All The Files That Clearml-Agent Creates (E.G. Cache Files Or Results Of The Currently Running Experiments)

Another question: Is it possible to specify in which directory to save all the files that clearml-agent creates (e.g. cache files or results of the currently...

clearml

3 years ago

0 Votes

10 Answers

1K Views

I Am Back With Another Question: Is There A File Similar To The

I am back with another question: Is there a file similar to the .clearml.conf that is project specific (for example in the top-level dir of the project-repos...

clearml

3 years ago

0 Votes

16 Answers

1K Views

Btw: There Seems To Be No Support For Videos In Tensorboard/Experiment View (E.G.

Btw: There seems to be no support for videos in tensorboard/experiment view (e.g. https://tensorboardx.readthedocs.io/en/latest/tensorboard.html#tensorboardX...

tensorboard

3 years ago

0 Votes

7 Answers

1K Views

Hello Everyone, Quick Question: How Can I Specify That I Want To Run My Task With

Hello everyone, quick question: How can I specify that I want to run my task with python -O flag remotely?

clearml

2 years ago
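For context on what the `-O` flag changes, here is a minimal sketch using only the Python standard library. The helper names `optimization_level` and `asserts_enabled` are hypothetical, and the sketch says nothing about how clearml-agent launches the interpreter; it only shows how a script can check whether it was started with `-O` (or with the `PYTHONOPTIMIZE` environment variable set, which has the same effect).

```python
import sys


def optimization_level() -> int:
    """Return the interpreter optimization level: 0 (none), 1 (-O), 2 (-OO)."""
    return sys.flags.optimize


def asserts_enabled() -> bool:
    """assert statements are only executed when __debug__ is True (no -O)."""
    return __debug__


if __name__ == "__main__":
    print("optimize level:", optimization_level())
    print("asserts enabled:", asserts_enabled())
```

Logging these two values at the start of a remotely executed task would be one way to verify whether the flag actually took effect.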

0 Votes

2 Answers

992 Views

Anyone Else Experiencing That Tasks That Fail Are Not Shown As "Failed", But Instead Keep On Running And Hogging The Clearml-Agent?

Anyone else experiencing that tasks that fail are not shown as "failed", but instead keep on running and hogging the clearml-agent?

clearml

3 years ago

0 Votes

18 Answers

1K Views

Is There A Reason Why All Clearml.Task Methods Regarding Requirements (E.G. Pip Requirements) Are Class Methods? Are Requirements Not Stored In A Task?

Is there a reason why all clearml.Task methods regarding requirements (e.g. pip requirements) are class methods? Are requirements not stored in a task?

clearml

3 years ago

0 Votes

16 Answers

1K Views

Hello Everyone! Is It Possible To Deactivate Package Analysis For Remote Execution? I Run My Code With Clearml-Agent In Docker Mode With Nvidia:Pytorch Container. When Clearml Is Running Inside The Docker The Installed Packages Of The Webui Get Updated. H

Hello everyone! Is it possible to deactivate package analysis for remote execution? I run my code with clearml-agent in docker mode with nvidia:pytorch conta...

clearml

3 years ago

0 Votes

26 Answers

1K Views

Another Quick Question About Fileservers And Clearml-Agent: Clearml-Agent Seems To Ignore The Output Destination Set In The Task Config

Another quick question about fileservers and clearml-agent: clearml-agent seems to ignore the output destination set in the task config sdk.development.defau...

clearml

one year ago

0 Votes

2 Answers

847 Views

Hey Guys

Hey guys 🙂 I am wondering what is the best way to make a clearml-server deployment robust? E.g. let's say my server may become unavailable because of a hard...

clearml

one year ago

0 Votes

2 Answers

1K Views

Is There Some

Is there some is_executed_remotely function in clearml?

clearml

3 years ago

0 Votes

4 Answers

1K Views

Hey Everyone, When Uploading With

Hey everyone, when uploading with clearml-data it will print and save a lot of logs (multiple GB) because it prints the progress quite often. How can I make ...

dataset

2 years ago

0 Votes

6 Answers

1K Views

Quick Question: Is It Possible To See Who Aborted A Task?

Quick question: Is it possible to see who aborted a task?

clearml

2 years ago

0 Votes

2 Answers

1K Views

Hi Everyone, Is There Any Difference Between The Service Agent That Comes With The Server And The Normal Clearml-Agent In

Hi everyone, is there any difference between the service agent that comes with the server and the normal clearml-agent in --services-mode ?

mlops

2 years ago

0 Votes

4 Answers

1K Views

Hi Everyone, I Have A Question That Is Not Directly Related To Clearml: What Is The Best Way To Start Clearml-Server And Clearml-Agents At Startup Of A Ubuntu Machine? I Do Not Need An Explanation, Just Need To Know What I Need To Read About.

Hi everyone, I have a question that is not directly related to ClearML: What is the best way to start clearml-server and clearml-agents at startup of a Ubunt...

clearml

3 years ago

0 Votes

8 Answers

1K Views

Hey Everyone, Is It Possible To Set The Fileservers In The Auto-Generated Credentials Of The Users? I Use Minio Fileservers And Want To Provide The Url With The Credentials Automatically.

Hey everyone, is it possible to set the fileservers in the auto-generated credentials of the users? I use MinIO fileservers and want to provide the url with ...

clearml

2 years ago

0 Votes

3 Answers

1K Views

Is It Intended That Restarting The Server Sometimes Leads To Abortion Of Experiments On Clearml-Agents

Is it intended that restarting the server sometimes leads to abortion of experiments on clearml-agents ( User aborted: stopping task (3) )?

clearml

2 years ago

0 Votes

9 Answers

1K Views

Hi Everyone, I Tried To Implement Ssl Support With Nginx And Everything Seems To Work So Far, But I Get "The Following Artifacts Could Not Be Deleted". How Can I Debug This? I Do Not See Any Error In The Logs. I Can Save Artifacts And Retrieve Them (Howev

Hi everyone, I tried to implement SSL support with NGINX and everything seems to work so far, but I get "The following artifacts could not be deleted". How c...

clearml

2 years ago

0 Votes

4 Answers

1K Views

Hello Everyone, What Is The Behavior Of Clearml-Agent When Run In Docker Mode And The Container Crashes? To Me It Seems Like The Clearml-Agent Will Stop. In My Case I Memory-Limit The Container And It Seems To Crash, When Too Much Memory Is Used. However,

Hello everyone, what is the behavior of clearml-agent when run in docker mode and the container crashes? To me it seems like the clearml-agent will stop. In ...

clearml

3 years ago

0 Votes

13 Answers

1K Views

Hello Everyone, I Have A Question About Clearml-Agent + Ssh. I Have Valid Ssh Credentials In

Hello everyone, I have a question about clearml-agent + ssh. I have valid ssh credentials in ~/.ssh but no .ssh/known_hosts yet. My clearml-agent fails to ex...

clearml

3 years ago

0 Votes

1 Answer

945 Views

Btw: The Loading Indicator For The Webui Is Not Displayed When You Switch Between Tasks Quickly And Are In The Scalars Section. For Example I Press On The First Task And The Loading Icon Appears And Now I Press On Another Task Quickly, The Loading Icon Wi

Btw: The loading indicator for the WebUI is not displayed when you switch between tasks quickly and are in the scalars section. For example I press on the fi...

clearml

3 years ago

0 Votes

22 Answers

1K Views

Another Question: Is It Possible To Read The Dependencies Manually From A Conda Environment.Yml? It Seems Like Clearml Is Not Able To Fetch The Dependencies Correctly When

Another question: Is it possible to read the dependencies manually from a conda environment.yml? It seems like clearml is not able to fetch the dependencies ...

clearml

3 years ago

0 Votes

30 Answers

1K Views

It Seems Like Clearml Agent Does Not Support Argparse Subparsers, Right?

It seems like ClearML Agent does not support argparse subparsers, right? https://docs.python.org/3/library/argparse.html#argparse.ArgumentParser.add_subparser...

mlops

3 years ago
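For readers unfamiliar with the feature in question, a minimal sketch of an argparse parser that uses subparsers follows. The program name `train`, the subcommands `fit` and `evaluate`, and their options are hypothetical and only illustrate the kind of parser the question refers to; the sketch makes no claim about how ClearML handles such parsers.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with two subcommands, each with its own options."""
    parser = argparse.ArgumentParser(prog="train")
    # dest stores the chosen subcommand name on the parsed namespace.
    subparsers = parser.add_subparsers(dest="command", required=True)

    fit = subparsers.add_parser("fit")
    fit.add_argument("--lr", type=float, default=1e-3)

    evaluate = subparsers.add_parser("evaluate")
    evaluate.add_argument("--checkpoint", required=True)
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(["fit", "--lr", "0.01"])
    print(args.command, args.lr)
```

Each subcommand produces a different set of attributes on the namespace, which is what makes subparsers harder to capture than a flat argument list.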

0 Votes

30 Answers

1K Views

I Cannot Get Clearml-Agent With Docker Containers To Work. Clearml Uses

I cannot get clearml-agent with docker containers to work. clearml uses docker run -v /home/.ssh:/root/.ssh to mount the ssh profile into the container. Howe...

clearml

3 years ago

0 Votes

6 Answers

1K Views

Hi, Is It Intended Behavior That Models That Are Saved By A Clearml-Agent Will Have The Clearml-Agent's User (So The User Who Generated The Api Credentials For The Agent) In The "User" Field Of The Model Instead Of The User Who Started The Task?

Hi, is it intended behavior that models that are saved by a clearml-agent will have the clearml-agent's user (so the user who generated the API credentia...

clearml

2 years ago

0 Votes

5 Answers

1K Views

Also Is There A Way To Just Run A Dockerfile? So For Example My Project Includes A

Also is there a way to just run a Dockerfile? So for example my project includes a Dockerfile in the top-level dir. When sending to the agent I would like it...

mlops

3 years ago

0 Votes

15 Answers

1K Views

I Have A Question About Queue Priorities With Clearml-Agent. I Have Two Queues,

I have a question about queue priorities with clearml-agent. I have two queues, A and B . Some of my agents support queue A and B with higher priority for B...

clearml

3 years ago
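To make the notion of queue priority concrete, here is a small sketch of strict priority polling over two queues named A and B as in the question. The function `next_task` and the task names are hypothetical, and this is only a model of the general idea (always drain the higher-priority queue first), not a description of how clearml-agent is implemented.

```python
from collections import deque
from typing import Deque, Dict, Optional, Sequence, Tuple


def next_task(
    queues: Dict[str, Deque[str]], priority_order: Sequence[str]
) -> Optional[Tuple[str, str]]:
    """Pop the next task, checking queues from highest to lowest priority."""
    for name in priority_order:
        queue = queues.get(name)
        if queue:  # skip missing or empty queues
            return name, queue.popleft()
    return None  # nothing to run in any queue


if __name__ == "__main__":
    queues = {"A": deque(["a1", "a2"]), "B": deque(["b1"])}
    # This worker serves B before A.
    while (item := next_task(queues, ["B", "A"])) is not None:
        print(item)
```

With this policy a task in A is only picked up once B is empty, so a steady stream of B tasks can starve A.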

0 Votes

30 Answers

1K Views

Hi Everyone, Quick Question Regarding Minio And Logging:

Hi everyone, quick question regarding MinIO and logging: task = Task.init( project_name=project_name, task_name=task_name, tags=tags, output_uri=" ", ) torch...

clearml

2 years ago

0 What Is The Current State Of Deleting Debug Samples? I Use S3/Minio As My Fileserver. If I Delete Tasks From The Ui, Are Debug Samples Deleted On S3? If I Run The Cleanup Service Script, Does It Debug Samples On S3?

AppetizingMouse58 Thank you very much. I forgot the volume mapping.
So can I just add the config to the async_delete container and mirror the directory structure from github?

volumes:
    - /opt/clearml/config:/opt/clearml/config
    - /opt/clearml/logs:/var/log/clearml

one year ago

0 Another Question: Is It Possible To Read The Dependencies Manually From A Conda Environment.Yml? It Seems Like Clearml Is Not Able To Fetch The Dependencies Correctly When

The package is just subdir by the way. So it should not be in installed packages anyways, right?

3 years ago

0 Hi Again

Thank you! I agree with CostlyOstrich36 , that is why I meant a false sense of security 🙂
Thank you SuccessfulKoala55 so actually only the file-server needs to be secured.

2 years ago

0 Hi Everyone, I Saw A New Entry In Docker-Compose:

Perfect, just what I always wanted. Looking forward to the MinIO version. Thank you :)

2 years ago

0 Hello! Since Today I Get

==> 2021-03-11 13:54:59 <==
# cmd: /home/tim/miniconda3/condabin/conda create --yes --mkdir --prefix /home/tim/.clearml/venvs-builds/3.8 python=3.8
# conda version: 4.9.2
+defaults/linux-64::_libgcc_mutex-0.1-main
+defaults/linux-64::ca-certificates-2021.1.19-h06a4308_1
+defaults/linux-64::certifi-2020.12.5-py38h06a4308_0
+defaults/linux-64::ld_impl_linux-64-2.33.1-h53a641e_7
+defaults/linux-64::libedit-3.1.20191231-h14c3975_1
+defaults/linux-64::libffi-3.3-he6710b0_2
+defaults/linux-64...

3 years ago

0 Hi Everyone, I Tried To Implement Ssl Support With Nginx And Everything Seems To Work So Far, But I Get "The Following Artifacts Could Not Be Deleted". How Can I Debug This? I Do Not See Any Error In The Logs. I Can Save Artifacts And Retrieve Them (Howev

Thanks, I will look into it. For me the weird thing is that saving works and only deletion fails somehow.

2 years ago

0 Hello!

Yes, that works fine. Just the http vs https was the problem. The UI will automatically change s3://<minio-address>:<port> to http://<minio-address>:<port> in http://myclearmlserver.org/settings/webapp-configuration . However, what I need is https://<minio-address>:<port>

2 years ago
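The scheme mismatch described above can be illustrated with a short standard-library sketch. The function `s3_to_web_url` and the example host are hypothetical; this is not the WebUI's actual conversion code, only an illustration of mapping an `s3://` address to either `http://` or `https://` while keeping host, port and path intact.

```python
from urllib.parse import urlsplit, urlunsplit


def s3_to_web_url(s3_url: str, secure: bool = True) -> str:
    """Rewrite s3://host:port/path to http(s)://host:port/path."""
    parts = urlsplit(s3_url)
    if parts.scheme != "s3":
        raise ValueError(f"expected an s3:// URL, got: {s3_url}")
    scheme = "https" if secure else "http"
    return urlunsplit((scheme, parts.netloc, parts.path, parts.query, parts.fragment))


if __name__ == "__main__":
    url = "s3://minio.example.com:9000/bucket/artifact.txt"
    print(s3_to_web_url(url, secure=False))  # the behaviour described in the post
    print(s3_to_web_url(url, secure=True))   # the behaviour that was needed
```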

0 Btw: There Seems To Be No Support For Videos In Tensorboard/Experiment View (E.G.

Based on https://github.com/lanpa/tensorboardX/blob/34d1616c035faaa0f3f7c9d19cb8bb4425f19939/tensorboardX/summary.py#L355 I would guess that it is already encoded before added to the tensorboard summary.

3 years ago

And the files that I see on github are the default configuration of the server, even if I do not have these files in my installation, right?

one year ago

0 Hi, I Want To Set Up A Clearml Server For Experiment Management And Later Add Clearml-Agents. Can I Link Gpu Nodes To The Clearml Server Later On? What Are The Recommended Server Specs For Hosting The Clearml Server?

You can add and remove clearml-agents to/from the clearml-server anytime.

one year ago

0 Hi Everyone, I Am Having Problems Getting Pytorch Nightly (For Torch 2.0 Preview) To Run On Clearml-Agent. Here Is My Log. Maybe Someone Sees What The Issue Is. I Dont Get It. It Runs All Fine Locally!

Maybe if you have time you can take a look at the log I posted in the beginning. I think I have the same extra_index_url and the nightly flag activated 😕

2 years ago

0 Can Someone Confirm That

Thank you very much. I tested it on a different machine now and it works like intended. So there must be something misconfigured with this one machine.

3 years ago

0 Btw: There Seems To Be No Support For Videos In Tensorboard/Experiment View (E.G.

I guess this is the current way to do it: https://github.com/tensorflow/tensorboard/issues/39#issuecomment-568917607 so I would say: Yes, it supports gif.

3 years ago

0 Can Someone Confirm That

Is this working in the latest version? clearml-agent falls back to /usr/bin/python3.8 no matter how I configure clearml.conf . Just want to make sure, so I can investigate what's wrong with my machine if it is working for you.

3 years ago

0 Fyi: Conda Installation Of Pytorch Is Broken Again. My Old Tasks Which Worked Before Now Fail Since They Do Not Find Torch. However, I Can See In The Execution That Conda Had Errors. Most Probably It Happens Because Pytorch 1.8.1 Has Been Released, But I

Installed packages:

` # Python 3.7.10 (default, Feb 26 2021, 18:47:35) [GCC 7.3.0]

absl-py==0.12.0
aiostream==0.4.2
attrs==20.3.0
cached-property==1.5.2
cffi==1.14.5
chardet==4.0.0
clearml==0.17.5
cython==0.29.22
dm-control==0.0.364896371
dm-env==1.4
dm-tree==0.1.5
fasteners==0.16
furl==2.1.0
future==0.18.2
glfw==2.1.0
gym==0.18.0
h5py==3.2.1
humanfriendly==9.1
idna==2.10
imageio-ffmpeg==0.4.3
importlib-metadata==3.7.3
jsonschema==3.2.0
labmaze==1.0.4
lxml==4.6.3
moviepy==1.0.3
mujoco-py==...

3 years ago

0 Hi Again

Hey 🙂
No, nothing specific. It is just that users can get a false sense of security.

2 years ago

0 Hey Everyone, I Have Another Question: Is It Possible To Change Agent Config For Each Task? E.G.

Maybe this opens up another question, which is more about how clearml-agent is supposed to be used. The "pure" way would be to make the docker image provide everything and clearml-agent should do no setup at all.
What I currently do instead is letting the docker image provide all system dependencies and let clearml-agent set up all the python dependencies. This allows me to reuse a docker image for more different experiments. However, then it would make sense to have as many configs as possib...

2 years ago

0 Hi Everyone, Quick Question: When Clearml-Agent Sets Up The Virtual Environment With Pip, Is Finding The Correct Cuda Version For Pytorch Something That Pip Or That Clearml Does?

Hi CostlyOstrich36 , thank you for answering so quickly. I think that's not how it works, because if this were true, one would always have to match the local machine to the servers. Afaik clearml finds the correct PyTorch version, but I was not sure how (custom logic vs. pip doing it).

2 years ago

0 Hi Everyone, Is It Possible To Show The Upload Progress Of Artifacts? E.G. I Use

An upload of 11GB took around 20 hours which cannot be right. Do you have any idea whether ClearML could have something to do with this slow upload speed? If not I am going to start debugging with the hardware/network.

3 years ago
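To put the reported numbers in perspective, a quick back-of-the-envelope calculation follows. The helper `transfer_rate` is hypothetical and the sketch assumes 1 GB = 10^9 bytes; with the figures from the post (11 GB in about 20 hours) it comes out at roughly 0.15 MB/s, or about 1.2 Mbit/s.

```python
def transfer_rate(size_bytes: float, duration_s: float) -> dict:
    """Average throughput in megabytes and megabits per second."""
    bytes_per_s = size_bytes / duration_s
    return {
        "MB/s": bytes_per_s / 1e6,
        "Mbit/s": bytes_per_s * 8 / 1e6,
    }


if __name__ == "__main__":
    # Assumption: 1 GB = 10**9 bytes; 20 hours = 72,000 seconds.
    rate = transfer_rate(11 * 10**9, 20 * 3600)
    print(f"{rate['MB/s']:.3f} MB/s, {rate['Mbit/s']:.2f} Mbit/s")
```

A rate this far below typical LAN or disk speeds is consistent with the suspicion in the post that something other than normal upload overhead is involved.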

0 Another Question: Is It Possible To Read The Dependencies Manually From A Conda Environment.Yml? It Seems Like Clearml Is Not Able To Fetch The Dependencies Correctly When

Thanks for your help again. I will just use detect_with_conda_freeze: true . Seems like a perfect solution for me!

3 years ago

0 Quick Question About The

I see! Thanks!

3 years ago

0 It Seems Like Clearml Agent Does Not Support Argparse Subparsers, Right?

Ah, it actually is also a string with remote_execution, but still not what it should be.

3 years ago

0 Hi Everyone, Is It Possible To Show The Upload Progress Of Artifacts? E.G. I Use

Seems more like a bug or something is not properly configured on my side.

3 years ago

0 Is There Some

Thanks! 🙂

3 years ago

0 Is There A Reason Why All Clearml.Task Methods Regarding Requirements (E.G. Pip Requirements) Are Class Methods? Are Requirements Not Stored In A Task?

I think doing all that work is not worth it right now, I am just trying to understand why clearml does not seem to be designed something like this:

` task_name = args.task_name

task = Task()
task = task.load_statedict(await Task.load_or_create(task_name))

task.requirements.add(...)
await task.synchronize()

task.execute_remotely(queue_name, exit=True) `

3 years ago

0 Btw: There Seems To Be No Support For Videos In Tensorboard/Experiment View (E.G.

You mean I should have opencv/ffmpeg available on the clearml-server machine?

3 years ago

0 Is There A Reason Why All Clearml.Task Methods Regarding Requirements (E.G. Pip Requirements) Are Class Methods? Are Requirements Not Stored In A Task?

Both, actually. So what I personally would find intuitive is something like this:
` class Task:
    def load_statedict(self, state_dict):
        pass

    async def synchronize(self):
        ...

    async def task_execute_remotely(self):
        await self.synchronize()
        ...

    def add_requirement(self, requirement):
        ...

    @classmethod
    async def init(cls, task_name):
        task = Task()
        task.load_statedict(await Task.load_or_create(task_name))
        await tas...

3 years ago

0 I Have A Problem That Might Not Directly Be Clearml Related, But Maybe Someone Here Has An Idea: I Run A Clearml-Server On A Machine With 128Gb Ram, 32 Cores And 2 Gpus. On The Same Machine I Run 2 Clearml-Agent Each With Access To 1 Gpu, 12 Cores, An 48G

CostlyOstrich36 Actually no container exits, so I guess if it's because of OOM like SuccessfulKoala55 implies, then maybe a process inside the container gets killed and the container will hang? Is this possible?
SuccessfulKoala55 I did not observe elastic to use much RAM (at least right after starting). Doesn't this line in the docker-compose control the RAM usage?
ES_JAVA_OPTS: -Xms2g -Xmx2g -Dlog4j2.formatMsgNoLookups=true

2 years ago
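As a small aid for reading the `ES_JAVA_OPTS` line quoted above, the following sketch parses the `-Xms`/`-Xmx` flags into byte counts. The function `parse_heap_opts` is hypothetical. Note that these flags set the initial and maximum size of the Java heap only; the JVM process as a whole can use more memory than `-Xmx` (for example for off-heap buffers and thread stacks), so the line bounds the heap rather than the container's total RAM usage.

```python
import re

# Binary multipliers for the JVM size suffixes k, m and g.
_UNITS = {"k": 1024, "m": 1024**2, "g": 1024**3}


def parse_heap_opts(java_opts: str) -> dict:
    """Extract -Xms (initial heap) and -Xmx (max heap) sizes in bytes."""
    result = {}
    for token in java_opts.split():
        match = re.fullmatch(r"-X(ms|mx)(\d+)([kKmMgG]?)", token)
        if match:
            kind, number, unit = match.groups()
            # No suffix means the value is already in bytes.
            result[kind] = int(number) * _UNITS.get(unit.lower(), 1)
    return result


if __name__ == "__main__":
    opts = "-Xms2g -Xmx2g -Dlog4j2.formatMsgNoLookups=true"
    heap = parse_heap_opts(opts)
    print({key: value / 1024**3 for key, value in heap.items()})  # sizes in GiB
```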

SuccessfulKoala55 I just had the issue again. The logs show nothing of interest. It looks like OOM to me, but I will test this again with way larger SWAP, so the server only slows down, but does not kill something. Unfortunately, kernel logs also do not show much (maybe I have my server logs misconfigured, I am no expert).
What is interesting though is that docker only showed my nginx, minio and docker-registry to have exited, while all the clearml containers were still running. I restarted ...

2 years ago

Could be a clean log after the restart. Unfortunately, I restarted the server right away 😞 I am going to post again with the appropriate logs if it happens again.

2 years ago
