It could be that the clearml-server misbehaves while the cleanup is ongoing, or even after it has finished.
Okay, it seems like the deletion just takes some time to complete and to be reflected in the WebUI. So when I try to delete again, a deletion process is apparently already running in the background.
I created a GitHub issue because the problem with the slow deletion still exists: https://github.com/allegroai/clearml/issues/586#issue-1142916619
SuccessfulKoala55 So what happens is that whenever the cleanup_service runs (and afterwards), clearml throws this kind of error:
```
[root@dc01deffca35 elasticsearch]# curl
{
  "cluster_name" : "clearml",
  "status" : "yellow",
  "timed_out" : false,
  "number_of_nodes" : 1,
  "number_of_data_nodes" : 1,
  "active_primary_shards" : 10,
  "active_shards" : 10,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 10,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_nu...
```
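For reference, Elasticsearch can be asked directly why shards are unassigned (with a single node, the 10 unassigned shards are most likely just replicas that have nowhere to go). A minimal sketch, assuming the clearml-server's Elasticsearch is reachable on localhost:9200:

```python
import requests

# Assumption: the clearml-server Elasticsearch instance listens on localhost:9200
ES = "http://localhost:9200"

# Explain why the first unassigned shard cannot be allocated
print(requests.get(f"{ES}/_cluster/allocation/explain?pretty").text)

# List all shards with their state and, if unassigned, the reason
print(requests.get(f"{ES}/_cat/shards?v&h=index,shard,prirep,state,unassigned.reason").text)
```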
I will try again tomorrow. It's getting late! Thank you for helping so far!
And how do I specify this in the output_uri? The default file server is specified by passing True. How would I specify to use the second one?
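If I understand the Task.init API correctly, output_uri also accepts an explicit URL, so a second file server could be targeted roughly like this (the server address below is made up):

```python
from clearml import Task

# output_uri=True would use the default file server from clearml.conf;
# an explicit URL (hypothetical address below) points at a different one.
task = Task.init(
    project_name="examples",
    task_name="use-second-fileserver",
    output_uri="http://second-fileserver.example.com:8081",
)
```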
One more thing: The cuda_version that clearml finds automatically is wrong.
I just tried the environment setup steps that clearml-agent does locally, but with my environment.yml instead of the one that clearml generates.
By host you mean the machine on which the agent is running? How does clearml-agent find the cuda_version?
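For reference, a quick way to see which CUDA versions the host itself reports (driver side via nvidia-smi, toolkit side via nvcc); just a sketch that assumes both tools are on the PATH:

```python
import subprocess

# Driver-side CUDA version, shown in the nvidia-smi header
print(subprocess.run(["nvidia-smi"], capture_output=True, text=True).stdout)

# Toolkit-side CUDA version, if a CUDA toolkit is installed on the host
print(subprocess.run(["nvcc", "--version"], capture_output=True, text=True).stdout)
```

If I remember correctly, the auto-detected value can also be overridden (e.g. agent.cuda_version in clearml.conf or a CUDA_VERSION environment variable), but that is worth double-checking against the clearml-agent docs.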
AnxiousSeal95 Thanks a lot. Seems to be working fine for me. I see the clearml-agent version that pip installs in the docker is now fixed to the host version 🙂 PyTorch Nightly is also installed correctly now!
Well, I guess the trade-off between no hurdles and safety is inherently not solvable. I am all for hurdles, as long as it is clear how to overcome them. And in my opinion, referring to clearml-init
is something that makes sense from both a developer and a user perspective.
Yeah, and the script ends with `clearml.Task - INFO - Waiting to finish uploads`
So my network seems to be fine. Downloading artifacts from the server to the agents is around 100 MB/s, while uploading from the agent to the server is slow.
An upload of 11 GB took around 20 hours, which cannot be right. Do you have any idea whether ClearML could have something to do with this slow upload speed? If not, I am going to start debugging the hardware/network.
But it does not seem to be related to network speed, rather to clearml: a simple file transfer test gives me approximately 1 Gbit/s between the server and the agent, which is what I would expect from the 1 Gbit/s network.
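To narrow down whether the slowdown happens on the clearml upload path, one can time a single artifact upload directly; a minimal sketch, assuming a throwaway project and a large local test file (names and the file server address are made up):

```python
import time
from clearml import Task

# Hypothetical project/task names and file server URL, only for the timing test
task = Task.init(
    project_name="debug",
    task_name="upload-speed-test",
    output_uri="http://clearml-fileserver.example.com:8081",
)

start = time.time()
task.upload_artifact("big_file", artifact_object="/path/to/large_test_file.bin")
task.flush(wait_for_uploads=True)  # block until the upload has actually finished
print(f"upload took {time.time() - start:.1f} s")
```

Comparing that number against a plain scp/rsync of the same file to the server should show whether the bottleneck is the SDK/fileserver path or the link itself.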