ReassuredTiger98

95 Questions, 639 Answers

Active since 10 January 2023

Last activity 8 months ago

Reputation

Badges 1

606 × Eureka!

Answers 639

0 Is There Some

Thanks! 🙂

3 years ago

0 Btw: There Seems To Be No Support For Videos In Tensorboard/Experiment View (E.G.

Yea, tensorboardX is using moviepy.

3 years ago

0 Hi Everyone, Quick Question: When Clearml-Agent Sets Up The Virtual Environment With Pip, Is Finding The Correct Cuda Version For Pytorch Something That Pip Or That Clearml Does?

I have to correct myself, I do not even have CUDA installed. Only the driver and everything CUDA-related is provided by the docker container. This works with a container that has CUDA 11.4, but now I have one with 11.6 (latest nvidia pytorch docker).

However, even after changing the clearml.conf and overriding with CUDA_VERSION, the clearml-agent prints on the docker container agent.cuda_version = 114 ! (Other changes to the clearml.conf on the agent are reflected in the docker, so only...

2 years ago

0 Hi Everyone, Quick Question: When Clearml-Agent Sets Up The Virtual Environment With Pip, Is Finding The Correct Cuda Version For Pytorch Something That Pip Or That Clearml Does?

Hi CostlyOstrich36 , thank you for answering so quick. I think that s not how it works because if this was true, one would have to always match local machine to servers. Afaik clearml finds the correct PyTorch Version, but I was not sure how (custom vs pip does it)

2 years ago

0 Hi Everyone, Quick Question: When Clearml-Agent Sets Up The Virtual Environment With Pip, Is Finding The Correct Cuda Version For Pytorch Something That Pip Or That Clearml Does?

I used the wrong docker container. The docker container I used had version 11.4. Interestingly, the override from clearml.conf and CUDA_VERSION Env variable did not work there.

With the correct docker container everything works fine. Shame on me.

2 years ago

0 Hi Everyone, Quick Question: When Clearml-Agent Sets Up The Virtual Environment With Pip, Is Finding The Correct Cuda Version For Pytorch Something That Pip Or That Clearml Does?

Nvm, I think its my mistake. I will investigate.

2 years ago

0 Hi Everyone, Is It Possible To Show The Upload Progress Of Artificats? E.G. I Use

The agent and server have similar hardware also. So I would expect same read/write speed.

3 years ago

0 Any Idea Why I Would Be Getting The Following Error When Running A Task In A Clearml-Agent? (Python 3.7.9, Package_Manager.Type = Conda)

I will debug this myself a little more.

3 years ago

0 Hi Everyone, How Can I Check Programmatically Whether A Task Is Running Remotely And How Can I Get The Hostname? Additionally, Retrieving The User Name That Is Shown In The Server Ui Would Be Nice.

Perfect, works! I was looking for "host", didn't come to my mind to search for "worker". Any idea about getting the user that created the task?

one year ago

0 Hi Everyone, How Can I Check Programmatically Whether A Task Is Running Remotely And How Can I Get The Hostname? Additionally, Retrieving The User Name That Is Shown In The Server Ui Would Be Nice.

Or maybe even better: How can I get all the information of the "INFO" page in the WebUI of a task?

one year ago

0 Is There A Reason Why All Clearml.Task Methods Regarding Requirements (E.G. Pip Requirements) Are Class Methods? Are Requirements Not Stored In A Task?

I think doing all that work is not worth it right now, I am just trying to understand why I clearml seems not to be designed something like this:

` task_name = args.task_name

task = Task()
task = task.load_statedict(await Task.load_or_create(task_name))

task.requirements.add(...)
await task.synchronize()

task.execute_remotely(queue_name, exit=True) `

3 years ago

0 Hi Quick Question About Clearml-Data: It Seems Like It Is Only Using A Single Thread For Compression. Is There A Way To Use Multi-Threading? For Big Datasets It Takes Quite A While To Upload.

Also, is max_workers about compression threads or upload threads or both?

8 months ago

0 I Am Trying To Plot Values That Are Either 0 Or 1 (With Tensorboardx.Add_Scalar). However, It Doesn'T Show Correctly. Any Idea Why? (Smoothing Is 0)

I see. But I just realized: Subsampling means you just show every nth datapoint, right? I still do not get why this leads to some 0.5 values when in my plot there should only be 0 and 1.

3 years ago

0 I Finally Got The Cleanup_Service.Py To Run. However, Now I Get Errors When Trying To Load Scalars. This Is What I Found In The Logs

[2021-05-07 10:53:00,566] [9] [WARNING] [elasticsearch] POST ` [status:N/A request:60.061s]
Traceback (most recent call last):
File "/usr/local/lib/python3.6/site-packages/urllib3/connectionpool.py", line 445, in _make_request
six.raise_from(e, None)
File "<string>", line 3, in raise_from
File "/usr/local/lib/python3.6/site-packages/urllib3/connectionpool.py", line 440, in _make_request
httplib_response = conn.getresponse()
File "/usr/lib64/python3.6/http/client.py", lin...

3 years ago

0 Hello! Since Today I Get

Or there should be an early error for trying to run conda based tasks on pip agents

3 years ago

0 Currently, To Provide Ssh Access To The Docker Images For A Task,

Is there a way for me to configure/add the run arguments for the docker run call?

3 years ago

0 I Suddenly Get

No, it is only a single one.

3 years ago

0 Is It Possible To Ask An Agent To Use A Specified Existing Python Environment Instead Of Building One From Scratch?

I am pretty sure there is a flag in the clearml.conf where you can specify which python binary to use.

3 years ago

0 Hello! Since Today I Get

Perfect! 🙂

3 years ago

0 Hello! Since Today I Get

I tried "~=", "==" and "="

3 years ago

0 Is There A Reason Why All Clearml.Task Methods Regarding Requirements (E.G. Pip Requirements) Are Class Methods? Are Requirements Not Stored In A Task?

I think I still don't get how clearml is supposed to work/be used. Why wouldn't the following work currently?
Example:
` task = Task.init(...)
if not running_remotely:
task_dict = task.export_task()
requirements = task_dict["script"]["requirements"]["pip"].splitlines()
requirement_torch = [r for r in requirements if r.startswith("torch==")]
requirements.remove(requirement_torch[0])
requirements.append("torch >= 1.8.1")
task_dict["script"]["requirements"]["pip"] = "\n"....

3 years ago

0 I Finally Got The Cleanup_Service.Py To Run. However, Now I Get Errors When Trying To Load Scalars. This Is What I Found In The Logs

No idea what's happening there.

3 years ago

0 Any Idea Why Only A Single Instance Of Mujoco Can Be Run With Clearml-Agent? I Run 2 Clearm-Agents, One Per Gpu On My Workstation. However, The Second Task Failes With One Of The Following Errors:

So the environment variables are not set by the clearml-agent, but by clearml itself

3 years ago

0 Hi Everyone, Is It Possible To Not Create A Copy Of A Dataset When Adding To Clearml? My Data Is Already In A Directory On The Clearml-Server Machine And I Do Not Want To Copy It, Just Add It To Clearml As Dataset.

Maybe a related question: Anyone every worked with datasets larger than the clearml-agent cache? Some colleague of mine has a dataset of ~ 1 tera byte...

2 years ago

0 Hi Everyone, Quick Question Regarding Minio And Logging:

mytask.get_logger().current_logger().set_default_upload_destination(" s3://ip:9000/clearml ") this is what I do. Do you do the same?

2 years ago

0 Hello! Since Today I Get

I get 110 but it should be 111

3 years ago

0 Can Someone Point Me Whether/How The Services-Agent The Starts With The Clearml-Server Mounts The

It seems like the services-docker is always started with Ubuntu 18.04, even when I use
task.set_base_docker( "continuumio/miniconda:latest -v /opt/clearml/data/fileserver/:{}".format( file_server_mount ) )

3 years ago

0 Hi Everyone, How Can I Add A New Local Storage Location. Clearml Is In /Opt/Clearml And I Have Now Added A Second Hard Driver To My Server. How Can I Make Clearml Use The Additional Storage Space?

Hi SuccessfulKoala55
I meant that in the WebUI deletion should only be allowed for artifacts for which deletion actually works.
For example I now have a lot of lingering artifacts that exist on the fileservers, but not on the clearml-api-server (I think).
Another example: I delete a task via WebUI. ClearML-server tries to delete the task and the artifacts belonging to the task. However, it will show that the task has been successfully deleted but some artifacts have not. Now there is no way...

2 years ago

0 Hello! Since Today I Get

Just tried: also works with 0.17.2

3 years ago

0 Hi Everyone, I Have A Quick Question: When I Use

I have set default_output_uri to s3://my_minio_instance:9000/clearml

If I set files_server to s3://my_minio_instance:9000 /bucket_that_does_not_exist it fails at uploading metrics, but model upload still works:

WARNING - Failed uploading to s3://my_minio_instance:9000/ bucket_that_does_not_exist ('NoneType' object has no attribute 'upload')

clearml.Task - INFO - Completed model upload to s3://my_minio_instance:9000/clearml

What is ` default_out...

2 years ago

Show more results