So it definitely seems to be a problem with Docker and not with ClearML. However, I do not get why it works for you but on none of my machines (all Ubuntu 20.04 with Docker 20.10).
No. Here is a better example. I have two types of workstations: Type X can execute tasks of type A and B. Type Y can execute tasks of type B. This could be the case if type X workstations have for example more VRAM, newer drivers, etc...
I have two queues. Queue A and Queue B. I submit tasks of type A to queue A and tasks of type B to queue B.
Here is what can happen:
Enqueue the first task of type B. Workstations of type X will run this task. Enqueue the second task of type A. Workstation ...
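Here is roughly how I set up the agents for that (just a sketch; the queue names and the --docker flag are only how I happen to run it):

```
# Type X workstation: its agent listens to both queues, so it can pick up task types A and B
clearml-agent daemon --queue queue_a queue_b --docker

# Type Y workstation: its agent listens only to queue B, so it never gets type A tasks
clearml-agent daemon --queue queue_b --docker
```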
[2021-05-07 10:53:00,566] [9] [WARNING] [elasticsearch] POST ` [status:N/A request:60.061s]
Traceback (most recent call last):
File "/usr/local/lib/python3.6/site-packages/urllib3/connectionpool.py", line 445, in _make_request
six.raise_from(e, None)
File "<string>", line 3, in raise_from
File "/usr/local/lib/python3.6/site-packages/urllib3/connectionpool.py", line 440, in _make_request
httplib_response = conn.getresponse()
File "/usr/lib64/python3.6/http/client.py", lin...
So I just tried again, but with manual deleting via Web UI.
I am wondering where to put my experiment logic so that it gets lazily executed and not at task-definition time (i.e. how do I get my experiment logic into get_task_experiment() without running it there).
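Something like this is what I am after (just a sketch of the idea; run_experiment is a placeholder for my logic, and I am assuming execute_remotely only returns once the task actually runs on the worker):

```python
from clearml import Task

def run_experiment():
    # placeholder for the actual experiment logic; it must not run at task-definition time
    ...

task = Task.init(project_name="examples", task_name="lazy experiment")
# locally this enqueues the task and exits the process; on the agent it is a no-op and execution continues
task.execute_remotely(queue_name="default", clone=False, exit_process=True)
run_experiment()  # only reached on the worker, so the logic is executed lazily
```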
Alright, that's unfortunate. But thank you very much!
With remote execution it is command="[...]", but locally it is command='train', like it is supposed to be.
But here is the funny thing:
channels:
- pytorch
- conda-forge
- defaults
dependencies:
- cudatoolkit=11.1.1
- pytorch=1.8.0
This installs the GPU build.
SuccessfulKoala55 I just had the issue again. The logs show nothing of interest. It looks like OOM to me, but I will test this again with a much larger swap, so the server only slows down but does not kill anything. Unfortunately, the kernel logs also do not show much (maybe I have my server logs misconfigured, I am no expert).
What is interesting, though, is that docker showed only my nginx, minio and docker-registry containers as having exited, while all the clearml containers were still running. I restarted ...
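In case it is useful, this is roughly how I plan to check for OOM kills next time (assuming journalctl is available on the host; the commands are just a sketch):

```
# look for OOM-killer entries in the kernel log
dmesg -T | grep -i -E "out of memory|killed process"
journalctl -k --since "today" | grep -i oom

# show which containers exited and when
docker ps -a --format "table {{.Names}}\t{{.Status}}"
```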
Any idea why deletion of artifacts on my second fileserver does not work?
fileserver_datasets:
  networks:
    - backend
    - frontend
  command:
    - fileserver
  container_name: clearml-fileserver-datasets
  image: allegroai/clearml:latest
  restart: unless-stopped
  volumes:
    - /opt/clearml/logs:/var/log/clearml
    - /opt/clearml/data/fileserver-datasets:/mnt/fileserver
    - /opt/clearml/config:/opt/clearml/config
  ports:
    - "8082:8081"
ClearML successfu...
I guess this is from clearml-server and seems to be bottlenecking artifact transfer speed.
Mhhm, good hint! Unfortunately, I can see nowhere in the logs when the server creates a delete request.
Okay, thanks for explaining!
Yea, but doesn't this feature make sense on a task level? If I remember correctly, some dependencies will sometimes require different pip versions, and dependencies are defined on a per-task basis.
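For context, as far as I understand this currently lives at the agent level in clearml.conf, e.g. (the value here is just an example):

```
agent {
    package_manager {
        # applies to every task this agent runs, not per task
        pip_version: "<20.2"
    }
}
```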
Can you tell me how to create tasks correctly? PipelineController.add_step takes a task-id/task-name, but I would rather just define a function that returns the task directly, since the base task may not already be on the clearml-server.
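What I would like is something along these lines (just a sketch; I am assuming PipelineController.add_function_step works roughly like this, and all names are illustrative):

```python
from clearml import PipelineController

def step_one(param: int = 1):
    # the controller turns this function into a task, so no base task
    # has to exist on the clearml-server beforehand
    return param * 2

pipe = PipelineController(name="pipeline demo", project="examples", version="0.1")
pipe.add_function_step(
    name="step_one",
    function=step_one,
    function_kwargs=dict(param=3),
    function_return=["doubled"],
)
pipe.start(queue="default")
```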
Nvm. I forgot to start my agent with --docker. So here comes my follow-up question: it seems like there is no way to define that a Task requires docker support from an agent, right?
And how do I specify this fileserver as the output_uri?
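Something like this is what I have in mind (a sketch; the host name is a placeholder, and 8082 is the port mapped in the compose snippet above):

```python
from clearml import Task

task = Task.init(
    project_name="examples",
    task_name="artifacts example",
    output_uri="http://my-clearml-host:8082",  # artifacts/models go to the second fileserver
)
```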
# Connecting ClearML with the current process,
# from here on everything is logged automatically
task = Task.init(project_name="examples", task_name="artifacts example")
task.set_base_docker(
    "my_docker",
    docker_arguments="--memory=60g --shm-size=60g -e NVIDIA_DRIVER_CAPABILITIES=all",
)
if not running_remotely():
    task.execute_remotely("docker", clone=False, exit_process=True)
timer = Timer()
with timer:
    # add and upload Numpy Object (stored as .npz file)
    task.upload_a...
Obviously, a lot of stuff is missing in my examples. I just want to show that the user should be able to replicate Task.init easily, so it can be configured in every way while still making use of the magic that clearml has for everything that does not differ from the comfort way.
Mhhm, now conda env creation takes forever, since it probably resolves conflicts. At least that is what happened when I tried to manually install my environment.
I just manually went into the docker container and ran python -m venv env --system-site-packages and activated the virtual env.
When I then run pip list, it correctly shows the preinstalled packages, including torch 1.12.0a0+2c916ef.
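For reference, the exact steps inside the container were roughly:

```
python -m venv env --system-site-packages
source env/bin/activate
pip list    # correctly shows the preinstalled packages, including torch 1.12.0a0+2c916ef
```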
Hi TimelyMouse69 Thank you for your answer.
I use 3.10.8 locally and 3.10.6 remotely. Everything is run in a docker container, locally and remotely on the docker-agent (exactly the same docker image).
Thank you for looking into the disappearing dev. It seems like this is the reason why pip tries to install a stable version of 1.14, which only exists as a nightly build.
btw: Could you check whether agent.package_manager.system_site_packages is true or false in your config and in the summary that the agent gives before execution?
I start my agent in --foreground mode for debugging and it clearly shows false, but in the summary that the agent prints before the task is executed, it shows true.
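This is the clearml.conf section I am talking about (the value here is just for illustration):

```
agent {
    package_manager {
        # if true, the created venv inherits the packages already installed in the docker image
        system_site_packages: true
    }
}
```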
You mean I can add exactly what you wrote (--extra-index-url, clearml, torch == 1.14.0.dev20221205+cu117, torchvision == 0.15.0.dev20221205+cpu) to the installed packages section?
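i.e. the Installed Packages section would then look roughly like this (the index URL is left as a placeholder, since it is not spelled out above):

```
--extra-index-url <nightly index URL from your message>
clearml
torch == 1.14.0.dev20221205+cu117
torchvision == 0.15.0.dev20221205+cpu
```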