Answering myself for future interested users (at least GrumpySeaurchin29 I think you were interested):
You can "hide" (explained below) secrets directly in the agent 😁 :
When you start the agent listening to a specific queue (i.e. the services worker), you can specify additional environment variables by prefixing them to the command, e.g. FOO='bar' clearml-agent daemon ....
Modify the example AWS autoscaler script - after the driver = AWSDriver.from_config(conf), inject ...
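For illustration, a minimal sketch of how a task could read such an injected variable (the variable name FOO and the project/task names are just placeholders, and this assumes the task inherits the daemon's environment):

```python
import os

from clearml import Task

# Placeholders - any task executed by an agent started with
# `FOO='bar' clearml-agent daemon ...` should see FOO in its environment.
task = Task.init(project_name="examples", task_name="read-injected-secret")

secret = os.environ.get("FOO")
if secret is None:
    raise RuntimeError("FOO was not injected into the agent's environment")

# Use the secret at runtime without ever writing it into the task's
# recorded configuration or logs.
```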
CostlyOstrich36 I'm not sure what you mean by "through the apps", but any script AFAICS would expose the values of these environment variables; or what am I missing?
True, and we plan to migrate to pipelines once we have some time for it :) but anyway that condition is flawed I believe
So now we need to pass Task.init(deferred_init=0), because the default Task.init(deferred_init=False) is wrong
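For reference, a minimal sketch of that workaround (project/task names are placeholders):

```python
from clearml import Task

# Pass the integer 0 explicitly instead of relying on the default
# deferred_init=False, which currently trips the flawed check.
task = Task.init(
    project_name="examples",        # placeholder
    task_name="non-deferred-init",  # placeholder
    deferred_init=0,
)
```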
That's a nice workaround of course - I'm sure it works and I'll give it a shot momentarily. I'm just wondering if ClearML could automatically recognize image files in upload_artifact (and other well-known suffixes) and do that for me.
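For context, a minimal sketch of the kind of call in question (file and artifact names are placeholders; whether the UI renders the image as the preview depends on the server/SDK version):

```python
from clearml import Task

task = Task.init(project_name="examples", task_name="artifact-preview")  # placeholders

# Upload a local PNG as an artifact; ideally the image suffix would be
# recognized and the preview would show the image itself.
task.upload_artifact(name="confusion_matrix", artifact_object="confusion_matrix.png")
```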
Actually TimelyPenguin76 I get only the following as a "preview" -- I thought the preview for an image would be... the image itself..?
Thanks David! I appreciate that, it would be very nice to have a consistent pattern in this!
Note that it would succeed if e.g. run with pytest -s
SmugDolphin23 I think you can simply change not (type(deferred_init) == int and deferred_init == 0) to deferred_init is True?
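To illustrate why the current check misbehaves, here is the quoted expression evaluated for the values discussed above (just a demonstration snippet):

```python
def is_deferred(deferred_init):
    # The condition as quoted above.
    return not (type(deferred_init) == int and deferred_init == 0)

print(is_deferred(False))  # True  - the default False takes the deferred path (bool is not int)
print(is_deferred(0))      # False - only the explicit integer 0 disables it
print(is_deferred(True))   # True
```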
I'll see if we can do that still (as the queue name suggests, this was a POC, so I'm trying to fix things before they give up 😛 ).
Any other thoughts? The original thread https://clearml.slack.com/archives/CTK20V944/p1641490355015400 suggests this PR solved the issue
Something like this, SuccessfulKoala55 ?
1. Open a bash session on the docker: docker exec -it <docker id> /bin/bash
2. Open a mongo shell: mongo
3. Switch to the backend db: use backend
4. Get the relevant project IDs: db.project.find({"name": "ClearML Examples"}) and db.project.find({"name": "ClearML - Nvidia Framework Examples/Clara"})
5. Remove the relevant tasks: db.task.remove({"project": "<project_id>"})
6. Remove the project IDs: db.project.remove({"name": ...})
(a rough pymongo equivalent is sketched below)
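If it's easier to script this from the host instead of typing into the mongo shell, here is a rough pymongo sketch of the same steps (an assumption on my side: it requires the mongo port to be reachable from wherever you run it, and the collection/field names to match the shell commands above):

```python
from pymongo import MongoClient

# Assumes the MongoDB port is reachable (e.g. published by the docker container).
client = MongoClient("mongodb://localhost:27017/")
db = client["backend"]

for name in ("ClearML Examples", "ClearML - Nvidia Framework Examples/Clara"):
    project = db.project.find_one({"name": name})
    if project is None:
        continue
    # Remove the tasks belonging to the project, then the project entry itself.
    db.task.delete_many({"project": project["_id"]})
    db.project.delete_one({"_id": project["_id"]})
```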
Holy crap this was a light-bulb moment, is this listed somewhere in the docs?
It solves so many of my issues xD
and I don't think it's in the docs - we'll add that
Very welcome update; please use some highlighting for it too - it's so important for a complete understanding of how the remote execution works.
Exactly; the cloud instances (that are run with clearml-agent) should have that clearml.conf + any changes specified in extra_clearml_configuration for the scaler
I guess it does not do so for all settings, but only those that come from Session()
Right, but that's as defined in the services agent, which is not immediately transparent
Let me know if you do; would be nice to have control over that 😁
The idea is that the features would be copied/accessed by the server, so we can transition slowly and not use the available storage manager for data monitoring
Or some users that update their poetry.lock, and some that update it manually because they prefer to resolve dependencies on their own.
Well, you can install the binary in the additional startup commands.
Matter of fact, you can just include the ECR login in the "startup steps" offered by the scaler, so there's no need for this repository. I was thinking these were local instances.
Kinda, yes, and this has changed with 1.8.1.
The thing is that AFAIK ClearML does not currently officially support spawning more tasks from a remotely executed task, so we also have a small hack that marks the remote "master process" as a local task before anything else runs.
Coming back to this; ClearML prints a lot of error messages in local tests, presumably because the output streams have already been closed:
--- Logging error ---
Traceback (most recent call last):
  File "/usr/lib/python3.10/logging/__init__.py", line 1103, in emit
    stream.write(msg + self.terminator)
ValueError: I/O operation on closed file.
Call stack:
  File "/home/idan/CC/git/ds-platform/.venv/lib/python3.10/site-packages/clearml/task.py", line 3504, in _at_exit
    self.__shutdown...
i.e. ERROR Fetching experiments failed. Reason: Backend timeout (600s)
ERROR Fetching experiments failed. Reason: Invalid project ID
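One thing that may help in the tests (just an assumption on my side, not a confirmed fix): close the task explicitly inside the test, while pytest's captured streams are still open, e.g. with a fixture like this (names are placeholders):

```python
import pytest
from clearml import Task


@pytest.fixture
def clearml_task():
    # Placeholders for whatever the test actually exercises.
    task = Task.init(project_name="tests", task_name="unit-test-task")
    yield task
    # Close the task before pytest tears down its captured stdout/stderr,
    # so the at-exit shutdown doesn't try to write to already-closed streams.
    task.close()
```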
Hey FrothyDog40 ! Thanks for clarifying - guess we'll have to wait for that as a feature 😁
Should I create a new issue or just add to this one? https://github.com/allegroai/clearml/issues/529
I will TIAS, but maybe it's worthwhile to also mention whether it has to be an absolute path or if a relative path is fine too!
Sure CostlyOstrich36 , sorry it took me so long to reply. I minimized the window a bit here so everything will fill in nicely. Worth mentioning this happens on all pages of course, but I went to the profile page so you can also see the clearml server version.
We have a more complicated case but I'll work around it 😄
Follow up though - can configuration objects refer to one another internally in ClearML?
Oh and clearml-agent==1.1.2