Okay this was a deep dive into clearml-agent code 😁
Took a long time to figure out that one specific Python version had an old virtualenv (Python 3.6.9 and Python 3.8 had the latest virtualenv, but Python 3.7.5 had an old one).
Then the task requested Python 3.7, and that old virtualenv version was broken.
As a result: could the agent maybe also output the virtualenv version used when setting up the environment for the first time?
I think I may have brought this up multiple times in different ways :D
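In hindsight, a quick diagnostic along these lines would have saved me time (just a sketch; the interpreter names on PATH are assumptions):
` # Print the virtualenv version each interpreter sees.
import subprocess

for py in ("python3.6", "python3.7", "python3.8"):
    try:
        out = subprocess.run(
            [py, "-m", "virtualenv", "--version"],
            capture_output=True, text=True, check=False,
        )
        print(py, (out.stdout or out.stderr).strip())
    except FileNotFoundError:
        print(py, "not found on PATH") `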
When dealing with long and complicated configurations (whether config objects, yaml, or otherwise), it's often useful to break them down into relevant chunks (think hydra, maybe).
In our case, we have a custom YAML instruction `!include`, i.e.
` # foo.yaml
bar: baz

# bar.yaml
obj: !include foo.yaml
maybe_another_obj: !include foo.yaml `
Say I upload each of these yamls as a configuration object (as with the above). Once I try to load bar.yaml remotely it will crash, since foo.yaml is missing (and is instead a clearml configuration object).
Does that make sense?
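For context, here's a minimal sketch of how such an `!include` constructor can be registered with PyYAML (not our exact implementation; paths are resolved naively against the working directory):
` import yaml

def include_constructor(loader, node):
    # Resolve the scalar (e.g. "foo.yaml") and load that file in place.
    path = loader.construct_scalar(node)
    with open(path) as f:
        return yaml.safe_load(f)

yaml.SafeLoader.add_constructor("!include", include_constructor)

with open("bar.yaml") as f:
    config = yaml.safe_load(f)  # obj / maybe_another_obj now hold foo.yaml's contents `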
Yes, exactly! I've added instructions for the users on creating their account and running `clearml-init`, and then they run the snippet that updates the api and sdk sections.
Or did you mean I can couple a short "mini config" with the package and redirect clearml to use this local one (instead of the one at ~/clearml.conf)?
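If it's the latter, something like this is what I'd imagine (a sketch; the bundled config file name is hypothetical):
` import os
from pathlib import Path

# Point ClearML at a package-local "mini config" instead of ~/clearml.conf.
# This must happen before the clearml import, since the config file is
# resolved at import time.
os.environ["CLEARML_CONFIG_FILE"] = str(Path(__file__).parent / "mini_clearml.conf")

from clearml import Task `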
I'll have yet another look at both the latest agent RC and at the docker-compose, thanks!
There was no "default" services agent btw, just the queue; I had to launch an agent myself (not sure if it's relevant).
Hey @<1523701070390366208:profile|CostlyOstrich36> , thanks for the reply!
I’m familiar with the above repo, we have the ClearML Server and such deployed on K8s.
What’s lacking is documentation regarding the clearml-agent helm chart. What exactly does it offer, etc.
We’re interested in e.g. using karpenter to scale our deployments per demand, effectively replacing the AWS autoscaler.
Right, and then for text (file path) use some regex or similar for extraction, and for a dictionary simply parse the values?
We're using the example autoscaler, nothing modified
Also, creating from functions allows dynamic pipeline creation without requiring the tasks to pre-exist in ClearML, which is IMO the strongest point to make about it
I'm not sure about the intended use of `connect_configuration` now.
I was under the assumption that in `connect_configuration(configuration, name=None, description=None)`, the `configuration` is only used in local execution.
But when I run `config = task.connect_configuration({}, name='General')` (in remote execution), the configuration is set to the empty dictionary.
There used to be a good example but it's now missing. I'm not sure what "Use only for automation (externally), otherwise use `Task.connect_configuration`" means when e.g. looking at `Task.set_configuration_object`, etc.
Could you clarify a bit, CostlyOstrich36 or AgitatedDove14?
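To be concrete, this is the behavior I assumed (a sketch, assuming a configured ClearML server; project/task names are placeholders):
` from clearml import Task

task = Task.init(project_name="examples", task_name="config-demo")
defaults = {"lr": 0.001, "batch_size": 32}
# Locally: stores and returns the defaults.
# Remotely: I expected the task's stored configuration to be returned,
# but with {} I get the empty dict back instead.
config = task.connect_configuration(defaults, name="General")
print(config) `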
First bullet point - yes, exactly
Second bullet point - all of it, really. The SDK documentation and the examples.
For example, the `Task` object is heavily overloaded and its documentation would benefit from being separated into logical units of work. It would also make it easier for the ClearML team to spot any formatting issues.
Any linked GitHub example is welcome, but some visualization/inline code with explanations would also be very much appreciated.
Basically when running remotely, the first argument to any configuration (whether object or string, or whatever) is ignored, right?
That's exactly what I meant AgitatedDove14 🙂 It's just that to access that comparison page, you have to make a comparison first. It would be handy to have a link (in the side bar?) to an empty comparison
There's not much (or anything) in the log to provide...
` (.venv) 15:42 [0:user@server$~] CLEARML_CONFIG_FILE=~/agent_clearml.conf clearml-agent daemon --queue default on_prem --detached --order-fairness
Environment variables set from configuration: ['AWS_ACCESS_KEY_ID', 'AWS_SECRET_ACCESS_KEY', 'AWS_DEFAULT_REGION']
... `
Ah, you meant “free python code” in that sense. Sure, I see that. The repo arguments also exist for functions though.
Sorry for hijacking your thread @<1523704157695905792:profile|VivaciousBadger56>
Generally, really. I've struggled recently (and in the past), because the documentation seems:
- Very complete wrt the available SDK (though the formatting is sometimes off)
- Very lacking wrt how things interact with one another
A lot of what I need I actually find by plunging into the source code.
I think ClearML would benefit a lot if it adopted a documentation structure similar to the numpy ecosystem (numpy, pandas, scipy, scikit-image, scikit-bio, scikit-learn, etc.)
I see, okay that already clarifies some stuff, I'll dig a bit more into this then! Thanks!
Sorry, found it on my end!
@<1523701205467926528:profile|AgitatedDove14> this
My suspicion is that this relates to https://clearml.slack.com/archives/CTK20V944/p1643277475287779 , where the config file is loaded prematurely (upon `import`), so our `dotenv.load_dotenv()` call has not yet registered.
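The fix we'd want is roughly this ordering (a sketch, assuming the relevant settings live in a .env file):
` from dotenv import load_dotenv

load_dotenv()  # must run before the clearml import below

from clearml import Task  # config is parsed at import time

task = Task.init(project_name="examples", task_name="dotenv-order-demo") `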
Honestly, this is all related to issue #340. The only reason we have this to begin with is because we need one separate "initializer" task that downloads the remote cache and prepares the agent environment for execution (downloading the configuration files, etc).
Otherwise it fits perfectly with pipelines, but we're not there yet.
In the local execution we don't have this initializer task, so we use `Task.init()` before starting to work on a model, and `task.close()` when we're done...
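i.e. the local flow looks roughly like this (a sketch; project/task names are placeholders):
` from clearml import Task

task = Task.init(project_name="examples", task_name="model-a")
# ... work on the model ...
task.close()  # allows a subsequent Task.init in the same process `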
So basically what I'm looking for and what I have now is something like the following:
(Local) I have a well-defined `aws_autoscaler.yaml` that is used to run the AWS autoscaler. That same autoscaler is also run with `CLEARML_CONFIG_FILE=...`.
(Remotely) The autoscaler launches, listens to the predefined queue, and is able to launch instances as needed. I would run a remote execution task object that's appended to the autoscaler queue. The autoscaler picks it up, launches a new instanc...
I think you're looking for the `execute_remotely` function?
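Something along these lines (a minimal sketch, assuming a queue named "default" exists):
` from clearml import Task

task = Task.init(project_name="examples", task_name="remote-demo")
# Stops the local run here and enqueues the task for an agent to pick up.
task.execute_remotely(queue_name="default", exit_process=True)
# Anything below only runs in the remote (agent) execution. `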
Hey SuccessfulKoala55! Is the configuration file needed for `Task.running_locally()`? This is tightly related to issue #395, where we need additional files for remote execution but have no way to attach them to the task other than using the `StorageManager` as a temporary cache.
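For reference, the StorageManager workaround looks roughly like this (a sketch; the bucket URL is hypothetical):
` from clearml import StorageManager

# Fetch an auxiliary file from remote storage into the local cache.
local_path = StorageManager.get_local_copy(remote_url="s3://my-bucket/configs/extra.yaml")
print(local_path)  # cached local file path `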
We're wondering how many on-premise machines we'd like to deprecate. For that, we want to see how often our "on premise" queue is used (how often a task is submitted and run), for how long, how many resources it consumes (on average), etc.
I don't think there's a PR or issue for that yet; at least I haven't created one.
I could have a look at this and maybe make a PR.
Not sure what would the recommended flow be like though 🤔