Any leads TimelyPenguin76? I've also tried setting up a MinIO S3 bucket, but I'm not sure if the remote agent has copied the credentials and host 🤔
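Roughly the setup I mean, as a minimal sketch (the endpoint, bucket, and names below are placeholders, and the credentials themselves would still need to be present on the agent side, which is exactly the part I'm unsure got copied):
from clearml import Task

# Placeholder MinIO endpoint/bucket; the matching key/secret still have to live
# in the agent's clearml.conf or environment for remote uploads to work.
task = Task.init(
    project_name="my_project",      # placeholder
    task_name="minio_test",         # placeholder
    output_uri="s3://minio.local:9000/my-bucket",   # placeholder endpoint + bucket
)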
Not really - it will just show the string. A preview would be more like a low-res version of the uploaded image or similar.
Alternatively, it would be good to be able to both specify some requirements and auto-detect 🤔
A follow-up question (instead of opening a new thread): is there a way I could signal some files/directories to be copied to the execute_remotely task?
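One rough workaround I've been considering (not sure it's the intended mechanism; the artifact name and path are placeholders):
from clearml import Task

task = Task.init(project_name="my_project", task_name="remote_run")   # placeholders
# Attach the directory as an artifact before going remote; inside the remote run
# it could then be fetched with task.artifacts["config_dir"].get_local_copy()
task.upload_artifact(name="config_dir", artifact_object="./configs")
task.execute_remotely(queue_name="default")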
Ah, you meant “free python code” in that sense. Sure, I see that. The repo arguments also exist for functions though.
Sorry for hijacking your thread @<1523704157695905792:profile|VivaciousBadger56>
Does it make sense to you to run several such glue instances, to manage multiple resource requirements?
Latest (1.5.1 I believe?), full log incoming, but it's the same as what I've already posted elsewhere 🤔
It just sets up the environment and immediately crashes when trying to run the code.
The setup itself is done correctly.
Perfect now 👌 (also nice cleanup of the default_new_data_root duplicate code :D)
No, that does not seem to work, I get:
task.execute_remotely(queue_name="default")
2024-01-24 11:28:23,894 - clearml - WARNING - Calling task.execute_remotely is only supported on main Task (created with Task.init)
Defaulting to self.enqueue(queue_name=default)
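For reference, the pattern the warning seems to point at, as a minimal sketch (project/task names are placeholders):
from clearml import Task

# execute_remotely() is expected to be called on the main task created by Task.init(),
# not on a cloned/sub task object.
task = Task.init(project_name="my_project", task_name="debug_remote")   # placeholders
task.execute_remotely(queue_name="default")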
Any follow-up thoughts, @<1523701070390366208:profile|CostlyOstrich36> , or maybe @<1523701087100473344:profile|SuccessfulKoala55> ? 🤔
Perfect, thanks for the answers Valeriano. These small details are missing from the documentation, but I now feel much more confident in setting this up.
Sorry AgitatedDove14 , forgot to get back to this.
I've been trying to convince my team to drop poetry 😄
Added the following line under volumes for apiserver, fileserver, and agent-services:
- /data/clearml:/data/clearml
Consider e.g.:
# steps.py
class DataFetchingStep:
    def __init__(self, source, query, locations, timestamps):
        ...

    def run(self, queue=None, **kwargs):
        ...


class DataTransformationStep:
    def __init__(self, inputs, transformations):
        # inputs can include instances of DataFetchingStep, or local files, for example
        ...

    def run(self, queue=None, **kwargs):
        ...
And then the following SDK usage in a notebook:
from steps imp...
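Purely as an illustration of what I mean (everything here is hypothetical: the module and class names come from the steps.py sketch above, and none of this is an existing ClearML API):
from steps import DataFetchingStep, DataTransformationStep   # hypothetical module from the sketch above

fetch = DataFetchingStep(source="db", query="SELECT ...", locations=["eu"], timestamps=None)   # placeholder arguments
transform = DataTransformationStep(inputs=[fetch, "local_file.csv"], transformations=["normalize"])   # placeholder arguments

# Each step decides whether to run locally or be pushed to a queue
transform.run(queue="default")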
I guess in theory I could write a run_step.py, similarly to how the pipeline in ClearML works… 🤔 And then use Task.create() etc.?
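Something along these lines, as a rough sketch (the repo, names, and the run_step.py entry point are all assumptions on my part):
from clearml import Task

# Assumption: run_step.py would be a thin entry point that reconstructs and runs a single step.
task = Task.create(
    project_name="my_project",                  # placeholder
    task_name="data_fetching_step",             # placeholder
    repo="https://github.com/org/repo.git",     # placeholder repo
    script="run_step.py",                       # the hypothetical entry point mentioned above
)
Task.enqueue(task, queue_name="default")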
Hey @<1523701070390366208:profile|CostlyOstrich36> , thanks for the reply!
I’m familiar with the above repo; we have the ClearML Server and such deployed on K8s.
What’s lacking is documentation regarding the clearml-agent helm chart. What exactly does it offer, etc.
We’re interested in e.g. using karpenter to scale our deployments per demand, effectively replacing the AWS autoscaler.
Setting the endpoint will not be the only thing missing though, so unfortunately that's insufficient 😞
Any thoughts @<1523701070390366208:profile|CostlyOstrich36> ?
I wouldn’t want to run the entire notebook, just a specific part of it.
Hey @<1537605940121964544:profile|EnthusiasticShrimp49>! You’re mostly correct. The Step classes will be predefined (of course developers are encouraged to add/modify as needed), but as in the DataTransformationStep, there may be user-defined functions specified. That’s not a problem though, I can provide these functions with the helper_functions argument.
- The .add_function_step is indeed a failing point. I can’t really create a task from the notebook because calling `Ta...
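Just to make the helper_functions part above concrete, roughly what I mean (the step function and helper are placeholders; only the helper_functions argument is the point):
from clearml import PipelineController

def normalize(df):        # placeholder user-defined helper
    ...

def fetch_data(query):    # placeholder step function that calls the helper
    ...

pipe = PipelineController(name="my_pipeline", project="my_project", version="0.0.1")   # placeholders
pipe.add_function_step(
    name="fetch",
    function=fetch_data,
    function_kwargs=dict(query="SELECT ..."),
    helper_functions=[normalize],   # user-defined functions packaged alongside the step
)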
Yes, I’ve found that too (as mentioned, I’m familiar with the repository). My issue is still that there is no documentation as to what this actually offers.
Is this simply a helm chart to run an agent on a single pod? Does it scale in any way? Basically, is it a simple agent (similar to on-premise agents, running in the background, but here on K8s), or is it a more advanced one that offers scaling features? What is it intended for, and how does it work?
The official documentation is very spa...
We load the endpoint (and S3 credentials) from a .env file, so they're not immediately available at the time of from clearml import Task.
It's a convenience thing, rather than exporting many environment variables that are tied together.
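Concretely, the pattern looks roughly like this (assuming python-dotenv; the .env holds things like CLEARML_API_HOST, CLEARML_API_ACCESS_KEY, and the AWS_* keys):
from dotenv import load_dotenv

# Load the ClearML/S3 variables from .env before clearml reads its configuration
load_dotenv()

from clearml import Task   # imported only after the environment is populated

task = Task.init(project_name="my_project", task_name="example")   # placeholders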
I can elaborate in more detail if you have the time, but generally the code is just defined in some source files.
I’ve been trying to play around with pipelines for this purpose, but as suspected, it fails finding the definition for the pickled object…
The odd thing is that it was already defined, and then when I clicked an S3 link, it asked me to fill it in again, adding a duplicate credentials row
I am indeed