Is it currently broken? 🤔
I think now there's the following:
- Resource type: Queue (name) defines resource + max instances

And I'm looking for:
- Resource type: "pool" of resources (type + max instances)
- A pool can be shared among queues
Heh, my bad, the term "user" is very much ingrained in our internal way of working. You can think of it as basically any technically-inclined person in your team or company.
Indeed the options in the WebUI are too limited for our use case, so we've developed "apps" that take a yaml configuration file and build a matching pipeline.
With that, our users do not need to code directly, and we can offer much finer control over the pipeline.
As for the imports, what I meant is that I encounter...
Of course now it's not there anymore 🙂 If/when it happens again I'll ping you here 🙂
Maybe it's better to approach this the other way: if one uses `Task.force_requirements_env_freeze()`, then the locally updated packages aren't reflected in poetry 🤔
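For reference, a minimal sketch of where that freeze call sits (assuming the standard clearml API; the project/task names are placeholders):

```python
from clearml import Task

# Freeze requirements from the local environment instead of resolving
# them at remote-execution time; must be called before Task.init
Task.force_requirements_env_freeze(force=True)

task = Task.init(project_name="examples", task_name="frozen-env-run")
```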
That's fine for the current use-case I believe.
Once the team is happy with the logging functionality, we'll move on to remote execution and things will update.
Another example - trying to validate dataset interactions ends with
```python
else:
    self._created_task = True
    dataset_project, parent_project = self._build_hidden_project_name(dataset_project, dataset_name)
    task = Task.create(
        project_name=dataset_project, task_name=dataset_name, task_type=Task.TaskTypes.data_processing)
    if bool(Session.check_min_api_server_version(Dataset.__min_api_version)):
        get_or_create_proje...
```
I'm guessing that's not on PyPI yet?
Hurrah! Added `git config --system credential.helper 'store --file /root/.git-credentials'` to the extra_vm_bash_script and now it works
(logs the given git credentials in the store file, which can then be used immediately for the recursive calls)
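For anyone following along, a rough sketch of what that extra_vm_bash_script addition might look like (host, user, and token below are placeholders):

```bash
# Write the credentials in the standard "https://<user>:<token>@<host>"
# store format, then point git at the store file system-wide
echo "https://my-git-user:my-git-token@github.com" > /root/.git-credentials
git config --system credential.helper 'store --file /root/.git-credentials'
```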
That's probably in the newer ClearML server pages then, I'll have to wait still 🙂
yes, a lot of moving pieces here as we're trying to migrate to AWS and set up autoscaler and more 🙂
Yes 🙂 I want ClearML to load and parse the config before that. But now I'm not even sure those config settings are exposed as environment variables?
I will! (once our infra guy comes back from holiday and updates the install; for some reason they set up server 1.1.1???)
Meanwhile wondering where I got a random worker from
Can I query where the worker is running (IP)?
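In case it helps, a sketch of one way to query that via the ClearML APIClient (the ip field is assumed from the REST workers.get_all endpoint):

```python
from clearml.backend_api.session.client import APIClient

client = APIClient()
# workers.get_all returns the registered workers; each entry carries
# metadata such as its id and (if reported) the machine's IP
for worker in client.workers.get_all():
    print(worker.id, getattr(worker, "ip", None))
```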
Thanks AgitatedDove14, I'll first have to prove viability with the free version :)
Indeed. I'll open an issue, sure!
One more UI question TimelyPenguin76, if I may -- it seems one cannot simply report single integers. The `report_scalar` feature creates a plot of a single data point (or single iteration).
For example if I want to report a scalar "final MAE" for easier comparison, it's kinda impossible 🙂
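For what it's worth, newer clearml releases expose a dedicated call for standalone values (a sketch, assuming clearml >= 1.9; names are placeholders):

```python
from clearml import Task

task = Task.init(project_name="examples", task_name="report-single-value")
# Reports one standalone value instead of a one-point scalar plot
task.get_logger().report_single_value(name="final MAE", value=12.34)
```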
We're not using the docker setup though. The CLI run by the autoscaler is `python -m clearml_agent --config-file /root/clearml.conf daemon --queue aws_small`, so no docker
Yes. Though again, just highlighting the naming of foo-mod is arbitrary. The actual module simply has a folder structured with an implicit namespace:
```
foo/
    mod/
        __init__.py
        # stuff
```
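(To make the point concrete: a minimal check, assuming Python 3.3+ semantics.)

```python
# foo/ has no __init__.py, so Python >= 3.3 resolves it as an
# implicit namespace package (PEP 420) and the import still works:
import foo.mod
```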
FWIW, for the time being I'm just setting the packages to all the packages the pipeline task sees, with:
```python
packages = get_installed_pkgs_detail()
packages = [f"{name}=={version}" if version else name for name, version in packages.values()]
packages = task.data.script.require...
```
Heh, well, John wrote that in the first reply in this thread 🙂
And the main `Task.init` documentation page (nowhere near the code) says the following -
It also happens when `use_current_task=False` though. So the current best approach would be to not combine the task and the dataset?
Basically when there are occasionally extreme values (i.e. most values fall in the [0, 50] range, and one value suddenly lands in the 50e+12 range), the plotting library (matplotlib or ClearML, unsure) hangs for a really long time
The S3 bucket credentials are defined on the agent, as the bucket is also running locally on the same machine - but I would love for the code to download and apply the file automatically!
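As a stopgap, a sketch of pulling the file down in code (the bucket path is a placeholder, and the credentials still need to resolve wherever this runs):

```python
from clearml import StorageManager

# Downloads the remote object into the local ClearML cache and
# returns the local path, which the code can then load and apply
local_path = StorageManager.get_local_copy(remote_url="s3://my-bucket/config/overrides.yaml")
```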
Or if it wasn't clear, that chunk of code is from clearml's dataset.py
I've updated my feature request to describe that as well. A textual description is not necessarily a preview 🙂 For now I'll use the debug samples.
These kinds of things definitely show how ClearML was designed originally only for neural networks tbh, where images are almost always only part of the dataset. Same goes for the consistent use of iteration everywhere 🙂
The network is configured correctly 🙂 But the newly spun up instances need to be set to the same VPC/Subnet somehow
The overall flow I currently have is e.g.:
1. Start an internal task (not a ClearML Task; MLOps not initialized yet)
2. Call some pre_init function with args so I can upload the environment file via StorageManager to S3
3. Call some start_run function with the configuration dictionary loaded, so I can upload the relevant CSV files and configuration file
4. Finally initialize the MLOps (ClearML), start a task, execute remotely
I can play around with 3/4 (so e.g. upload CSVs and configuratio...
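For concreteness, a rough sketch of that flow (pre_init, start_run, the bucket URL and all names are hypothetical placeholders, not a real API):

```python
from clearml import StorageManager, Task

BUCKET = "s3://my-bucket"  # placeholder

def pre_init(env_file: str) -> None:
    # Step 2: upload the environment file before ClearML is initialized
    StorageManager.upload_file(local_file=env_file, remote_url=f"{BUCKET}/envs/{env_file}")

def start_run(config: dict) -> None:
    # Step 3: upload the relevant CSV files and the configuration file
    for csv_path in config.get("csv_files", []):
        StorageManager.upload_file(local_file=csv_path, remote_url=f"{BUCKET}/runs/{csv_path}")

# Step 4: only now initialize ClearML, then hand off to the agent
task = Task.init(project_name="my-project", task_name="my-run")
task.execute_remotely(queue_name="aws_small")
```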