Right, but that's as defined in the services agent, which is not immediately transparent
Sorry, I misspoke, yes of course, the agents config file, not the queues
Would be good if that's mentioned explicitly in the docs 🙂 Thanks!
CostlyOstrich36 so internal references are not resolved somehow? Or, how should one achieve:
```
def my_step():
    from ..utils import foo
    foo("bar")
```
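For context, this is the self-contained variant I could fall back to if relative imports can't be resolved when the step runs standalone (`foo` here is just a hypothetical stand-in for our helper, not ClearML API):

```python
def my_step():
    # Workaround sketch: inline the helper so the step carries no
    # relative-import dependency when executed on its own.
    def foo(msg: str) -> str:
        # hypothetical helper; ours lives in ..utils
        return f"processed: {msg}"
    return foo("bar")
```

It works, but it defeats the point of having shared utils, hence the question.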
The screenshot is small since the data is private anyway, but it's enough to see:
"Metric: untitled 00" and "plot image" as the image title. The attached histogram has a title ("histogram of ...").
It failed on some missing files in my remote_execution, but otherwise seems fine now
Thanks for your help SuccessfulKoala55 ! Appreciate the patience 🙂
I guess following the example https://github.com/allegroai/clearml/blob/master/examples/advanced/execute_remotely_example.py , it's not clear to me how the server has access to the data loaders location when it hits execute_remotely
The idea is that the features would be copied/accessed by the server, so we can transition slowly and not use the available storage manager for data monitoring
A follow-up question (instead of opening a new thread): is there a way I could signal some files/directories to be copied to the execute_remotely task?
From the log you shared, the task is picked up by the worker_d1bd92a3b039400cbafc60a7a5b1e52b_4e831c4cbaf64e02925b918e9a3a1cf6_<hostname>:gpu0,1 worker
I can try and target the default one if it helps..?
I was thinking of using the --volume settings in clearml.conf to mount the relevant directories for each user (so it's somewhat customizable). Would that work?
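For reference, a minimal sketch of the kind of thing I mean in clearml.conf (the host/container paths are hypothetical, and I'm assuming extra docker flags go under agent.default_docker.arguments):

```
agent {
  default_docker {
    # extra flags passed to `docker run`; one mount per user, e.g.:
    arguments: ["-v", "/home/alice/data:/data"]
  }
}
```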
It would be amazing if one can specify specific local dependencies for remote execution, and those would be uploaded to the file server and downloaded before the code starts executing
This could be relevant SuccessfulKoala55 ; might entail some serious bug in ClearML multiprocessing too - https://stackoverflow.com/questions/45665991/multiprocessing-returns-too-many-open-files-but-using-with-as-fixes-it-wh
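To see how close a process is to hitting that error, this is the check I ran (stdlib only; on Linux the limit is the RLIMIT_NOFILE the linked question is about):

```python
import resource

# Inspect the per-process open-file limit; "too many open files" is
# raised once the soft limit is exhausted.
soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
print(f"open-file limit: soft={soft}, hard={hard}")
```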
This happened again 🤔
How many files does ClearML touch? 🤯
Let me know if there's any additional information that can help SuccessfulKoala55 !
Because setting env vars and ensuring they exist on the remote machine during execution etc. is more complicated 🙂
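Concretely, what I'd rather avoid is sprinkling fallbacks like this everywhere (DATA_ROOT is a hypothetical variable name, not something ClearML defines):

```python
import os

# Read a path from the environment with a fallback, since the variable
# may simply not be set on the remote machine.
data_root = os.environ.get("DATA_ROOT", "/mnt/data")
print(data_root)
```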
There are always ways around, I was just wondering what the expected flow is 🙂
Of course. We'd like to use S3 backends anyway, I couldn't spot exactly where to configure this in the chart (so it's defined in the individual agent's configuration)
Okay, I'll test it out by trying to downgrade to 4.0.0 and then upgrade to 4.1.2
Just to make sure, the chart_ref is allegroai/clearml right? (for some reason we had clearml/clearml and it seems like it previously worked?)
Full log:
```
command: /usr/sbin/helm --version=4.1.2 upgrade -i --reset-values --wait -f=/tmp/tmp77d9ecye.yml clearml clearml/clearml
msg: |-
  Failure when executing Helm command. Exited 1.
  stdout:
  stderr: W0728 09:23:47.076465    2345 warnings.go:70] policy/v1beta1 PodDisruptionBudget is deprecated in v1.21+, unavailable in v1.25+; use policy/v1 PodDisruptionBudget
W0728 09:23:47.126364    2345 warnings.go:70] policy/v1beta1 PodDisruptionBudget is deprecated in v1.21+, unava...
```
Removing the PVC is just setting the state to absent AFAIK
For now this is okay - no data lost, really - but I'd like to make sure we're not missing any steps in the next upgrade
Hm, I'm not sure I follow 🤔 How does the API server config relate to the file server?
We have a mini default config (if you remember from a previous discussion we had) that actually uses the second form you suggested.
I wrote a small "fixup" script that combines this default with the one generated by clearml-init, and it simply does:

```
def_config = ConfigFactory.parse_file(DEF_CLEARML_CONF, resolve=False)
new_config = ConfigFactory.parse_file(new_config_file, resolve=False)
updated_new_config = ConfigTree.merge_configs(new_config, def_config)
```
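If I understand merge_configs correctly, the second argument's values take precedence; behaviour-wise it's roughly this recursive dict merge (plain dicts as a stand-in for pyhocon's ConfigTree, just to illustrate the precedence I'm relying on):

```python
def deep_merge(base: dict, override: dict) -> dict:
    # Recursively merge `override` into `base`: scalar values from
    # `override` win, nested dicts are merged key by key.
    merged = dict(base)
    for key, value in override.items():
        if key in merged and isinstance(merged[key], dict) and isinstance(value, dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged
```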
I can scroll sideways but if I open any of the comparison items, I pretty much can only see one experiment's values
AgitatedDove14 The keys are there, and there is no specifically defined user in .gitmodules:

```
[submodule "xxx"]
    path = xxx
    url =
```
I believe this has to do with how ClearML sets up the git credentials perhaps?
Indeed. I'll open an issue, sure!
Yes, exactly. I have not yet had a chance to try this out -- should it work?
We have a read-only user with personal access token for these things, works seamlessly throughout and in our current on premise servers... So perhaps something missing in the autoscaler definitions?
Sounds like a nice idea 🙂
Follow-up: any ideas how to avoid PEP 517 builds with the autoscaler? 🤔 It takes a long time to build the wheels