SolidGoose91

12 Questions, 46 Answers

Active since 15 March 2023

Last activity one year ago

Reputation

Badges 1

46 × Eureka!

Questions 12
Answers 46

0 Votes

0 Answers

1K Views

0 Votes 0 Answers 1K Views

(I'D Open An Issue On Github If I Had Found Which Is The Right Repo To Address It)

(I'd open an issue on GitHub if I had found which is the right repo to address it)

clearml

one year ago

0 Votes

2 Answers

1K Views

0 Votes 2 Answers 1K Views

Hi Folks! How Do We Get

Hi folks! How do we get clearml-session to run in a private docker image from a private container registry, in particular from an AWS ECR , please? Very usef...

clearml

one year ago

0 Votes

5 Answers

1K Views

0 Votes 5 Answers 1K Views

Could Some Help Me Figure Out Why Figures Have Disappeared From A Report, Please?

Could some help me figure out why figures have disappeared from a report, please? My colleague made this report 6 weeks ago ago None , but all the graphs hav...

clearml

one year ago

0 Votes

7 Answers

1K Views

0 Votes 7 Answers 1K Views

On a related line but more complicated: how can we ask the Autoscaler to queue, say, N jobs on an N-GPU machine, please? For example, on AWS, NVIDIA A100 GPU...

clearml

one year ago

0 Votes

16 Answers

2K Views

0 Votes 16 Answers 2K Views

Hi Team! Is There A Way To Make Clearml’S Aws Autoscaler And Queues Resource-Aware Please? I.E. If We Can Say, As We Enqueue Our Job, How Much Ram Or Gpu-Ram Or Even Gpus It Needs, Have The Scheduler/Autoscaler Dispatch The Job To Instances That Are Of Th

Hi team! Is there a way to make ClearML’s AWS Autoscaler and queues resource-aware please? I.e. if we can say, as we enqueue our job, how much RAM or GPU-RAM...

clearml

one year ago

0 Votes

7 Answers

1K Views

0 Votes 7 Answers 1K Views

Hi All, In The Aws Scheduler, What’S The Difference Between:

Hi all, in the AWS scheduler, what’s the difference between: - Regular Instance Rollback Timeout - Spot Instance Blackout Periodplease?

clearml

one year ago

0 Votes

20 Answers

2K Views

0 Votes 20 Answers 2K Views

Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

Is there any way to change the x-axis on the charts for scalars, to say e.g. "epochs" instead of "iterations"? Or is that hardcoded?

clearml

one year ago

0 Votes

7 Answers

1K Views

0 Votes 7 Answers 1K Views

Hi! How Can We Edit The Parameters Of The Clearml Pro Aws Autoscaler E.G. To Add An Init Script Or To Expand Its Capacity, Please? At The Moment The Only Way We Found Is To Wait Until All The Jobs On It Are Finished, Clone It, Kill It, Start A New One

Hi! How can we edit the parameters of the ClearML PRO AWS autoscaler e.g. to add an init script or to expand its capacity, please? At the moment the only way...

mlops

one year ago

0 Votes

0 Answers

1K Views

0 Votes 0 Answers 1K Views

Hi Everyone! Loving Clearml So Far. Has Anyone Managed To Make Its Autoscaler And Scheduler Work With Lambda Gpu Clouds By Any Chance, Please?

Hi everyone! Loving ClearML so far. Has anyone managed to make its autoscaler and scheduler work with Lambda GPU Clouds by any chance, please? https://lambda...

mlops

one year ago

0 Votes

26 Answers

1K Views

0 Votes 26 Answers 1K Views

Is It Possible To Merge

Is it possible to merge None please? It’s blocking us from using ClearML sessions. Thank you 🙂

clearml

one year ago

0 Votes

11 Answers

1K Views

0 Votes 11 Answers 1K Views

Hi Good Folks Here! Does Clearml Allow Auto-Rerun Of Failed Jobs, For Example When A Spot Instance Gets Interrupted, Please? (Or Auto-Resume, If Checkpointing Logic In Place)

Hi good folks here! Does ClearML allow auto-rerun of Failed jobs, for example when a SPOT instance gets interrupted, please? (or auto-resume, if checkpointin...

clearml

one year ago

0 Votes

0 Answers

1K Views

0 Votes 0 Answers 1K Views

Would Be Super Helpful To Have It, Even As A Pro Feature!

Would be super helpful to have it, even as a Pro feature!

clearml

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

(do you welcome PRs?)

one year ago

0 Could Some Help Me Figure Out Why Figures Have Disappeared From A Report, Please?

Dang, so unlike screenshots, reports do not survive task deletion :/

one year ago

0 Hi Team! Is There A Way To Make Clearml’S Aws Autoscaler And Queues Resource-Aware Please? I.E. If We Can Say, As We Enqueue Our Job, How Much Ram Or Gpu-Ram Or Even Gpus It Needs, Have The Scheduler/Autoscaler Dispatch The Job To Instances That Are Of Th

OK, so no way to have an automatic dispatch to different, correctly-sized instances, it’s only achievable by submitting to different queues?

one year ago

Can the “multiple agents on a single queue” scenario, combined with the autoscaler, spawn multiple agents on a single EC2 instance, by chance, please? (thinking e.g. 8 agents on a 8xGPU machine)

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

Thanks @<1523701070390366208:profile|CostlyOstrich36> ! I'll do - and might even peek under the hood see if I can make a PR. What's the best repo for that? Is it that of the ClearML python package?

one year ago

@<1523701205467926528:profile|AgitatedDove14> great! (I'm on the Pro version :) ).

one year ago

0 Is It Possible To Merge

@<1523701087100473344:profile|SuccessfulKoala55> I think you’ve been tagged in the PR 🙂

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

Yes, exactly. Here is the logical sense it makes: I have plots where iterations represent different units: for some these plots iterations (call them A) are optimization steps, while for others (call them B) they are evaluation iterations, occuring every N optimization steps. I would like to either:

Change the X label so these different plots do not have the same label when they represent different things.
Or, even better, keep the unique "iterations" label but be able to change how I lo...

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

What is the best way to achieve that please?

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

(actually, that might even be feasible without touching the UI, depending how the plot is rendered, but I'll check)

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

Happy to jump on a call if easier to make sense of it :)

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

From the doc I seemed to find ways to log 2D scatter plots, but not line plots :/ (found)
It also seems simpler to keep the scalar logging structure, but be able to pass a multiplier (reflecting the eval_n_steps in for example Torch Lightning)

one year ago

0 Hi Folks! How Do We Get

Amazing, thanks a lot @<1523701087100473344:profile|SuccessfulKoala55> !

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

The problem with logging as a 2D plot is we lose the streaming: if I understand correctly the documentation, Logger.current_logger().report_scatter2d logs a single, frozen 2D plot when you know the full X and Y data. And you would do that at each evaluation step.
Logging scalars allows to log a growing time series, i.e. add to the existing series/plot at every "iteration", thus being able to monitor the progress over time in one single plot. It's a much more logical setting.

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

Logging scalars also leverages ClearML automatic logging. One problem is that this automatic logging seems to keep its own internal "iteration" counter for each scalar, as opposed to keeping track of, say, the optimizer's number of steps.
That can be simply fixed on clearML python lib by allowing to set a per-scalar iteration-multiplier.

one year ago

0 Could Some Help Me Figure Out Why Figures Have Disappeared From A Report, Please?

Tagging my colleague @<1529271085315395584:profile|AmusedCat74> who made that report.

one year ago

@<1523701070390366208:profile|CostlyOstrich36> Any idea please? We could use our 8xA100 as 8 workers, for 8 single-gpu jobs running faster than on a single 1xV100 each.

one year ago

Thanks @<1523701070390366208:profile|CostlyOstrich36> !

I hadn’t found the multiple-resources within the same autoscaler. Could you point me to the right place please? Are they all used interexchangeably based upon availability, rather than based on job needs?
We thought of using separate queues (we do that for CPU vs GPU queues), but having ClearML automatically dispatch to the right based on a job specification would be more flexible. (for example, we could then think to dispath dynami...

one year ago

0 Is There Any Way To Change The X-Axis On The Charts For Scalars, To Say E.G. "Epochs" Instead Of "Iterations"? Or Is That Hardcoded?

Great, thanks both! I suspect this might need an extra option to be passed via the SDK, to save the iteration scaling at logging time, which the UI can then use at rendering time.

one year ago

0 Hi All, In The Aws Scheduler, What’S The Difference Between:

Hi 🙂 Anyone having any idea on that one please? Or could point me in the right place or the right person to find out? Thanks for any help!

one year ago

0 Hi All, In The Aws Scheduler, What’S The Difference Between:

Brilliant, thanks a lot for the answer Jake, much appreciated and clearer!
@<1529271085315395584:profile|AmusedCat74> @<1548115177340145664:profile|HungryHorse70> here we have the answer :)

one year ago

0 Hi All, In The Aws Scheduler, What’S The Difference Between:

Is the doc on GitHub so we can copy that into a PR?

one year ago

0 Hi! How Can We Edit The Parameters Of The Clearml Pro Aws Autoscaler E.G. To Add An Init Script Or To Expand Its Capacity, Please? At The Moment The Only Way We Found Is To Wait Until All The Jobs On It Are Finished, Clone It, Kill It, Start A New One

@<1523701087100473344:profile|SuccessfulKoala55> yes I am 🙂 And thanks, looking forward to it!

one year ago

0 Hi Good Folks Here! Does Clearml Allow Auto-Rerun Of Failed Jobs, For Example When A Spot Instance Gets Interrupted, Please? (Or Auto-Resume, If Checkpointing Logic In Place)

We’re on the PRO 🙂

one year ago

0 Is It Possible To Merge

Should we have run it from the git repo?

one year ago

0 Hi Good Folks Here! Does Clearml Allow Auto-Rerun Of Failed Jobs, For Example When A Spot Instance Gets Interrupted, Please? (Or Auto-Resume, If Checkpointing Logic In Place)

Tagging my colleague @<1529271085315395584:profile|AmusedCat74> who needs this with me 🙂

one year ago

0 Hi Good Folks Here! Does Clearml Allow Auto-Rerun Of Failed Jobs, For Example When A Spot Instance Gets Interrupted, Please? (Or Auto-Resume, If Checkpointing Logic In Place)

Do Pipelines work with Hyperparameter search, and with single training jobs?

one year ago

0 Hi Good Folks Here! Does Clearml Allow Auto-Rerun Of Failed Jobs, For Example When A Spot Instance Gets Interrupted, Please? (Or Auto-Resume, If Checkpointing Logic In Place)

And yes, I was also referring to tasks ran by the Autoscaler (potentially via the HPO) app, too.

one year ago

0 Is It Possible To Merge

It was a debugging session. We haven’t yet tried a “Standard” non-debugging clearml session.

one year ago

0 Is It Possible To Merge

Tagging @<1529271085315395584:profile|AmusedCat74> my colleague with whom we ran into this issue.

one year ago

Show more results