ThickKitten19

3 Questions, 9 Answers

Active since 10 January 2023

Last activity one year ago

Reputation

Badges 1

9 × Eureka!

Questions 3
Answers 9

0 Votes

12 Answers

1K Views

0 Votes 12 Answers 1K Views

Hello, Everyone! I Have A Question Regarding Clearml Features. We Run Into The Situation When Some Of The Agents That Are Working On A Hpo Die Due To Variable Reasons. Some Workers Go Offline Or Resources Need Temporarily Be Detached For Other Needs. Thu

Hello, everyone! I have a question regarding ClearML features. We run into the situation when some of the agents that are working on a HPO die due to variabl...

clearml

2 years ago

0 Votes

1 Answers

622 Views

0 Votes 1 Answers 622 Views

Hello, Community, I Hope This Message Finds You All Well. I Am Currently Working On A Project Involving Hyperparameter Optimization (Hpo) Using The Optuna Optimizer. Specifically, I'Ve Been Trying To Navigate The Parameters 'Min_Iteration_Per_Job' And 'M

Hello, community, I hope this message finds you all well. I am currently working on a project involving Hyperparameter Optimization (HPO) using the Optuna op...

clearml

8 months ago

0 Votes

6 Answers

1K Views

0 Votes 6 Answers 1K Views

Hi! I Am Trying To Update Existing Models That Were Already Created By Some Model Registration Task. I Can Not Find In The Documentation If It Is Possible To Update The Model'S Tag Programmatically? There Is A Way To Update The Model'S Tag From The Web Pa

Hi! I am trying to update existing models that were already created by some model registration task. I can not find in the documentation if it is possible to...

clearml

one year ago

0 Hello, Everyone! I Have A Question Regarding Clearml Features. We Run Into The Situation When Some Of The Agents That Are Working On A Hpo Die Due To Variable Reasons. Some Workers Go Offline Or Resources Need Temporarily Be Detached For Other Needs. Thu

I see! Then the command clearml-agent execute --id <task_id here> should reload the reported scalars and the task needs to reload last checkpoints only, right?

That's good question too! We didn't figure out the best way of continuing for both the grid and optuna. Can you suggest something?

2 years ago

AgitatedDove14 Let me clarify I think you have misunderstood me.

The main reason we need the above mentioned functionality is because there are some experiments that need to run for a long time. Let's say weeks.
However, the importance of the experiment is low so when other, more important experiments appear. We need to temporarily pause(kill or something else) running HPO task and reassign the resource for other needs.
Later, when more important experiments has been completed, we can conti...

2 years ago

Thanks for the answers AgitatedDove14 .
I will look GH issues in and open one if there isn't related one.

2 years ago

Hi AgitatedDove14 I get the reported scalars from the web using
model_task = Task.get_task(task_id=model_task_id) scalars = model_task.get_reported_scalars()then register each of the scalars with something like
logger.report_scalar(title=metric_key, series=series_val['name'], value=y, iteration=x)Then you have reported scalars to which I am able to append rest of the model training reports.
Workers are running across multiple machines and you can monitor if a task is dead by looking...

2 years ago

AgitatedDove14 I am not restarting the agent itself, I just need to be able continue the experiment from the same progress point. It can be a different agent. In fact, I am just loading the progress to another agent within the available queue.

2 years ago

Quick question when you say the HPO Task, you mean the HPO controller logic Task (i.e. the one launching the training jobs), or do you mean the actual training job itself (i.e. running with a specific set of parameters decided by the HPO controlling task) ?

AgitatedDove14 Sorry, my bad! By HPO task I mean the actual training job itself.
We run the HPO controller logic Task on a separate cpu only machine, so we can think that this task is always on. Only the training jobs can go ...

2 years ago

0 Hi! I Am Trying To Update Existing Models That Were Already Created By Some Model Registration Task. I Can Not Find In The Documentation If It Is Possible To Update The Model'S Tag Programmatically? There Is A Way To Update The Model'S Tag From The Web Pa

Is there a way we can update the docs webpage?

one year ago

Oh, I see! Apparently, there are no tags setter in the documentation, even though it is in the source code itself. Thanks!

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> The Model object does not seem to have the update method, is there a different version of SDK you are looking at?

one year ago