UnevenDolphin73

106 Questions, 749 Answers

Active since 10 January 2023

Last activity 9 months ago

Reputation

Badges 1

662 × Eureka!

Answers 749

0 One More Follow-Up Still; We'Re Trying To Run Non-Gpu Scaler, And I'Ve Finally Sorted Out Subnet And Security Groups Issues, Only To Run Into This:

That was a good idea, unfortunately did not help too much, but I think I may have a found a work around, thanks!

2 years ago

0 How Do I Stop A Zombie Agent?

Aw you deleted your response fast CostlyOstrich36 xD

Indeed it does not appear in ps aux so I cannot simply kill it (or at least, find it).
I was wondering if it's maybe just a zombie in the server API or similar

2 years ago

0 How Do I Stop A Zombie Agent?

It's removed now, magic of asking for help and doing nothing 😄

2 years ago

0 How Do I Stop A Zombie Agent?

Literally just as you said it - it happened

2 years ago

0 How Do I Stop A Zombie Agent?

Thanks!

2 years ago

0 Bug Report? We Noticed That The Aws Autoscaler Will Lose Track Of Instances Crashing Due To No Space Left On Device, And The Ec2 Instance Will Remain Running Indefinitely.

Created None

one year ago

0 Bug Report? We Noticed That The Aws Autoscaler Will Lose Track Of Instances Crashing Due To No Space Left On Device, And The Ec2 Instance Will Remain Running Indefinitely.

We're using the example autoscaler, nothing modified

one year ago

0 Bug Report? We Noticed That The Aws Autoscaler Will Lose Track Of Instances Crashing Due To No Space Left On Device, And The Ec2 Instance Will Remain Running Indefinitely.

We're using self hosted account

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

Nothing I can spot --

ClearML results page:


ClearML pipeline page:


Launching the next 2 steps
Launching step [...]
Launching step [...]
Launching step: ...
Parameters:
{...}
Configurations:
{}
Overrides:
{}
Launching step: ...
Parameters:
{...}
Configurations:
{}
Overrides:
{}
ClearML Monitor: GPU monitoring failed getting GPU reading, switching off GPU monitoring
2023-02-21 13:53:48
ClearML Monitor: Could not detect iteration reporting, falling back to itera...

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

@<1523701070390366208:profile|CostlyOstrich36> I added None btw

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

I believe that a Pipeline should have the system tags ( pipeline , maybe hidden ), even if it created in a running Task .

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

Happens with the latest version indeed.
I can’t share our code, but the gist of it is:

pipe = PipelineController(name=..., project=..., version=...)

pipe.add_function_step(...)  # Many calls

pipe.set_default_execution_queue(...)
pipe.start(queue=..., wait=True)

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

So the pipeline runs successfully, I can find all the different tasks, but I cannot see them in the Pipelines tab…

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

FWIW running clearml ==1.9.1 with WebApp: 1.9.2-317 • Server: 1.9.2-317 • API: 2.23

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

When I use the APIClient to fetch the tags for the project, I get an empty collection of system tags:

<projects.GetProjectTagsResponse: {
    "tags": [],
    "system_tags": []
}>

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

Ah I see, if the pipeline controller begins in a Task it does not add the tags to it…

one year ago

0 I'D Like The Console In A Clearml Run To Show Only The Stdout/Stderr As It Does Now, But I'D Also Like Clearml To Capture Debug Level Logs. Is There An Easy Around This? It Would Be Nice If One Could E.G. Set

Yes, exactly. I have not yet had a chance to try this out -- should it work?

2 years ago

I... did not, ashamed to admit. The documentation says only boolean values.

2 years ago

0 What Could Cause A Queue To Be Recreated Automatically? I Experimented With The Autoscaler With Queue Name

No, I have no running agents listening to that queue. It's as if it's retained in some memory somewhere and the server keeps creating it.

2 years ago

0 What Could Cause A Queue To Be Recreated Automatically? I Experimented With The Autoscaler With Queue Name

Could also be related to K8, so pinging JuicyFox94 just in case 😉

2 years ago

0 Is There A Way To Save The Models Completely On The Clearml Server? It Seems That Clearml Server Does Not Store The Models Or Artifacts Itself, But They Are Stored Somewhere Else (E.G., Aws S3-Bucket) Or On My Local Machine And Clearml Server Is Only Sto

I can only say I’ve found ClearML to be very helpful, even given the documentation issue.
I think they’ve been working on upgrading it for a while, hopefully something new comes out soon.
Maybe @<1523701205467926528:profile|AgitatedDove14> has further info 🙂

one year ago

0 Hello Gals And Guys! Happy New Year!

Looks great, looking forward to the all the new treats 😉
Happy new year! 🎉

2 years ago

0 !! In Remote Execution, Do Agents Inherit The Config From The Queue From Which They Pull The Task?

I guess it does not do so for all settings, but only those that come from Session()

2 years ago

@<1523704157695905792:profile|VivaciousBadger56> It seems like whatever you pickled in the zip file relies on some additional files that are not pickled.

one year ago

0 Hello! What Considerations Are There To Upgrading Clearml Kubernetes Installation From 1.6.0 To 1.7.0? Will It Suffice To Just Update The Image_Tag Within The Helm Charts And Keep Rest Of The Config As It Was? Br, -Ville P.

Samoja ongelmia täälläkin 😅

2 years ago

0 How Can I Ensure Tasks In A Pipeline Have The Same Environment As The Pipeline Itself? It Seems A Bit Counter-Intuitive That The Pipeline (Executed Remotely) Captures The Local Environment, But The Tasks (Executed Remotely) Do Not Use That Same Environmen

… And it’s failing on typing hints for functions passed in pipe.add_function_step(…, helper_function=[…]) … I guess those aren’t being removed like the wrapped function step?

one year ago

Heh, well, John wrote that in the first reply in this thread 🙂
And in Task.init main documentation page (nowhere near the code), it says the following -

one year ago

I have no idea what’s the difference, but it does not log the internal repository 😞 If I knew why, I would be able to solve it myself… hehe

one year ago

I wouldn't put past ClearML automation (a lot of stuff depend on certain suffixes), but I don't think that's the case here hmm

one year ago

FWIW It’s also listed in other places @<1523704157695905792:profile|VivaciousBadger56> , e.g. None says:

In order to make sure we also automatically upload the model snapshot (instead of saving its local path), we need to pass a storage location for the model files to be uploaded to.
For example, upload all snapshots to an S3 bucket…

one year ago

Show more results