Step 1 was aborted, but the second still was scheduled
Hi, any chance you've had some time to check whether you can replicate this on your side?
- It's a pipeline from Tasks.
- clearml==1.13.2
- For instance, in this pipeline, if the first task fails, the remaining tasks are not scheduled for execution, which is what I expect. I am just surprised that if the first task is instead aborted by the user, the following task is still scheduled for execution (and will fail, because it depends on the first one completing).
Ok - good to know this is odd 🙂
It's created like this (I removed some bits for readability)
def _run(pipeline_id, step):
    from pipeline_broker import pipeline

    pipeline.run_step(pipeline_id=pipeline_id, step=step)
def launch(
    cfg,
    queue: str = "default",
    abort_on_failure: bool = False,
    project: str = "TrainingPipeline",
    start_locally: bool = False,
    task_regex: str = ".*",
):
    ...
    pipe = PipelineController(
        project=project,
        name...
I am running clearml-agent 1.6.1
Neat - looks like exactly what I was looking for, thx
Thx, working now on 1.14.2 🙂
No, just the clearml-agent
I also created an issue in the repo directly. Thx for your help.
Yes, I agree, it should be considered failed, and the PipelineController should not trigger the following task, which depends on the first one. My problem is that this is not the behavior I observe: the second task still gets scheduled for execution. Is there a way to specify that in the PipelineController logic?
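To make sure we're talking about the same knobs: as I understand the docs, the dependency is declared via `parents`, and `continue_on_fail` (default False) is the flag that stops children of a *failed* parent from being scheduled; whether an *aborted* parent counts is exactly the gap here. A sketch of the wiring (the helper name and step names are hypothetical):

```python
def add_dependent_steps(pipe, first, second):
    """Wire two function steps so `second` only runs after `first`.

    `pipe` is assumed to be a clearml PipelineController;
    `first`/`second` are plain functions used as pipeline steps.
    """
    pipe.add_function_step(name="first", function=first)
    pipe.add_function_step(
        name="second",
        function=second,
        parents=["first"],       # schedule only after `first` finishes
        continue_on_fail=False,  # default: a failed parent stops children
    )
```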
And if I create a Pro account myself, can I somehow piggyback on the existing UIs to display the state of the Autoscaler Task?
Hey, finally got to try it, sorry about the delay.
However, I tried on 1.14.1 but I still get the same behavior
root@clement-controller-1:~# head clearml.conf
agent {
    default_docker {
        arguments: ["-v", "/var/run/docker.sock:/var/run/docker.sock"]
    }
}
Neat - it works! Thanks for the quick response 🙂
python3 -m clearml_agent --config-file clearml.conf daemon --foreground --queue services --service --docker --cpu-only
So I can confirm I have the same behavior with this minimal example
#!/usr/bin/env python3
import time
from typing import Optional

import fire
from clearml import PipelineController


def step_one(a=1):
    print("Step 1")
    time.sleep(120)
    return True


def step_two(a=1):
    print("Step 2")
    time.sleep(120)
    return True


def launch():
    pipe = PipelineController(
        project="TEST",
        name="Pipeline demo",
        version="1.1",
        add_pipeline_tags=False,
        ...
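Picking up where the snippet above cuts off: after constructing the controller, the steps are added and the pipeline is started. A hedged sketch of that tail (the queue name and the local/remote switch are my assumptions, mirroring the `launch()` signature earlier in the thread; sleeps are there so step 1 can be aborted from the UI mid-run):

```python
def step_one(a=1):
    import time  # function steps are serialized; import inside the body

    print("Step 1")
    time.sleep(120)  # long enough to abort step 1 from the UI
    return True


def step_two(a=1):
    import time

    print("Step 2")
    time.sleep(120)
    return True


def launch(queue: str = "default", start_locally: bool = False):
    # Deferred import: needs clearml installed and server credentials.
    from clearml import PipelineController

    pipe = PipelineController(
        project="TEST",
        name="Pipeline demo",
        version="1.1",
        add_pipeline_tags=False,
    )
    pipe.add_function_step(name="step_one", function=step_one)
    pipe.add_function_step(
        name="step_two", function=step_two, parents=["step_one"]
    )
    if start_locally:
        # Run the controller (and the steps) on this machine.
        pipe.start_locally(run_pipeline_steps_locally=True)
    else:
        pipe.start(queue=queue)
```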
Ok - I customized it a bit to our workflow, so I wanted to keep our "fork" of the autoscaler, but I guess this is not supported.
And same behavior if I make the dependency explicit via the return of the first one
#!/usr/bin/env python3
import time
from typing import Optional

import fire
from clearml import PipelineController


def step_one(a=1):
    import time

    print("Step 1")
    time.sleep(120)
    return True


def step_two(a=1):
    import time

    print("Step 2")
    time.sleep(120)
    return True


def launch(
    tenant: str = "demo",
    loc_id: str = "common",
    tag: str = "test",
    pipeline_id: Optio...
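For completeness, the "explicit via the return" variant can, as I read the docs, also be expressed by naming the first step's return value (`function_return`) and referencing it from the second step's kwargs, which makes the controller infer the dependency; the return name "done" and the rest of the wiring here are illustrative:

```python
def step_one(a=1):
    print("Step 1")
    return True


def step_two(a=1):
    print("Step 2", a)
    return True


def launch(queue: str = "default"):
    # Deferred import: needs clearml installed and server credentials.
    from clearml import PipelineController

    pipe = PipelineController(project="TEST", name="Pipeline demo", version="1.1")
    pipe.add_function_step(
        name="step_one",
        function=step_one,
        function_return=["done"],  # expose step_one's return value as "done"
    )
    pipe.add_function_step(
        name="step_two",
        function=step_two,
        # Referencing step_one's output should make step_two depend on it.
        function_kwargs={"a": "${step_one.done}"},
        parents=["step_one"],  # stated explicitly as well, for clarity
    )
    pipe.start(queue=queue)
```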
Yep - sounds perfect 🙂