
Task completes normally. I'm using ClearML's AWS autoscaler.
The task is started the following way: an Airflow job run finds an older task, clones it, changes some params, and enqueues it.
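For reference, the clone-and-enqueue step on the Airflow side would look roughly like the sketch below; TEMPLATE_TASK_ID, the parameter name, and the queue name are made-up placeholders, not values from this thread:

from clearml import Task

template = Task.get_task(task_id=TEMPLATE_TASK_ID)            # the older task found by the Airflow job (placeholder id)
cloned = Task.clone(source_task=template, name="cloned-run")   # copy it together with its parameters
cloned.set_parameters({"General/mode": "prod"})                # change whatever params the new run needs (example name)
Task.enqueue(task=cloned, queue_name="aws-autoscaler-queue")   # queue name is a placeholder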
Is there any way to make sure that the task fails?
But what's the best way to catch the exception? All high-level ClearML function calls return normally.
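One way to surface the failure explicitly (a sketch only, not a verified answer for this setup; run_processing is a hypothetical stand-in for the dataset-building code) is to catch the exception yourself and call Task.mark_failed() before re-raising:

from clearml import Task

task = Task.current_task()   # the task the agent is running
try:
    run_processing()         # hypothetical placeholder for the dataset-building code
except Exception as exc:
    # flip the task to "failed" so the cloned run shows as failed in the UI,
    # then re-raise so the process also exits with a non-zero code
    task.mark_failed(status_reason=str(exc))
    raise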
my code:
dataset = Dataset.create(
    dataset_project=PROJECT_NAME,
    dataset_name=f"processed_{mode}",
    dataset_tags=task.get_tags(),
    parent_datasets=None,
    use_current_task=False,
    output_uri=BUCKET,
)
dataset.add_files(path, verbose=True)
dataset.upload(verbose=True)
dataset.finalize(verbose=True)
It seems that connecting it as a configuration is more convenient than uploading an artifact, because artifacts are deleted when a task is cloned. The code is very simple:
That sounds like an interesting hack 😃 I'll try it out, thanks!
common_module = task.connect_configuration("../common.py", "common.py")
if not task.running_locally():
    import shutil
    shutil.copy(common_module, "common.py")
from common import test_common
test_common()
I also cannot create a package out of the common code, because the package registry is inside the internal network as well.
Tried it on 1.13.1. Same problem. @SuccessfulKoala55, any advice?
So you'd recommend setting use_current_task=False when creating the dataset in this task, or should this be done somehow differently?
from clearml import Task, Dataset

task = Task.init(
    project_name="MyProject",
    task_name="MyTask",
    task_type=Task.TaskTypes.data_processing,
    reuse_last_task_id=False,
    output_uri="...",
)

with open("new_file.txt", "w") as file:
    file.write("Hello, world!")

dataset = Dataset.create(parent_datasets=None, use_current_task=True)
dataset.add_files(".", wildcard="new_file.txt", verbose=True)
dataset.upload(verbose=True)
dataset.finalize(verbose=True)
I did something similar at my previous job (we had open-source ClearML deployed). The problem I described here was not present there. I liked this approach; it was convenient that the dataset_id and task_id are the same.
Any updates? Should I provide any extra context?