ScrawnyCrocodile51

13 Questions, 33 Answers

Active since 05 February 2025

Last activity 6 months ago

Reputation

Badges 1

33 × Eureka!

Questions 13
Answers 33

0 Votes

1 Answers

727 Views

0 Votes 1 Answers 727 Views

Hello, I Recently Start To Running Into Sslerror When Using Task.Init():

Hello, I recently start to running into SSLError when using Task.init(): 2025-03-03 12:30:55,981:WARNING:urlopen:Retrying (Retry(total=239, connect=240, read...

clearml

8 months ago

0 Votes

4 Answers

597 Views

0 Votes 4 Answers 597 Views

Hi Clearml Community, Is There A Way To Install Additional Packages On Top Of Base Docker Env When Using

Hi clearml community, is there a way to install additional packages on top of base docker env when using Task.force_store_standalone_script() and task.execut...

clearml

6 months ago

0 Votes

3 Answers

651 Views

0 Votes 3 Answers 651 Views

Hi, I want to setup task that can "overwrite configuration on the UI". Found some reference in the doc but couldn't make it work yet. I feel it should be ver...

clearml

8 months ago

0 Votes

3 Answers

739 Views

0 Votes 3 Answers 739 Views

Hi, I Am Exploring The Following Workflow To Submit A Task To Remote Server:

Hi, i am exploring the following workflow to submit a task to remote server: task = Task.create( branch=branch, repo=repo, script=script, requirements_file=r...

clearml

9 months ago

0 Votes

2 Answers

684 Views

0 Votes 2 Answers 684 Views

Hi, I Am Wondering After A Task Submitted To Remote Server Finishing Running. Will The Docker Container / Disk Space (Really I Am More Interested About The Dataset That Download By The Task) Get Automatically Clean Up?

Hi, I am wondering after a task submitted to remote server finishing running. Will the docker container / disk space (really I am more interested about the d...

clearml

9 months ago

0 Votes

2 Answers

722 Views

0 Votes 2 Answers 722 Views

Hello, I Am Trying To Programmatically Retrieve The Artifact

Hello, I am trying to programmatically retrieve the artifact FILE_PATH information that get displayed in the UI. So I have a pandas dataframe uploaded as art...

clearml

8 months ago

0 Votes

4 Answers

808 Views

0 Votes 4 Answers 808 Views

Hi Clearml Team, Is There A Way To Overwrite Working_Dir When Creating Task From Task.Init() Workflow? The Underlying Function I Am Triggering Relying On The Assumption On Running From Certain Directory.

Hi clearml team, is there a way to overwrite working_dir when creating task from task.init() workflow? the underlying function I am triggering relying on the...

clearml

8 months ago

0 Votes

10 Answers

835 Views

0 Votes 10 Answers 835 Views

Hi Community, I Had A Clearml Experiment Seems Become "Unresponsive" Even Though It Is Showing "Running" With The Following Logging (I Didn'T Skip Any Logging Between 3-15 And 3-16, It Is Unexpected Behavior That The Experiment Just Went Mia): Any Idea Wh

Hi community, I had a clearml experiment seems become "unresponsive" even though it is showing "running" with the following logging (I didn't skip any loggin...

clearml

8 months ago

0 Votes

2 Answers

777 Views

0 Votes 2 Answers 777 Views

Hi Clearml Team, Is There Best Practice To Improve Dataset'S Storage Efficiency? For Example, I Don'T Really Need All 5 Versions Of The Same Dataset Get Saved/Remembered, Is There A Way To Prune Old Versions Of Datasets To Be More Storage Efficient?

Hi clearml team, is there best practice to improve dataset's storage efficiency? For example, I don't really need all 5 versions of the same dataset get save...

clearml

7 months ago

0 Votes

16 Answers

896 Views

0 Votes 16 Answers 896 Views

Also, Any Advice On Using Best Practice Of Using Task.Create() Instead Of Task.Init()? I Have The Need Of Specifying Docker And Repository, So Only Find Task.Create() Can Achive What I Need. But Then I End Up With Always Creating 2 Scripts For Each Task I

Also, any advice on using best practice of using Task.create() instead of Task.init()? I have the need of specifying docker and repository, so only find Task...

clearml

8 months ago

0 Votes

2 Answers

721 Views

0 Votes 2 Answers 721 Views

Hi, Is There A Configuration Somewhere That Allow Me To See Stuff Logged From Regular Logging In Clearml Console? Or Clearml'S Logger.Report_Text(Print_Console=True) Is The Only Way To Get Messages Logged To The Console? Thanks!

Hi, is there a configuration somewhere that allow me to see stuff logged from regular logging in clearml console? Or clearml's logger.report_text(print_conso...

clearml

9 months ago

0 Votes

5 Answers

815 Views

0 Votes 5 Answers 815 Views

Hi, Is There A Way To Wait Until A Dataset Finish Uploading Before Proceed? Because I Want To Upload Dataset If It Is Not Already Exist And Then Process The Dataset

Hi, is there a way to wait until a dataset finish uploading before proceed? because I want to upload dataset if it is not already exist and then process the ...

clearml

9 months ago

0 Votes

9 Answers

652 Views

0 Votes 9 Answers 652 Views

Hi Clearml Team, I Am Trying To Figure Out The Best Practice To Keep Track Of Different Models. The Lineage Functionality In The Models Tap Is Generally Helpful. But First Of All, I Need To Be Able To Find Which Project A Model Belongs To, Then Navigate T

Hi clearml team, I am trying to figure out the best practice to keep track of different models. The lineage functionality in the models tap is generally help...

clearml

8 months ago

0 Hi Community, I Had A Clearml Experiment Seems Become "Unresponsive" Even Though It Is Showing "Running" With The Following Logging (I Didn'T Skip Any Logging Between 3-15 And 3-16, It Is Unexpected Behavior That The Experiment Just Went Mia): Any Idea Wh

yes

8 months ago

Actually I am not sure. for enterprise users. Is it most commonly self-hosted?

8 months ago

0 Also, Any Advice On Using Best Practice Of Using Task.Create() Instead Of Task.Init()? I Have The Need Of Specifying Docker And Repository, So Only Find Task.Create() Can Achive What I Need. But Then I End Up With Always Creating 2 Scripts For Each Task I

that is just to compare the same functionality used in task.create() can be achieved when using task.init workflow. Not technically required

8 months ago

Executing task id [7605f1e5ce6b45e99e9302d93bc3bac6]:
repository = git@xxx
branch = xxx
version_num = 9dca88fa23ff93d446eb2ff7d615d7ade213c8aa
tag = 
docker_cmd = iocr.io/xxx
entry_point = clearml_init.py
working_dir = dev

Based on the logging,
working_dir = dev is the problem. I need to have a way to overwrite the working_dir.

8 months ago

0 Hi Clearml Team, I Am Trying To Figure Out The Best Practice To Keep Track Of Different Models. The Lineage Functionality In The Models Tap Is Generally Helpful. But First Of All, I Need To Be Able To Find Which Project A Model Belongs To, Then Navigate T

Hi John, exactly. If I have the model_id, I want to find its project. Do you have tip to do it in both way?

8 months ago

Confirmed that without task.set_repo, it come down to the same error:
ModuleNotFoundError: No module named 'src'

8 months ago

function's signature in the source code

8 months ago

m = Model(model_id)
print("Model.id:", m.id) # <- this is returning model_id, but I need project_id or project_name 

print("Model.data:", m.data) # <- AttributeError: 'Model' object has no attribute 'data'

8 months ago

0 Hi, I Am Exploring The Following Workflow To Submit A Task To Remote Server:

install in dev mode is the easiest without having to publish it first

9 months ago

Would it cause problem to manually set repo when using task.init()?

8 months ago

0 Hi, Is There A Way To Wait Until A Dataset Finish Uploading Before Proceed? Because I Want To Upload Dataset If It Is Not Already Exist And Then Process The Dataset

Roughly I am trying to do this:

def upload_clearml_dataset_from_external_source(
    source_url,
    dataset_name: str,
    dataset_project: str,
):
    # reference:


    dataset = Dataset.create(dataset_name=dataset_name, dataset_project=dataset_project)
    dataset.add_external_files(source_url=source_url)
    dataset.upload()

    dataset.finalize()


upload_clearml_dataset_from_external_source("

", name, project)

Dataset.get(dataset_project=project, dataset...

9 months ago

0 Hello, I Am Trying To Programmatically Retrieve The Artifact

Aha, I see. Thanks for the tip

8 months ago

0 <no title>

PROJECT_NAME = "test"
TASK_NAME = "test_connect"
QUEUE_NAME = "default"
task = Task.init(project_name=PROJECT_NAME, task_name=TASK_NAME)

config = {
    "name": "foo",
    "arg1": "bar",
}
task.connect(config)

task.execute_remotely(queue_name=QUEUE_NAME)
# ------------- end of setup -------------

def dummy_op(config):
    pprint(config)

    return config


dummy_op(config)

Sreenshot also provided to show what "edit" button only appear in user property not hyperparamter
![image](...

8 months ago

0 Hi, Is There A Way To Wait Until A Dataset Finish Uploading Before Proceed? Because I Want To Upload Dataset If It Is Not Already Exist And Then Process The Dataset

I used these setup to load a pretty big dataset from s3:

dataset.add_external_files(
            source_url
        )
        dataset.upload(
            verbose=verbose
        )
        dataset.finalize()

But then seeing error complain about dataset doesn't exist. So my best guess is that the uploading is still happening in the background while the code has move forward to try to do something with that dataset.

So I am questioning if I have to explicitly add some logic to wait f...

9 months ago

0 <no title>

Thanks John! This is exactly it! need to reset the task first then the edit feature will show up

8 months ago

I can try taking it out see if it fix the issue. But i feel it is not the root cause

8 months ago

0 Hi Clearml Team, Is There A Way To Overwrite Working_Dir When Creating Task From Task.Init() Workflow? The Underlying Function I Am Triggering Relying On The Assumption On Running From Certain Directory.

Thanks! I will give this a try

8 months ago

0 Hi Clearml Community, Is There A Way To Install Additional Packages On Top Of Base Docker Env When Using

Hi @<1523701070390366208:profile|CostlyOstrich36> , I tried out Task.add_requirements way to add packages, but it doesn't seem to be working as I expected. here is the snippet i used to setup this up:

Task.force_store_standalone_script()
    add_packages = ["fastparquet"]
    for pkg in add_packages:
        Task.add_requirements(pkg)
    task = Task.init(project_name=project_name, task_name=task_name)
    task.set_base_docker(docker_arguments="--env CLEARML_AGENT_SKIP_PYTHON_ENV_INSTA...

6 months ago

0 Hi, Is There A Configuration Somewhere That Allow Me To See Stuff Logged From Regular Logging In Clearml Console? Or Clearml'S Logger.Report_Text(Print_Console=True) Is The Only Way To Get Messages Logged To The Console? Thanks!

Thanks!

9 months ago

I gave this it a try to switch from Task.create() to Task.init(). I think I am pretty close to switch to using init(). But still have issue of ModuleNotFoundError: No module named 'src' when using task.init().

My project setup look like this:

project_root/
    |--src/
    |--runbooks/
        |--run_task.py

So if I use Task.create(repo=xx, script="runbooks/run_task.py"), it works but if I switch to using Task.init() with the same repo setup (task.set_repo, and then follow by ...

8 months ago

0 Hi, I Am Exploring The Following Workflow To Submit A Task To Remote Server:

Sure, essentially my local python project organized using "src layout", look like this:

foo/
    |--src/
        |--module.py
    |--pyproject.toml
    |--clearml_tasks/
        |--task1.py

in the project, it would use absolute import like from foo import module , and I would install foo project in a editable mode during setup.

When I trying to create clearml task and send it to remote server using above way (leverage requirements.txt to configure library dependencies, and pro...

9 months ago

Thanks for the tips! I will take them for a spin

8 months ago

0 Hi Clearml Team, Is There Best Practice To Improve Dataset'S Storage Efficiency? For Example, I Don'T Really Need All 5 Versions Of The Same Dataset Get Saved/Remembered, Is There A Way To Prune Old Versions Of Datasets To Be More Storage Efficient?

Hi John, the dataset.squash doc says "If a set of versions are given it will squash the versions diff into a single version", I want to double check will it only keep the latest version?

Because I don't want any old version stuff even old version have more stuff than the latest version

7 months ago

0 Hi Clearml Community, Is There A Way To Install Additional Packages On Top Of Base Docker Env When Using

Any followup on this question? Recap:
Task,add_requirements() doesn't seem to do install the package from my experiment

Additionally, as alternative of add_requirements() if I can't get it working, is there an example of using docker bash init script you can point me to

6 months ago

Ok, thanks for the guidance.

8 months ago

Aha I see. that works to retrieve the project id.

A side question, I notice in the clearml design, Task, Model, Dataset those object all have its base model, but not project concept. In terms of project, is there a quick way to retrieve project name based on its id?

8 months ago

Thanks, that works

8 months ago

The ultimate goal is to make sure run.py get run under project root directory because the underlying code has this assumption when writing it. Locally pycharm take care of this, but when execute remotely, need to take care of this otherwise it will complain Module Not Found .

A typical project setup I am working with look like this:

project_root/
    |--src/
        |--utilities.py
        |--foo.py
    |--runbooks/
        |--run.py

And typical import look like this, for ...

8 months ago

0 Hi, I Am Wondering After A Task Submitted To Remote Server Finishing Running. Will The Docker Container / Disk Space (Really I Am More Interested About The Dataset That Download By The Task) Get Automatically Clean Up?

Awesome, thanks for the clarification!

9 months ago

task = Task.init(
        project_name=PROJECT_NAME,
        task_name=TASK_NAME,
        task_type=Task.TaskTypes.data_processing,
    )
    task.set_repo(
        repo="git@xxx.git",
        branch=branch
    )
    task.set_base_docker(
        docker_image="docker-image",
    )
    task.execute_remotely(queue_name=QUEUE_NAME)

This is how I use the task init

8 months ago

Show more results