GrievingKoala83

22 Questions, 45 Answers

Active since 11 June 2023

Last activity 7 months ago

Reputation

Badges 1

43 × Eureka!

0 Votes

1 Answers

652 Views

Hello! As Far As I Understand, Files Are Being Sorted By Its Last Modification Time During Cache Cleaning? So That Files That Were Downloaded Long Time Ago But Are Now Actively Used - They Will Be Deleted Firstly?

Hello! As far as I understand, files are sorted by their last modification time during cache cleaning? So files that were downloaded a long time ago b...
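To make the concern concrete, here is a minimal sketch of a cleaner that evicts by last modification time. It is not ClearML's actual implementation; the helper name `evict_oldest_by_mtime` and the `max_files` limit are hypothetical and exist only for illustration. Reading a file does not update its modification time, so with this ordering a file that was downloaded long ago is removed first even if it was read a moment ago.

```python
from pathlib import Path
from typing import List


def evict_oldest_by_mtime(cache_dir: str, max_files: int) -> List[str]:
    """Delete the oldest files (by mtime) until at most max_files remain.

    Hypothetical helper for illustration; returns the removed file names.
    """
    files = [p for p in Path(cache_dir).iterdir() if p.is_file()]
    # Oldest modification time first. Reads do not change st_mtime,
    # so recent usage plays no role in this ordering.
    files.sort(key=lambda p: p.stat().st_mtime)
    to_remove = files[: max(0, len(files) - max_files)]
    removed = []
    for path in to_remove:
        path.unlink()
        removed.append(path.name)
    return removed
```

Sorting on `st_atime` instead would take usage into account, but only on filesystems that actually record access times (mounts with `noatime` do not).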

clearml

7 months ago

0 Votes

7 Answers

2K Views

Hello Everyone! The Cache For Pip Does Not Work For Agent In K8S Mode. I Specify Agent.Docker_Pip_Cache As /Mnt/Pip_Cache In The Clearml.Conf. But Nothing Is Saved Along This Path

Hello everyone! The cache for pip does not work for agent in k8s mode. I specify agent.docker_pip_cache as /mnt/pip_cache in the clearml.conf. But nothing is...

mlops

one year ago

0 Votes

1 Answers

518 Views

Is It Possible To Use The Resource Configuration (

Is it possible to use the Resource Configuration ( None ) in the clearml free tier?

clearml

7 months ago

0 Votes

1 Answers

2K Views

Hello Everyone! How Can I Conveniently Pass A Large Number Of Parameters To The Pipeline In Order To Re-Run It Through Ui?

Hello everyone! How can I conveniently pass a large number of parameters to the pipeline in order to re-run it through ui?

clearml

2 years ago

0 Votes

5 Answers

1K Views

Hi Everyone! I'M Trying To Use

Hi everyone! I'm trying to use task.launch_multi_node(nodes, devices=gpus, hide_children=True) in conjunction with pytorch-lightning. I am using the latest ve...

clearml

11 months ago

0 Votes

3 Answers

799 Views

Hello Everyone! I Tried To Remove Models From Clearml Using

Hello everyone! I tried to remove models from ClearML using clearml.Model.remove( model=model_id, delete_weights_file=True, force=True, raise_on_errors=True,...

clearml

9 months ago

0 Votes

1 Answers

762 Views

Hello Everyone! I Need To Run The Pipeline On Schedule. Are There Any Restrictions On Running Pipelines Through The Scheduler? Would Hidden Project For Pipe Step Tasks Be A Problem For Scheduler?

Hello everyone! I need to run the pipeline on schedule. Are there any restrictions on running pipelines through the scheduler? Would hidden project for pipe ...

clearml

9 months ago

0 Votes

5 Answers

1K Views

Hello! Is There A Way To Launch Clearml Apps (For Example Clearml Schedulers) Via Api Or Code With Status Tracking On Clearml Application Tab? If We Run Clearml Schedulers Via Code With

Hello! Is there a way to launch clearml apps (for example clearml schedulers) via API or code with status tracking on ClearML application tab? If we run clea...

clearml

one year ago

0 Votes

2 Answers

881 Views

Hello Everyone! Is It Possible To Transfer Data (Datasets, Models) From One Clearml Instance To Another? How Can I Do This?

Hello everyone! Is it possible to transfer data (datasets, models) from one ClearML instance to another? How can I do this?

clearml

one year ago

0 Votes

1 Answers

2K Views

Why Can'T I Find The Task Created For The Pipeline In The Project Through The Main Dashboard?

Why can't I find the task created for the pipeline in the project through the main dashboard?

clearml

2 years ago

0 Votes

10 Answers

825 Views

Hello! I Need To Run Clearml Pipeline With Caching Of Steps. I Specify Cache_Executed_Step=True For Each Step, But My Steps Are Not Cached And Ended With The Status - Completed.

Hello! I need to run clearml pipeline with caching of steps. I specify cache_executed_step=True for each step, but my steps are not cached and ended with the...

clearml

8 months ago

0 Votes

1 Answers

2K Views

Hello Everyone! Could You Help Me With The Authorization Question? Is It Possible To Add A New User Through The Api To Access Clearml Webserver? I Found Three Methods In The Clearml Rest Api: Auth.Credentials_Key, Auth.Credentials, Auth.Role. Is There A D

Hello everyone! Could you help me with the authorization question? Is it possible to add a new user through the API to access ClearML webserver? I found thre...

clearml

one year ago

0 Votes

28 Answers

1K Views

Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

Hi! I'm running launch_multi_mode with pytorch-lightning task.execute_remotely(queue_name='my-queue' config = task.launch_multi_node(args.nodes)) ddp = DDPSt...

clearml

one year ago

0 Votes

1 Answers

1K Views

Hello! Is There A Way To Launch Clearml Apps (For Example Clearml Schedulers) Via Api Or Code With Status Tracking On Clearml Application Tab? If We Run Clearml Schedulers Via Code With

Hello! Is there a way to launch clearml apps (for example clearml schedulers) via API or code with status tracking on ClearML application tab? If we run clea...

clearml

one year ago

0 Votes

4 Answers

2K Views

Hello! How To Determine The Cache For An Agent In Kubernetes? I'M Going To Mount S3 As A Cache Folder As A Local Path Using S3Fs. What Variable Needs To Be Set In Values.Yaml For Agent Helm Chart?

Hello! How to determine the cache for an agent in Kubernetes? I'm going to mount s3 as a cache folder as a local path using s3fs. What variable needs to be s...

mlops

one year ago

0 Votes

2 Answers

2K Views

Hi Everyone! I Have A Clearml Dataset That Takes Up 10 Tb. Its Local Download (Get_Local_Copy) Takes About A Month. Can You Tell Me How To Speed Up This Process? I Keep The Cache In Nfs.

Hi everyone! I have a ClearML dataset that takes up 10 Tb. Its local download (get_local_copy) takes about a month. Can you tell me how to speed up this proc...

clearml

one year ago

0 Votes

2 Answers

1K Views

Hello Everyone! Is It Possible To Transfer Data (Datasets, Models) From One Clearml Instance To Another? How Can I Do This? I Store My Data In S3 And Clearml Tracks This Data. I Want To Migrate This Data From One Clearml Instance To Another, That Is, Tran

Hello everyone! Is it possible to transfer data (datasets, models) from one ClearML instance to another? How can I do this? I store my data in s3 and clearml...

clearml

one year ago

0 Votes

3 Answers

2K Views

Hi! When Running A Remote Task On The Agent, Clearml Installs Additional System Packages. How Can This Be Disabled? The Variable Agent.Package_Manager.System_Site_Packages Does Not Work

Hi! When running a remote task on the agent, ClearML installs additional system packages. How can this be disabled? The variable agent.package_manager.system...

clearml

one year ago

0 Votes

5 Answers

2K Views

Hi All! I Write A Data Processing Pipeline. It Is Necessary To Define Many Hyperparameters That Are Inconvenient To Redefine In A Pop-Up Window When Restarting The Pipeline From Ui. Is It Possible To Override The Parameters Through The Configuration File

Hi all! I write a data processing pipeline. It is necessary to define many hyperparameters that are inconvenient to redefine in a pop-up window when restarti...

clearml

2 years ago

0 Votes

5 Answers

888 Views

Hello! Can You Help Me With Model Endpoints Tab - How To Connect It To Existing Clearml-Serving Instance?

Hello! Can you help me with Model Endpoints tab - how to connect it to existing clearml-serving instance?

clearml

8 months ago

0 Votes

3 Answers

1K Views

Hello Everyone! Can I Create A Report Via Api Or Sdk? Can The Model Inference Task Generate A Report That Will Be Displayed In The Reports Tab?

Hello everyone! Can I create a report via API or SDK? Can the model inference task generate a report that will be displayed in the reports tab?

clearml

one year ago

0 Votes

0 Answers

524 Views

Hello! Why In Get_Local_Copy Method (

Hello! Why in get_local_copy method ( None ) dependencies_by_order variable include only 1 parent dataset while there are 3 (pic. 2)? If I understand it corr...

clearml

7 months ago

0 Hello! Is There A Way To Launch Clearml Apps (For Example Clearml Schedulers) Via Api Or Code With Status Tracking On Clearml Application Tab? If We Run Clearml Schedulers Via Code With

@<1523701087100473344:profile|SuccessfulKoala55>

one year ago

0 Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

for example, global rank from failed task in first scenario

one year ago

0 Hello! Is There A Way To Launch Clearml Apps (For Example Clearml Schedulers) Via Api Or Code With Status Tracking On Clearml Application Tab? If We Run Clearml Schedulers Via Code With

Hi @<1523701087100473344:profile|SuccessfulKoala55> No, I am using self-hosted ClearML enterprise server

one year ago

0 Hello! As Far As I Understand, Files Are Being Sorted By Its Last Modification Time During Cache Cleaning? So That Files That Were Downloaded Long Time Ago But Are Now Actively Used - They Will Be Deleted Firstly?

@<1523701070390366208:profile|CostlyOstrich36> Can you help me with my question?

7 months ago

0 Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

@<1523701435869433856:profile|SmugDolphin23>
Logs of rank0:

Environment setup completed successfully
 
Starting Task Execution:
 
 
1718702244585 gpuvm-01:gpu3,0 DEBUG InsecureRequestWarning: Certificate verification is disabled! Adding certificate verification is strongly advised. See:


ClearML results page:

 /projects/0eae440b14054464a3f9c808ad6447dd/experiments/beaa8c380f3c46f0b6f5a3feab514dc8/output/log
task id [beaa8c380f3c46f0b6f5a3feab514dc8]
world=4
...

one year ago

0 Hello! I Need To Run Clearml Pipeline With Caching Of Steps. I Specify Cache_Executed_Step=True For Each Step, But My Steps Are Not Cached And Ended With The Status - Completed.

@<1523701070390366208:profile|CostlyOstrich36>

8 months ago

0 Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

@<1523701435869433856:profile|SmugDolphin23> hi! it works! thanks!

one year ago

0 Hi Everybody! I'M Running An Example Pipeline From A Web Ui. I Notice Very Strange Behavior. After The First Local Run, I Can Create A New Run And Pass Initialization Parameters There, But After A Successful Run, I Lose The Ability To Create New Runs With

I saw similar behavior: the parameters for starting the pipeline cannot be selected in the details view, only in the table view

2 years ago

0 Hello! I Need To Run Clearml Pipeline With Caching Of Steps. I Specify Cache_Executed_Step=True For Each Step, But My Steps Are Not Cached And Ended With The Status - Completed.

@<1523701070390366208:profile|CostlyOstrich36> If I run the pipeline with the same input parameters, all the steps will also be re-run, nothing will be taken from the cache

8 months ago

0 Hello! I Need To Run Clearml Pipeline With Caching Of Steps. I Specify Cache_Executed_Step=True For Each Step, But My Steps Are Not Cached And Ended With The Status - Completed.

@<1523701070390366208:profile|CostlyOstrich36> Any ideas?

8 months ago

0 Hello Everyone! I Tried To Remove Models From Clearml Using

thank you!

9 months ago

0 Hello! I Need To Run Clearml Pipeline With Caching Of Steps. I Specify Cache_Executed_Step=True For Each Step, But My Steps Are Not Cached And Ended With The Status - Completed.

@<1523701070390366208:profile|CostlyOstrich36> can you help me?

8 months ago

0 Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

Hi @<1523701435869433856:profile|SmugDolphin23> ! I set NODE_RANK in the environment, and now:

If gpus=2, node=2, task.launch_multi_node(node) : three tasks are created; two of them complete, but one fails. In this case (gpus*nodes-1) tasks are created, and either some of them crash with an error or all of them do. The behavior is inconsistent.
If gpus=2, node=2, task.launch_multi_node(node*gpus) : seven tasks are created. In this case, all tasks fail except t...

one year ago

0 Hi Everyone! I'M Trying To Use

@<1523701435869433856:profile|SmugDolphin23> Everything worked after setting the variables: --env NCCL_IB_DISABLE=1 --env NCCL_SOCKET_IFNAME=ens192 --env NCCL_P2P_DISABLE=1. Previously, these variables were not required for a successful launch. DDP training with two nodes now works for me. But as soon as I increase the number of nodes (nodes > 2), I get the following error.

Traceback (most recent call last):
  File "/root/.clearml/venvs-builds/3.11/code/light...
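
For reference, the three NCCL variables from this post can also be set from Python, as long as that happens before the distributed process group is initialized. This is only a sketch under assumptions: the post passes them as container `--env` arguments, the helper name `apply_nccl_env` is hypothetical, and `ens192` is the interface name of that particular cluster. `NCCL_IB_DISABLE=1` turns off the InfiniBand transport, `NCCL_SOCKET_IFNAME` selects the network interface for socket traffic, and `NCCL_P2P_DISABLE=1` turns off direct GPU peer-to-peer transfers.

```python
import os
from typing import Dict

# Values copied from the post above; "ens192" is specific to that cluster.
NCCL_ENV: Dict[str, str] = {
    "NCCL_IB_DISABLE": "1",
    "NCCL_SOCKET_IFNAME": "ens192",
    "NCCL_P2P_DISABLE": "1",
}


def apply_nccl_env(overrides: Dict[str, str] = NCCL_ENV) -> Dict[str, str]:
    """Set NCCL variables unless already defined; return the effective values.

    Hypothetical helper: call it before any torch.distributed initialization.
    """
    for key, value in overrides.items():
        # setdefault keeps a value that the container or shell already exported.
        os.environ.setdefault(key, value)
    return {key: os.environ[key] for key in overrides}
```

Using `setdefault` means values injected by the agent or the container still win, so the script does not silently override cluster-level settings.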

11 months ago

0 Hello! Can You Help Me With Model Endpoints Tab - How To Connect It To Existing Clearml-Serving Instance?

@<1523701070390366208:profile|CostlyOstrich36> yes

8 months ago

0 Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

The errors that occur in the second case are shown in these screenshots.

one year ago

0 Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

@<1523701435869433856:profile|SmugDolphin23> Each task shows that the process allocates only 1 GPU out of 2 (all tasks have the same scalar as below)

one year ago

0 Hello Everyone! The Cache For Pip Does Not Work For Agent In K8S Mode. I Specify Agent.Docker_Pip_Cache As /Mnt/Pip_Cache In The Clearml.Conf. But Nothing Is Saved Along This Path

If I understand correctly, the cache for pip is stored at /root/.cache/pip. How can I change it? The agent.docker_internal_mounts.pip_cache variable in the config also does not change anything.
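
A quick way to check where pip will actually write its cache inside the task container is sketched below. It relies only on pip's documented behavior on Linux (the `PIP_CACHE_DIR` environment variable, then `$XDG_CACHE_HOME/pip`, then `~/.cache/pip`) and deliberately ignores any `cache-dir` setting in a pip config file. The function names are hypothetical, and nothing here is specific to the ClearML agent.

```python
import os
from pathlib import Path


def effective_pip_cache_dir() -> Path:
    """Return the pip cache directory implied by the environment (Linux rules)."""
    explicit = os.environ.get("PIP_CACHE_DIR")
    if explicit:
        return Path(explicit)
    xdg = os.environ.get("XDG_CACHE_HOME")
    base = Path(xdg) if xdg else Path.home() / ".cache"
    return base / "pip"


def count_cached_files(cache_dir: Path) -> int:
    """Count regular files under cache_dir; 0 if the directory does not exist."""
    if not cache_dir.is_dir():
        return 0
    return sum(1 for p in cache_dir.rglob("*") if p.is_file())
```

Running both functions before and after a `pip install` inside the pod shows whether anything is being written to the mounted path.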

one year ago

0 Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

@<1523701435869433856:profile|SmugDolphin23> if task.launch_multi_node(4) , then all 4 tasks fail

one year ago

0 Hello Everyone! Is It Possible To Transfer Data (Datasets, Models) From One Clearml Instance To Another? How Can I Do This?

I store my data in s3 and clearml tracks this data. I want to migrate this data from one ClearML instance to another, that is, transfer it to another s3 and have a new ClearML instance track it

one year ago

0 Hello! Can You Help Me With Model Endpoints Tab - How To Connect It To Existing Clearml-Serving Instance?

@<1523701070390366208:profile|CostlyOstrich36> Any ideas?

8 months ago

0 Hello! Is There A Way To Launch Clearml Apps (For Example Clearml Schedulers) Via Api Or Code With Status Tracking On Clearml Application Tab? If We Run Clearml Schedulers Via Code With

Can you help me?

one year ago

0 Hello Everyone! The Cache For Pip Does Not Work For Agent In K8S Mode. I Specify Agent.Docker_Pip_Cache As /Mnt/Pip_Cache In The Clearml.Conf. But Nothing Is Saved Along This Path

kubectl exec -it clearml-agent-85fd8ccc6d-7fdk7 -n clearml bash
kubectl exec [POD] [COMMAND] is DEPRECATED and will be removed in a future version. Use kubectl exec [POD] -- [COMMAND] instead.
Defaulted container "k8s-glue" out of: k8s-glue, init-k8s-glue (init)
root@clearml-agent-85fd8ccc6d-7fdk7:~# cat /root/clearml.conf 
agent.git_user=gitlab_agent
agent.git_pass=682S-pH9ay1nidsxBGyT
agent.cuda_version=118
#agent.docker_internal_mounts.venv_build=/home/s3_cache/venvs-builds
#agent.do...

one year ago

0 Hi Everyone! I'M Trying To Use

@<1523701435869433856:profile|SmugDolphin23> It is possible to request up to 5 workers in the toy example with Feed Forward and MNIST, BUT it is not possible to request more than 2 workers on a real large model

11 months ago

0 Hello! I Need To Run Clearml Pipeline With Caching Of Steps. I Specify Cache_Executed_Step=True For Each Step, But My Steps Are Not Cached And Ended With The Status - Completed.

I create a pipeline via PipelineController and add a step as a function:

pipe = PipelineController(
    name=cfg.clearml.pipeline_name,
    project=cfg.clearml.project_name,
    target_project=True,
    version=cfg.clearml.version,
    add_pipeline_tags=True,
    docker=cfg.clearml.dockerfile,
    docker_args=DefaultMLPLATparam().docker_arg,
    packages=packages,
    retry_on_failure=3
)

for parameter in cfg.clearml.params:
    pipe.add_...

8 months ago

0 Hello Everyone! The Cache For Pip Does Not Work For Agent In K8S Mode. I Specify Agent.Docker_Pip_Cache As /Mnt/Pip_Cache In The Clearml.Conf. But Nothing Is Saved Along This Path

In the clearml section of values.yaml:

clearml:
  ...
  clearmlConfig: |-
   agent.docker_pip_cache="/mnt/pip_cache"

one year ago

0 Hi! I'M Running Launch_Multi_Mode With Pytorch-Lightning

@<1523701435869433856:profile|SmugDolphin23> gloo doesn't work for me either

but torch works with nccl and task.launch_multi_node

problems arise specifically with pytorch-lightning

one year ago

0 Hi! When Running A Remote Task On The Agent, Clearml Installs Additional System Packages. How Can This Be Disabled? The Variable Agent.Package_Manager.System_Site_Packages Does Not Work

Do I understand correctly that it is impossible to disable the installation of system packages without CLEARML_AGENT_SKIP_PIP_VENV_INSTALL and CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL?

one year ago

0 Hello Everyone! Can I Create A Report Via Api Or Sdk? Can The Model Inference Task Generate A Report That Will Be Displayed In The Reports Tab?

Hi @<1523701087100473344:profile|SuccessfulKoala55> where can I get examples of REST API requests for creating reports?

one year ago

0 Hello! Can You Help Me With Model Endpoints Tab - How To Connect It To Existing Clearml-Serving Instance?

@<1523701070390366208:profile|CostlyOstrich36> I have 2 clearml-serving instances with endpoints

8 months ago
