
Setting ultralytics workers=0 seems to work, as per the thread above!
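For reference, the same fix can be pinned in an Ultralytics override file instead of the train call. This is a sketch assuming the standard train-argument names from Ultralytics' default.yaml; the file name custom_cfg.yaml is hypothetical:

```yaml
# custom_cfg.yaml (hypothetical) — pass via model.train(cfg="custom_cfg.yaml", ...)
workers: 0  # disable DataLoader worker subprocesses; sidesteps the /dev/shm limit inside the container
```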
Collecting pip<20.2
Using cached pip-20.1.1-py2.py3-none-any.whl (1.5 MB)
Installing collected packages: pip
Attempting uninstall: pip
Found existing installation: pip 20.0.2
Not uninstalling pip at /usr/lib/python3/dist-packages, outside environment /usr
Can't uninstall 'pip'. No files were found to uninstall.
It's hanging at
Installing collected packages: zipp, importlib-resources, rpds-py, pkgutil-resolve-name, attrs, referencing, jsonschema-specifications, jsonschema, certifi, urllib3, idna, charset-normalizer, requests, pyparsing, PyYAML, six, pathlib2, orderedmultidict, furl, pyjwt, psutil, python-dateutil, platformdirs, distlib, filelock, virtualenv, clearml-agent
Successfully installed PyYAML-6.0.2 attrs-23.2.0 certifi-2024.7.4 charset-normalizer-3.3.2 clearml-agent-1.8.1 distlib-0.3....
I am running the agent with: clearml-agent daemon --queue training
Thank you for getting back to me
I have set agent{cuda_version: 11.2}
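For anyone following along, that setting lives in the agent's clearml.conf; a sketch, assuming a default config layout:

```
# ~/clearml.conf — force the agent's CUDA version instead of auto-detecting it
agent {
    cuda_version: "11.2"
}
```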
What does ClearML do differently that leads to a failure here?
Looks okay there
If I run nvidia-smi, it returns valid output and reports CUDA version 11.2
Isn't the problem that CUDA 12 is being installed?
Solved that by setting docker_args=["--privileged", "--network=host"]
WARNING:clearml_agent.helper.package.requirements:Local file not found [torch-tensorrt @ file:///opt/pytorch/torch_tensorrt/py/dist/torch_tensorrt-1.3.0a0-cp38-cp38-linux_x86_64.whl], references removed
The final answer was:
docker="ultralytics/ultralytics:latest",
docker_args=["--network=host", "--ipc=host"],
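The same default can also be made permanent on the agent side rather than per task; a sketch, assuming the standard agent.default_docker section of clearml.conf:

```
# ~/clearml.conf — default container for every task this agent runs
agent.default_docker {
    image: "ultralytics/ultralytics:latest"
    arguments: ["--network=host", "--ipc=host"]
}
```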
@CostlyOstrich36 I don't think it's related to disk, I think it's related to shm
But that doesn't explain why the model JSON files are missing.
@CostlyOstrich36 do you have any ideas? Thank you
We are getting the dataset like this:
clearml_dataset = Dataset.get(
    dataset_id=config.get("dataset_id"), alias=config.get("dataset_alias")
)
dataset_dir = clearml_dataset.get_local_copy()
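To narrow down the missing model JSON files, it may help to list what the local copy actually contains right after get_local_copy(). A minimal sketch — verify_dataset and the required file names are hypothetical:

```python
import os

def verify_dataset(dataset_dir, required=("data.yaml",)):
    """Return the names from `required` that are missing anywhere under dataset_dir."""
    present = set()
    for _root, _dirs, files in os.walk(dataset_dir):
        present.update(files)
    return [name for name in required if name not in present]
```

Calling it with required=("data.yaml", "model.json") right after get_local_copy() would show whether the files were lost during the download or only later during the copy.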
On local I am able to import ultralytics in this docker image:
docker run --gpus 1 -it nvcr.io/nvidia/pytorch:22.12-py3
# pip install opencv-python==4.6.* ultralytics
# python
>>> from ultralytics import YOLO
>>>
Trying this:
clearml_dataset = Dataset.get(
    dataset_id=config.get("dataset_id"), alias=config.get("dataset_alias")
)
dataset_dir = clearml_dataset.get_local_copy()
destination_dir = os.path.join("/datasets", os.path.basename(dataset_dir))
shutil.copytree(dataset_dir, destination_dir)
results = model.train(
    data=destination_dir + "/data.yaml", epochs=config.get("epochs"), device=0
)
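One caveat with the copy step above: shutil.copytree raises FileExistsError if /datasets/<name> is left over from a previous run. On Python 3.8+, dirs_exist_ok=True tolerates reruns; a sketch, where copy_dataset is a hypothetical helper name:

```python
import os
import shutil

def copy_dataset(dataset_dir, datasets_root="/datasets"):
    """Copy the dataset into datasets_root, overwriting files left by earlier runs."""
    destination_dir = os.path.join(datasets_root, os.path.basename(dataset_dir))
    # dirs_exist_ok=True (Python 3.8+) keeps reruns from failing on an existing dir
    shutil.copytree(dataset_dir, destination_dir, dirs_exist_ok=True)
    return destination_dir
```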
I am trying Task.create like so:
task = Task.create(
    script="test_gpu.py",
    packages=["torch"],
)