Might be worth running the command again with the --verbose flag. It will likely give more details on what is causing the failure.
Also, the error you are showing is inside calculate_metrics.py.
Is that a ClearML library or something custom?
It happens, happy training 🚀
For an update 🙂
I think we identified that when moving from a training dataset to a fine-tuning dataset (which was 1/1000th the size), our training script was set to upload every epoch. It seems this resulted in a torrent of metrics being uploaded.
Since modifying this to upload less frequently, we have seen the index latency drop dramatically.
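For illustration, a minimal sketch of that kind of throttling with ClearML's explicit Logger API; the report_every value and the metric names are hypothetical, not taken from our actual script:

from clearml import Task

task = Task.init(project_name="examples", task_name="throttled_reporting")
logger = task.get_logger()
report_every = 10  # only push scalars every 10th epoch

for epoch in range(100):
    train_loss = 1.0 / (epoch + 1)  # stand-in for the real metric
    if epoch % report_every == 0:
        # report_scalar(title, series, value, iteration) is ClearML's explicit reporting call
        logger.report_scalar(title="loss", series="train", value=train_loss, iteration=epoch)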
If you added a print there like:
def filter_out_pt_files(operation_type, model_info):
    print(model_info.__dict__)
    return model_info
You can see what is being picked up. If there is a common path etc. you can filter that out.
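For the print to actually run, the callback also has to be registered; a quick sketch, assuming WeightsFileHandler.add_pre_callback is the registration hook (the import appears in the snippet further down):

from clearml.binding.frameworks import WeightsFileHandler

# register the debug callback so it fires whenever a framework file is about to be logged
WeightsFileHandler.add_pre_callback(filter_out_pt_files)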
Hey 🙂 I had a similar issue today and found this solution:
In my case this codebase was using a .pt filetype which was being picked up and logged as a model even though it was not.
import os
from clearml import Task
from clearml.binding.frameworks import WeightsFileHandler

task = Task.init(
    project_name="task_project",
    task_name="task_name",
    task_type=Task.TaskTypes.training,
)

def filter_out_pt_files(operation_type, model_info):
    is_pt_file = os.path.splitext...
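The snippet gets cut off above, so here is a hedged completion of the callback; it assumes model_info exposes the local file path as local_model_path and that returning None skips registering the file:

import os
from clearml.binding.frameworks import WeightsFileHandler

def filter_out_pt_files(operation_type, model_info):
    # assumption: model_info.local_model_path holds the path of the file being registered
    is_pt_file = os.path.splitext(model_info.local_model_path)[1] == ".pt"
    if is_pt_file:
        # returning None tells ClearML to skip logging this file as a model
        return None
    return model_info

# run the filter before every automatic model registration
WeightsFileHandler.add_pre_callback(filter_out_pt_files)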
Currently running it on a t3.xlarge, which has 4 CPUs, 16 GB RAM, and a 300 GB SSD.
If you can identify a pattern in the YOLOv8 output files you can probably also filter them out 🙂
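For illustration, the same idea keyed on a path pattern instead of the file extension; the "runs/detect" string is an assumption about where YOLOv8 writes its outputs, so replace it with whatever pattern shows up in your model_info print-out:

from clearml.binding.frameworks import WeightsFileHandler

def filter_out_yolo_outputs(operation_type, model_info):
    # assumption: YOLOv8 artifacts land under a "runs/detect" directory
    if "runs/detect" in model_info.local_model_path:
        return None  # skip registering this file as a model
    return model_info

WeightsFileHandler.add_pre_callback(filter_out_yolo_outputs)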
Hope you can get something to work 🤞
Also interested in how this is being approached 🙂 What you mentioned is exactly what I am doing
I need to add a callback for it to filter out anything with .pt
If anyone knows a better way, would love to hear about it 🙂
Looks like it's a /mnt path, which might mean it's a drive or something similar that was connected and may not be anymore?
For something quick, if you create a new folder to put your dataset in:
mkdir ./test_dataset_location
Then you can run your command with:
CLEARML_CACHE_DIR='./test_dataset_location' clearml-data ... <your command here>
It will try to download into that folder.
One option might be to delete the local copy of the dataset and try to re-download it. Perhaps something has gone wrong with the local copy?
As PyTorch Lightning is a framework on top of PyTorch, it will work the same, if not better, with ClearML.
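For illustration, a minimal sketch of that (the model, data, and project/task names are made up): calling Task.init() before building the Trainer lets ClearML's automatic framework bindings pick up the scalars Lightning logs through TensorBoard and the checkpoints it saves.

import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset
import pytorch_lightning as pl
from clearml import Task

# init the task first so ClearML can hook into TensorBoard logging and checkpoint saving
task = Task.init(project_name="examples", task_name="lightning_demo")

class TinyRegressor(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = nn.Linear(8, 1)

    def training_step(self, batch, batch_idx):
        x, y = batch
        loss = nn.functional.mse_loss(self.layer(x), y)
        self.log("train_loss", loss)  # shows up in ClearML via the TensorBoard binding
        return loss

    def configure_optimizers(self):
        return torch.optim.SGD(self.parameters(), lr=0.01)

data = DataLoader(TensorDataset(torch.randn(64, 8), torch.randn(64, 1)), batch_size=16)
trainer = pl.Trainer(max_epochs=2)
trainer.fit(TinyRegressor(), data)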
What does it look like when you instantiate the output_model object?
That looks good to me, not sure