effectively making us lose 24 hours of GPU compute
Oof, sorry about that, man 😞
Looking at the screenshots above: for the locally run experiment (left), does the model URL field contain an HTTP URL? The one you whited out?
Usually those models are PyTorch, right? So yes, you should be able to; feel free to follow the PyTorch example if you want to see how 🙂
As I understand it, vertical scaling means giving each container more resources to work with. This should always be possible in a k8s context, because you decide which types of machines go in your pool and you define the requirements for each container yourself 🙂 So if you want to set the container to use 10,000 CPUs, feel free! Unless you mean something else with this, in which case please correct me!
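For reference, a minimal sketch of what that per-container vertical scaling looks like in a k8s pod spec (the numbers are placeholders, not recommendations):

```yaml
# Part of a container spec in a k8s pod; the scheduler will only place
# the pod on a node that can satisfy the requested resources.
resources:
  requests:
    cpu: "4"
    memory: 8Gi
  limits:
    cpu: "8"
    memory: 16Gi
```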
Don't paste your API keys! 🙈
Hi there!
Technically there should be nothing stopping you from deploying a Python-backend model. I just checked the source code, and ClearML basically just downloads the model artifact and renames it based on the inferred model type.
As far as I'm aware (could def be wrong here!), the Triton Python backend essentially requires a folder...
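For context, a minimal Triton Python-backend model repository conventionally looks like this (standard Triton layout, nothing ClearML-specific):

```
model_repository/
└── my_model/             # model name used in inference requests
    ├── config.pbtxt      # declares backend: "python" plus the I/O tensors
    └── 1/                # version directory
        └── model.py      # implements the TritonPythonModel class
```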
To prevent these kinds of collisions, it's always necessary to provide a parent dataset ID at creation time, so it's very clear which dataset an update is based on. If multiple updates happen at the same time, they won't know of each other and will both use the same dataset as the parent. This leads to 2 new versions based on the same parent dataset, but not sharing data with each other. If that happens, you could create a 3rd dataset (potentially automatically) that can have bot...
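For illustration, a minimal sketch of creating a new version with an explicit parent (project and dataset names are placeholders):

```python
from clearml import Dataset

# Fetch the current version to use as an explicit parent.
parent = Dataset.get(dataset_project="examples", dataset_name="my-data")

# Passing parent_datasets makes the lineage unambiguous, even if two
# updates are created around the same time.
child = Dataset.create(
    dataset_project="examples",
    dataset_name="my-data",
    parent_datasets=[parent.id],
)
child.add_files(path="new_files/")
child.upload()
child.finalize()
```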
Wow! Awesome to hear :D
Pipelines! 😄
ClearML allows you to create pipelines, with each step either created from code or from pre-existing tasks. Each task, by the way, can have a custom Docker container assigned that it should run inside of, so it should fit nicely with your workflow! (See the sketch after the links below.)
YouTube videos:
https://www.youtube.com/watch?v=prZ_eiv_y3c
https://www.youtube.com/watch?v=UVBk337xzZo
Relevant Documentation:
https://clear.ml/docs/latest/docs/pipelines/
Custom docker container per task:
https://...
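To make that concrete, here's a minimal sketch using pre-existing tasks as steps (all project, task, and queue names are placeholders):

```python
from clearml import PipelineController

pipe = PipelineController(name="my-pipeline", project="examples", version="1.0.0")

# Each step clones a pre-existing task; the container it runs in is
# whatever that base task has configured (e.g. via Task.set_base_docker).
pipe.add_step(
    name="preprocess",
    base_task_project="examples",
    base_task_name="preprocess task",
)
pipe.add_step(
    name="train",
    parents=["preprocess"],
    base_task_project="examples",
    base_task_name="train task",
)

pipe.start(queue="services")
```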
AdventurousButterfly15 The fact that it tries to ping localhost means you are running the ClearML server locally, right? In that case it's a Docker thing: the container cannot access localhost, because localhost inside a Docker container is not the same as on your machine itself; they're isolated. That said, adding --network=host to the docker command usually fixes this by connecting the container to the host network instead of the internal Docker one.
You can add a custom argument either i...
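For example, from the task side it would look roughly like this, assuming a recent SDK where Task.set_base_docker accepts docker_arguments (the image name is a placeholder):

```python
from clearml import Task

task = Task.init(project_name="examples", task_name="remote run")

# "--network=host" attaches the container to the host network, so
# "localhost" inside the container resolves to the machine itself.
task.set_base_docker(
    docker_image="python:3.10",
    docker_arguments="--network=host",
)
```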
Hi ComfortableShark77 !
Which commands did you use exactly to deploy the model?
That would explain why it reports the task ID as 'a' in the error: it tried to index the first element of a list, but took the first character of a string instead.
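In other words, the classic Python gotcha:

```python
task_ids = ["abc123"]   # a list with one task id
print(task_ids[0])      # "abc123" -- the intended first element

task_ids = "abc123"     # the same value passed as a bare string
print(task_ids[0])      # "a" -- the first *character* instead
```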
Thank you so much! In the meantime, I checked once more, and the closest I could get was using report_single_value(). It forces you to report each and every row though, but the comparison looks a little better this way. No color coding yet, but maybe it can already help you a little 🙂
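For illustration, a minimal sketch of what that looks like (names and values are made up):

```python
from clearml import Task

task = Task.init(project_name="examples", task_name="single values")
logger = task.get_logger()

# One call per value ("row"); they show up in the experiment's scalars
# and in side-by-side comparisons.
logger.report_single_value(name="precision", value=0.92)
logger.report_single_value(name="recall", value=0.87)
```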
Hi! Have you tried adding custom metrics to the experiment table itself? You can add any scalar as a column in the experiment list; it doesn't have color formatting, but it might be closer to what you want than the compare functionality 🙂
Hi Jax! We have a blog post explaining how to use it almost ready to go. I'll ping you here when it's out.
In the meantime you can check out the TAO getting-started resources at https://catalog.ngc.nvidia.com/orgs/nvidia/teams/tao/resources/tao-getting-started . Download the zip file with the examples, and under notebooks > tao_launcher_starter_kit > detectnet_v2 you'll find a notebook with an example of how to use the integration.
Also a big thank you for so thoroughly testing the system and providing this amount of feedback, it really does help us make the tool better for everyone! 😄
Just for reference, the main issue is that ClearML does not allow non-string types as dict keys in its configuration, while label mappings usually do have ints as keys. That's why we need to cast them to strings first, pass them to ClearML, and then cast them back.
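A minimal sketch of that round trip (label names are made up):

```python
from clearml import Task

task = Task.init(project_name="examples", task_name="label mapping")

label_map = {0: "cat", 1: "dog"}  # int keys, as most frameworks produce

# Cast keys to strings before handing the dict to ClearML...
stored = task.connect_configuration(
    {str(k): v for k, v in label_map.items()}, name="label_map"
)
# ...and cast them back when you need the original mapping.
label_map = {int(k): v for k, v in stored.items()}
```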
Hey! Thanks for all the work you're putting in and the awesome feedback 😄
So, it's weird that you get the shm error; this is most likely our fault for not configuring the containers correctly 😞 The containers are brought up using the docker-compose file, so you'll have to add it in there. The service you want is called clearml-serving-triton, you can find it [here](https://github.com/allegroai/clearml-serving/blob/2d3ac1fe63637db1978df2b3f5ea4903ef59788a/docker/docker-...
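For reference, the change would look roughly like this in the compose file (a sketch; the service name follows the clearml-serving repo, and '2gb' is just an example value):

```yaml
services:
  clearml-serving-triton:
    # shm_size enlarges /dev/shm inside the container; Triton uses shared
    # memory for its inter-process I/O.
    shm_size: '2gb'
```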
That wasn't my intention! Not a dumb question, just a logical one 😄
As long as your clearml-agents have access to the Redis instance it should work! Cool use case though, interested to see how well it works 🙂
That's what happens in the background when you click "New run": a pipeline is simply a task under the hood. You can find that task by querying, and you can clone it too! It is placed in a "hidden" folder called .pipelines, a subfolder of your main project. Check the settings: you can enable "show hidden folders".
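For illustration, a sketch of finding and cloning that hidden controller task (project and pipeline names are placeholders):

```python
from clearml import Task

# Pipeline runs live under "<project>/.pipelines/<pipeline name>".
controller = Task.get_task(
    project_name="examples/.pipelines/my-pipeline",
    task_name="my-pipeline",
)
# Cloning the controller task is equivalent to starting a new run.
new_run = Task.clone(source_task=controller)
```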
Also, please note that the dataset UI has changed since the video was uploaded. A dataset will now be found under the Datasets tab on the left instead of in the experiment manager 🙂
Thank you so much ExasperatedCrocodile76, I'll check it tomorrow 🙂
I tried answering them as well. Let us know what you end up choosing, we're always looking to make ClearML better for everyone!
Hey @<1526371965655322624:profile|NuttyCamel41> Thanks for coming back on this, and sorry for the late reply. This does look like a bug, especially because it seems to work when coming from the ClearML servers.
Would you mind copy-pasting this info into a GitHub issue on the clearml-serving repo? Then we can track the progress we make on fixing it 🙂
I see. Are you able to manually boot a VM on GCP, SSH into it, and run the docker login command from there? Just to rule out networking or permissions as possible issues.
Maybe you can add https://clear.ml/docs/latest/docs/references/sdk/automation_controller_pipelinecontroller/#set_default_execution_queue to your PipelineController and have the actual value linked to a pipeline parameter? Then, when you create a new run, you can manually enter a queue name, and the parameter will be used by the pipeline controller script to set the default execution queue.
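Something along these lines (a sketch; "execution_queue" is a made-up parameter name, and reading the value back via get_parameters() is an assumption you should verify against your SDK version):

```python
from clearml import PipelineController

pipe = PipelineController(name="my-pipeline", project="examples", version="1.0.0")

# Becomes an editable field when starting a new run from the UI.
pipe.add_parameter(name="execution_queue", default="default")

# Assumed accessor -- check the PipelineController reference for your version.
queue_name = pipe.get_parameters().get("execution_queue", "default")
pipe.set_default_execution_queue(queue_name)
```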
The point of the alias is better visibility in the experiment manager. Check the screenshots above for what it looks like in the UI. Essentially, setting an alias makes sure the task that fetches the dataset automatically logs the ID it receives from Dataset.get(). That way, when you later look back at your experiment, you can also see exactly which dataset was retrieved back then.
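For illustration (project, dataset, and alias names are placeholders):

```python
from clearml import Dataset

# Passing alias= makes the consuming task log the resolved dataset id
# under that name, so you can later see exactly which version it used.
ds = Dataset.get(
    dataset_project="examples",
    dataset_name="my-data",
    alias="training_data",
)
local_path = ds.get_local_copy()
```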
ExuberantBat52 When you still get the log messages, where did you specify the alias?...
I'm not exactly sure what is going wrong without an exact error or a reproducible example.
However, passing around the dataset object is not ideal: passing info from one step to another in a pipeline requires ClearML to pickle said object, and I'm not sure a Dataset object is picklable.
Next to that, running get_local_copy() in the first step does not guarantee that you can access that data from the other step. Both might be executed in different Docker containers or even on different...
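A sketch of the safer pattern: pass the dataset ID (a plain string) between steps and fetch a local copy inside each step that needs the data (names are placeholders):

```python
from clearml import Dataset

def step_one() -> str:
    ds = Dataset.get(dataset_project="examples", dataset_name="my-data")
    # Return the id, a plain string -- trivially picklable between steps.
    return ds.id

def step_two(dataset_id: str) -> None:
    # Fetch a local copy inside whichever container runs this step.
    local_path = Dataset.get(dataset_id=dataset_id).get_local_copy()
    print(local_path)
```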