
On the helm charts clearml repos, can we use the clearml-serving chart alone?
Hi @<1523701087100473344:profile|SuccessfulKoala55> , the EC2 instance is spun up by the AWS autoscaler provided by ClearML. I use the following docker image: nvidia/cuda:11.8.0-devel-ubuntu20.0
So the EC2 instance runs a docker container
I still do not get the usefulness of the K8s clearml server then?
How do I set that up inside clearml.conf (or somewhere else) so it knows which credentials to load?
Great, and can we specify a ClearML environment variable that directly overrides the azure config from clearml.conf, or do something similar? I do not want to ask every engineer on my team to modify their clearml.conf file. @<1523701070390366208:profile|CostlyOstrich36> Thanks
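For reference, something like the sketch below is what I had in mind, assuming ClearML falls back to the standard AZURE_STORAGE_ACCOUNT / AZURE_STORAGE_KEY environment variables when no azure.storage credentials are set in clearml.conf (the account name and key are placeholders):
import os

# Assumption: ClearML picks up these standard Azure variables as default
# storage credentials when clearml.conf has no azure.storage section.
os.environ["AZURE_STORAGE_ACCOUNT"] = "mystorageaccount"      # placeholder
os.environ["AZURE_STORAGE_KEY"] = "<storage-account-key>"     # placeholder

from clearml import Task

task = Task.init(project_name="my_project", task_name="azure_env_test")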
Thanks, my question is dumb indeed 🙂 Thanks for the reply !
I have my Task.init inside a train() function inside the flask command. We basically have flask commands that allow us to trigger specific behaviors. When running it locally, everything works properly except the repository information. The use case is linked to the way our codebase works. For example, I run flask train {arguments} and it triggers the training of a model (that I want to track).
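To make the setup concrete, here is a minimal sketch of what our code roughly looks like (the project name, option and module layout are illustrative, not our real ones):
import click
from flask import Flask
from clearml import Task

app = Flask(__name__)

def train(epochs):
    # Task.init is called here, inside the function that the CLI command
    # triggers, not at module import time.
    task = Task.init(project_name="my_project", task_name="training")
    # ... actual training code ...

@app.cli.command("train")
@click.option("--epochs", default=10, type=int)
def train_command(epochs):
    train(epochs)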
I stopped the autoscaler and deleted it manually. I did it because I want to test...
I am literally trying with 1 package and python and it fails. I tried with python 3.8, 3.9 and 3.9.16 and it always fails --> so it is not linked to the python version. What is the problem then? I am wondering if there is not an intrinsic bug.
I literally connected to it at runtime, ran poetry install -n, and it worked
I tried too. I do not have more logs inside the ClearML agent 😞
I read that the hosted clearml server was periodically reset. Does it mean my team would lose all our work?
No problem. I guess this might be a small visualisation bug, but I really have the impression that these workers still pick up tasks, which is strange. I should test again to be sure.
@<1523701205467926528:profile|AgitatedDove14> If you have any other insights, pls do not hesitate! Thanks a lot
Using a pyenv virtual env, then exporting the LOCALPYTHON env var
Yes, that should be correct. Inside the bash script of the task.
I tried playing with those, but I cannot get them to have any effect on the source code detection. I can modify the env variables, but nothing happens on the ClearML server unfortunately.
Thank you! I will try this 🙂
It is due to the caching mechanism of ClearML. Is there a python command to update the venvs-cache?
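To be clear, something like this rough sketch is what I mean (it assumes the cache lives at the default agent.venvs_cache.path of ~/.clearml/venvs-cache and simply clears it; not an official API as far as I know):
import shutil
from pathlib import Path

# Assumed default cache location (agent.venvs_cache.path in clearml.conf).
venvs_cache = Path("~/.clearml/venvs-cache").expanduser()

if venvs_cache.exists():
    shutil.rmtree(venvs_cache)  # the next run rebuilds the cached venvs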
The flask command is run inside the git project, which is what makes the behavior strange. It is executed in ~/code/repo/ as flask train ...
Hey @<1523701205467926528:profile|AgitatedDove14> , thank you for your input
Could you clarify what you mean by clearml-serving session?
Are you referring to the servingTaskId?
I also did that in the following way:
- I put a sleep inside the bash script
- I ssh-ed into the fresh container and ran all the commands myself (cloning, installation), and again it worked...
I will check that. Do you think we could bypass it using Task.create and pass all the needed params?
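Roughly what I was imagining (a sketch only; the repo URL, entry point and packages are placeholders):
from clearml import Task

# Bypass automatic repository detection by passing the repo details explicitly.
task = Task.create(
    project_name="my_project",
    task_name="flask_train",
    repo="https://github.com/my-org/my-repo.git",  # placeholder repository
    branch="main",
    script="app/train.py",                         # placeholder entry point
    packages=["flask", "clearml"],                 # or a requirements.txt path
)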
@<1523701070390366208:profile|CostlyOstrich36> @<1523701205467926528:profile|AgitatedDove14> Any ideas on this one?
If I may also ask about another issue in that thread, one that is taking up a lot of my time:
Poetry Enabled: Ignoring requested python packages, using repository poetry lock file!
Creating virtualenv alfred-Rp77Shgw-py3.9 in /root/.cache/pypoetry/virtualenvs
Installing dependencies from lock file
2023-04-17 10:17:57
Package operations: 351 installs, 1 update, 1 removal
failed installing poetry requirements: Command '['poetry', 'install', '-n']' returned non-zero exit status 1.
Ignorin...
Prerequisites, PyTorch models require Triton engine support, please use docker-compose-triton.yml / docker-compose-triton-gpu.yml or if running on Kubernetes, the matching helm chart.
Sure, here is the updated clearml.conf file of the AWS autoscaler instance:
agent {
    # do not cache cloned git repositories
    vcs_cache.enabled: false
    package_manager: {
        # resolve and install packages with poetry
        type: poetry,
        poetry_version: "1.4.2",
    }
}
sdk {
    development {
        # do not compute the code diff against the remote repository
        store_code_diff_from_remote: false,
    }
}
I see uncommitted changes, whereas I would like to see none.
Thanks! So regarding question 2, it means that I can spin up a K8s cluster with triton enabled, and by specifying the type of model while creating the endpoint, it will either use the triton engine or not.
Linked to that, is the triton engine expecting the tensorrt format, or is it just an improvement step compared to other model weights?
Finally, last question (I swear 😛): what is the serving-on-Kubernetes flow supposed to look like? Is it something like this:
- Create en...
@<1523701070390366208:profile|CostlyOstrich36> The base docker image of the AWS autoscaler is nvidia/cuda:10.2-runtime-ubuntu18.04. As far as I can tell, the python version is not set inside the image, but I might be wrong and it could indeed be the problem...?
The servingTaskId is linked to the helm chart, which means that your solution would propose creating multiple Kubernetes clusters according to our requirements, no?