SuccessfulRaven86
Moderator
16 Questions, 63 Answers
  Active since 12 April 2023
  Last activity 6 months ago

Reputation

0

Badges 1

62 × Eureka!
0 Votes 7 Answers 664 Views
one year ago
0 Votes 8 Answers 721 Views
one year ago
0 Votes 4 Answers 407 Views
7 months ago
0 Votes 12 Answers 720 Views
Can we use the simple docker-compose.yml file for clearml serving on a huggingface model (not processed to tensorrt)?
one year ago
0 Votes 1 Answers 605 Views
one year ago
0 Votes 3 Answers 740 Views
Hi channel, I am using K8s clearml-serving helm chart and noticed a small issue. The current implementation of ...ingress.yaml resource does not contain the ...
one year ago
0 Votes 11 Answers 738 Views
Hello channel, Two other related questions: - ClearML is supposed to automatically detect GIT repo directly. It works when I run a python script but it does ...
one year ago
0 Votes 5 Answers 775 Views
Hello, I am trying to modify my clearml-agent running on a AWS autoscaler (From ClearML applications). I want to be able to clone my repo (working), and inst...
one year ago
0 Votes 3 Answers 420 Views
Hello! I have a small question regarding storage data retrieval with ClearML 😉 Context: My team uploads thousands of data samples for training as one ClearM...
7 months ago
0 Votes 1 Answers 675 Views
Hi, Can someone give more information about what an API call means? Our team has been charged for 10 Millions API calls, but we struggle to understand where ...
one year ago
0 Votes 4 Answers 750 Views
one year ago
0 Votes 40 Answers 23K Views
Hello channel, I am struggling a lot on an issue linked to ClearMl agent and AWS Autoscaler . This issue is very problematic and urgent, please help me out! ...
one year ago
0 Votes 0 Answers 699 Views
Hey channel, Clearml-serving question Is it good practice to save a .zip file as model, and unzip it in the custom endpoint for usage?
one year ago
0 Votes 3 Answers 713 Views
Hello, Question about the time of upload: Is it faster or exactly the same to upload 1 file of 1Gb compared to 10 files of 100 Mb?
one year ago
0 Votes 3 Answers 425 Views
7 months ago
0 Votes 5 Answers 778 Views
Hello, I have the same issue as this github issue: None I tried setting up my AWS autoscaler conf file with the following params: sdk.development.store_uncom...
one year ago
0 Hello, I Am Trying To Modify My Clearml-Agent Running On A Aws Autoscaler (From Clearml Applications). I Want To Be Able To Clone My Repo (Working), And Install My Poetry Dependencies From

Thank you for the quick replies!

I might be doing it the wrong way, but the above snippet of code is the additional clearml.conf file I add to the AWS autoscaler. Should I add a complete clearml.conf file to it?

That is a good question @<1537605940121964544:profile|EnthusiasticShrimp49> ! I am not sure the image has python 3.9. I tried to check it but did not find the answer. I am using the following AMI: AWS Deep Learning AMI (Ubuntu 18.04) with Support by Terracloudx (Nvidia deep learni...

one year ago
0 Hello, I Am Trying To Modify My Clearml-Agent Running On A Aws Autoscaler (From Clearml Applications). I Want To Be Able To Clone My Repo (Working), And Install My Poetry Dependencies From

@<1523701070390366208:profile|CostlyOstrich36> The base docker image of the AWS autoscaler is nvidia/cuda:10.2-runtime-ubuntu18.04. As far as I know, the Python version is not set inside the image, but I might be wrong, and that could indeed be the problem...?

one year ago
0 Hello Channel, Two Other Related Questions:

No problem. I guess this might be a small visualisation bug, but I really have the impression that these workers still pick up tasks, which is strange. I should test again to be sure.

one year ago
0 Hello Channel, Two Other Related Questions:

The flask command is run inside the git project, which is what makes the behavior strange. It is executed in ~/code/repo/ as flask train ...

one year ago
0 Hello Channel, Two Other Related Questions:

I tried playing with those, but I could not influence the source code detection. I can modify the env variables, but nothing happens on the ClearML server, unfortunately.

one year ago
0 Hello Channel, Two Other Related Questions:

I will check that. Do you think we could bypass it by using Task.create and passing all the needed params?
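
For illustration, the Task.create idea could be sketched roughly as below. This is only a sketch under assumptions: the project name, task name, repository URL, branch and entry script are hypothetical placeholders, and it has not been checked against the flask setup discussed in this thread. The repository details are passed explicitly instead of relying on automatic detection.

```python
from typing import Any, Dict


def build_task_create_kwargs(
    repo_url: str,
    branch: str,
    script: str,
    working_directory: str = ".",
) -> Dict[str, Any]:
    """Collect the repository details explicitly for Task.create."""
    return {
        "project_name": "example-project",  # hypothetical placeholder
        "task_name": "flask-train",         # hypothetical placeholder
        "repo": repo_url,                   # remote URL of the git repository
        "branch": branch,                   # branch the agent should check out
        "script": script,                   # entry point, relative to working_directory
        "working_directory": working_directory,
    }


def create_task(kwargs: Dict[str, Any]):
    """Create the task on the ClearML server (requires the clearml package)."""
    # Imported lazily so the helper above can be used without clearml installed.
    from clearml import Task

    return Task.create(**kwargs)
```

Calling create_task(build_task_create_kwargs(...)) would register a task that an agent can pick up later; whether this actually works around the detection issue here is an open question.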

one year ago
0 Can We Use The Simple Docker-Compose.Yml File For Clearml Serving On A Huggingface Model (Not Processed To Tensorrt)?

Sorry to come back to this! Regarding the Kubernetes Serving helm chart, I can see horizontal scaling of docker containers. What about vertical scaling? Is it implemented? More specifically, where is the SKU of the VMs in use defined?

one year ago
0 Can We Use The Simple Docker-Compose.Yml File For Clearml Serving On A Huggingface Model (Not Processed To Tensorrt)?

I basically would like to know if we can serve the model without converting it to the TensorRT format, which is highly efficient but more complicated to produce.

one year ago
0 Can We Use The Simple Docker-Compose.Yml File For Clearml Serving On A Huggingface Model (Not Processed To Tensorrt)?

In production, we should use the clearml-helm-charts, right? The docker-compose in clearml-serving is more for local testing.

one year ago
0 Can We Use The Simple Docker-Compose.Yml File For Clearml Serving On A Huggingface Model (Not Processed To Tensorrt)?

Prerequisites, PyTorch models require Triton engine support, please use docker-compose-triton.yml / docker-compose-triton-gpu.yml or if running on Kubernetes, the matching helm chart.

one year ago
0 Can We Use The Simple Docker-Compose.Yml File For Clearml Serving On A Huggingface Model (Not Processed To Tensorrt)?

I would like to know if it is possible to run any PyTorch model with the basic docker-compose file, without Triton?

one year ago
0 Hello Channel, Two Other Related Questions:

@<1523701205467926528:profile|AgitatedDove14> If you have any other insights, pls do not hesitate! Thanks a lot

one year ago
0 Hello Channel, I Am Struggling A Lot On An Issue Linked To

And I just tried with Python 3.8 (default version of the image) and it still fails.

Poetry Enabled: Ignoring requested python packages, using repository poetry lock file!
Creating virtualenv debug in /root/.clearml/venvs-builds/3.8/task_repository/clearmldebug.git/.venv
Using virtualenv: /root/.clearml/venvs-builds/3.8/task_repository/clearmldebug.git/.venv
2023-04-18 15:03:52
Installing dependencies from lock file
Finding the necessary packages for the current system
Package operation...
one year ago
0 Hello Channel, I Am Struggling A Lot On An Issue Linked To

Is it a bug inside the AWS autoscaler??

one year ago
0 Hello Channel, I Am Struggling A Lot On An Issue Linked To

How do you explain that it works when I ssh-ed into the same AWS container instance from the autoscaler?

one year ago
0 Hello Channel, I Am Struggling A Lot On An Issue Linked To

I literally connected to it at runtime, ran poetry install -n, and it worked.

one year ago
0 Hello, Question About The Time Of Upload: Is It Faster Or Exactly The Same To Upload 1 File Of 1Gb Compared To 10 Files Of 100 Mb?

For now, I am uploading to the default ClearML server to store my data. But I will soon use S3 buckets to store data. So the question is for both use cases 🙂
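
To make the question concrete, here is a rough back-of-the-envelope model, not a measurement. It assumes sequential uploads, a fixed per-file overhead (connection setup, request handling) and a constant bandwidth. The overhead and bandwidth values are hypothetical and would need to be measured for the ClearML server or for S3.

```python
def estimate_upload_seconds(
    n_files: int,
    total_megabytes: float,
    bandwidth_mb_per_s: float,
    per_file_overhead_s: float,
) -> float:
    """Estimate sequential upload time: fixed cost per file plus transfer time."""
    transfer_time = total_megabytes / bandwidth_mb_per_s
    overhead_time = n_files * per_file_overhead_s
    return transfer_time + overhead_time


# Hypothetical numbers: 100 MB/s bandwidth, 0.2 s overhead per file.
one_big_file = estimate_upload_seconds(1, 1000.0, 100.0, 0.2)
ten_small_files = estimate_upload_seconds(10, 1000.0, 100.0, 0.2)
print(f"1 x 1 GB: {one_big_file:.1f} s, 10 x 100 MB: {ten_small_files:.1f} s")
```

Under these assumptions the only difference is the per-file overhead, so the gap grows with the number of files. Parallel uploads are not modelled here and could change the picture.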

one year ago
0 Hello Channel, Two Other Related Questions:

I have my Task.init inside a train() function inside the flask command. We basically have flask commands allowing to trigger specific behaviors. When running it locally, everything works properly except the repository information. The use case is linked to the way our codebase works. For example, I am going to do flask train {arguments} and it will trigger the training of a model (that I want to track).

I stopped the autoscaler and deleted it manually. I did it because I want to test...

one year ago
0 Hello everyone, *Context:* I am currently facing a headache-inducing issue regarding the integration of flash attention V2 for LLM training. I am running a python script locally, that then runs remotely. Without the integration of flash attention, the co

Hi @<1523701087100473344:profile|SuccessfulKoala55> , the EC2 instance is spun up by the AWS autoscaler provided by ClearML. I use the following docker image: nvidia/cuda:11.8.0-devel-ubuntu20.0

So the EC2 instance runs a docker container

7 months ago
0 Hello! I Have A Small Question Regarding Storage Data Retrieval With Clearml

Another possible solution I can see is moving the data storage to an S3 bucket to improve download performance, since it is the same cloud provider. No transfer latency.

7 months ago
0 Hello, I Have The Same Issue As This Github Issue:

Sure, here is the updated clearml.conf file of the AWS autoscaler instance:

agent {
    vcs_cache.enabled: false

    package_manager: {
        type: poetry,
        poetry_version: "1.4.2",
    }
}

sdk {
    development {
        store_code_diff_from_remote: false,
    }
}

I see uncommitted changes, whereas I would like to see none.

one year ago
0 Hello, I Have The Same Issue As This Github Issue:

Ok. I spun up three AWS autoscalers, each with a different conf. I also fixed a submodule issue in my repo (which I believed was the cause of the git diff), and every run now gets past this step and fails later (not this problem). So I think store_code_diff_from_remote is of no help to me, but my problem is gone...

one year ago