You can always bake the relevant configurations into the Docker image itself as well. From my understanding, a new version should be released towards the end of the month, and with it the ability to run the autoscaler without requiring a Docker image.
Hi @<1840924578885406720:profile|VictoriousFish46> , how are you uploading the dataset? Did you set output_uri? What is set as the files server in the api section of your clearml.conf?
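For reference, the files server is configured in the api section of clearml.conf; a typical entry looks like the sketch below (the URLs are an assumption for a default local deployment - substitute your own server addresses):

```
api {
    # assumption: default ports for a local ClearML server deployment
    web_server: http://localhost:8080
    api_server: http://localhost:8008
    files_server: http://localhost:8081
}
```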
That or a private docker registry
Hi @<1815919815257231360:profile|UpsetFrog68> , can you provide a standalone code snippet that would reproduce this occasional behaviour?
UnevenDolphin73 , can you please provide a screenshot of the window, the message, and the URL that's visible?
UnevenDolphin73 , sorry for the delay 🙂
Please go to the profile page, hit F12 and do CTRL+F5
In the 'Network' tab there should be a call to server.info. Can you please copy-paste the response here?
Also, can you paste the contents of your docker-compose file here?
Yes, this will cause the code to run inside the container.
if so it won't work as my environment is on the host linux
Not sure I understand this part, can you please elaborate?
Hi @<1708653001188577280:profile|QuaintOwl32> , you can set some default image to use. My default for most jobs is nvcr.io/nvidia/pytorch:23.03-py3
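If you want agents to fall back to a default image automatically, it can also be set in the agent section of clearml.conf. A sketch (using the same image mentioned above):

```
agent {
    default_docker {
        # image used when a task doesn't specify its own
        image: "nvcr.io/nvidia/pytorch:23.03-py3"
    }
}
```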
CrookedWalrus33 , you can set output_uri=True in Task.init. This should upload the models to the fileserver, since by default models are only saved locally.
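The machine-wide equivalent of passing output_uri=True to Task.init is the default_output_uri setting in clearml.conf (the URL below assumes a default local fileserver - replace it with your own storage target):

```
sdk {
    development {
        # upload model artifacts here instead of keeping them local
        default_output_uri: "http://localhost:8081"
    }
}
```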
Now try logging in
Hi @<1710827340621156352:profile|HungryFrog27> , what seems to be the issue?
@<1570583227918192640:profile|FloppySwallow46>
It looks like you're running on different machines and the file your code is looking for is not available on the other machine
Can you add such an attempt and the outputs please?
Hi @<1724960468822396928:profile|CumbersomeSealion22> , what was the structure that worked previously for you and what is the new structure?
DepressedFish57 , Hi 🙂
What do you mean by downloading a previous part of the dataset? get_local_copy fetches the entire dataset if I'm not mistaken. Am I missing something?
Hi @<1858319200146165760:profile|PoisedDeer30> , can you provide a standalone snippet that reproduces this behaviour?
Also do you have a log of this? From where did you delete it?
Hi @<1552101447716311040:profile|SteadySeahorse58> , if the experiment is still in pending mode it means that it wasn't picked up by any worker. Please note that in a pipeline, the controller usually runs on the services queue, while the steps can each run on different queues - depending on what you set.
Hi JumpyRabbit71 , I think each step has its own requirements
Hi @<1731483438642368512:profile|LoosePigeon2> , you need to set the following:
sdk: {
  development: {
    store_code_diff_from_remote: false
    store_uncommitted_code_diff: false
  }
}
On the machine you're running your pipeline from
SmallDeer34 , great, thanks for the info 🙂
Hi @<1631102016807768064:profile|ZanySealion18> , I would suggest using the web UI as a reference. Open developer tools and check what is being sent/received when looking at the workers/queues pages
Can you please open developer tools (F12) and see what is returned in network when you try to do this?
Hi @<1639799308809146368:profile|TritePigeon86> , if I understand you correctly, you're basically looking for a switch in pipelines (per step) to say "even if step failed, continue the pipeline"?
Hi @<1533159639040921600:profile|JoyousReindeer30> , the pipeline controller is currently pending. I am guessing it is enqueued into the services queue. You would need to run an agent on the services queue for the pipeline to start executing 🙂
UnevenDolphin73 , if you're launching the Autoscaler through the apps, you can also add bash init script or additional configs - that's another way to inject env vars 🙂
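For example, the bash init script field in the Autoscaler app runs on the instance before the agent starts, so it can be used to export env vars (the variable values below are placeholders):

```shell
# Runs on the autoscaled instance before the agent starts.
# Values are placeholders - substitute your own credentials.
export CLEARML_AGENT_GIT_USER="my-git-user"
export CLEARML_AGENT_GIT_PASS="my-git-token"
```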
Can you please open a GitHub issue so we can follow up on this?