HungryFrog27

8 Questions, 13 Answers

Active since 10 June 2024

Last activity 10 months ago

Reputation

Badges 1

13 × Eureka!

0 Votes

3 Answers

813 Views

Hello All, General Question - We Currently Intend To Move Our ClearML Server From A Self-Hosted One To The SaaS-Based Server, In Tandem With A Local Agent. We'd Like To Validate Two Things Before We Migrate Though -

Hello all, general question - We currently intend to move our ClearML server from a self-hosted one to the SaaS-based server, in tandem with a local...

clearml

10 months ago

0 Votes

6 Answers

1K Views

Hello Everyone! We've Been Using ClearML For A Couple Weeks Now With Everything Working Out Fine, But In Recent Days We've Run Into An Issue With Parameter Optimizer Tasks - Without Any Apparent Change On Our Side, All Child Tasks Of The Optimizer Are Abo

Hello everyone! We've been using ClearML for a couple weeks now with everything working out fine, but in recent days we've run into an issue with parameter o...

clearml

one year ago

0 Votes

0 Answers

2K Views

Hey! We're Currently Running ClearML On-Premise, With A Helm Chart Deployment For The Server Itself And A Bare-Metal Deployment For The Agent - On A Different Compute Core. We've Set Up An SSH Key For A GH User With Access To Our Private Repos We Wish

Hey! We're currently running ClearML on-premise, with a Helm chart deployment for the server itself and a bare-metal deployment for the agent - on a differen...

mlops

one year ago

0 Votes

11 Answers

2K Views

Hey There! I'm Encountering An Odd Issue - I'm Running My Agents As Python Processes On Windows PC Endpoints. I Recently Had A Bug That Forced Me To Delete All Cache And All (Non-Core) Venv-Builds. My First Booted Agent Uses The "First" Venv-Build

Hey there! I'm encountering an odd issue - I'm running my agents as Python processes on Windows PC endpoints. I recently had a bug that forced me to delete...

mlops

one year ago

0 Votes

2 Answers

1K Views

Hey! We've Set A Hyperparameter Optimization Task On One Of Our Experiments, And It Seems As Though The Optimization Itself Is Running - Yet We Don't See The Iterations Table In The 'Plots' Section In The UI. Providing Some Screenshots To How We've Set I

Hey! We've set a hyperparameter optimization task on one of our experiments, and it seems as though the optimization itself is running - yet we don't see th...

clearml

one year ago

0 Votes

1 Answer

769 Views

Hey! As Part Of Migrating From The On-Premise Server To The SaaS Solution Server, We'd Need To Migrate Some Of The Workloads & Experiment Log From ClearML. Is It Possible?

Hey! As part of migrating from the on-premise server to the SaaS solution server, we'd need to migrate some of the workloads & experiment log from ClearML. I...

clearml

10 months ago

0 Votes

0 Answers

829 Views

Hi All, I'm Running The ClearML Helm Chart Over ArgoCD. When I Apply The Same Chart Over Docker Desktop Everything Works Fine, But When I Apply It Over ArgoCD, The Readiness Probe Of The MongoDB Pod (Part Of The MongoDB Chart That's Part Of The ClearML Di

Hi all, I'm running the ClearML Helm Chart over ArgoCD. When I apply the same chart over Docker Desktop everything works fine, but when I apply it over ArgoC...

clearml

11 months ago

0 Votes

4 Answers

1K Views

Hey! Has Anyone Had Experience With Setting Up ClearML K8s-Based Agents To Create K8s Jobs Connected To The Node's GPU? Running K3s Over A Local Server. Thanks, As This Is Currently Blocking Us

Hey! Has anyone had experience with setting up ClearML k8s-based agents to create k8s jobs connected to the node's GPU? Running k3s over a local server. Thank...

clearml

11 months ago

0 Hey There! I'm Encountering An Odd Issue - I'm Running My Agents As Python Processes On Windows PC Endpoints. I Recently Had A Bug That Forced Me To Delete All Cache And All (Non-Core) Venv-Builds. My First Booted Agent Uses The "First" Venv-Build

Does this provide any more context? @<1523701087100473344:profile|SuccessfulKoala55>

one year ago

Huh, that is weird. Is there any way to force deletion of it? It seems it's still being held by some task, and the server has been restarted several times since.

one year ago

0 Hey! Has Anyone Had Experience With Setting Up ClearML K8s-Based Agents To Create K8s Jobs Connected To The Node's GPU? Running K3s Over A Local Server. Thanks, As This Is Currently Blocking Us

Not yet, I tried making it work manually. Might give it a try, thanks!

11 months ago

0 Hello All, General Question - We Currently Intend To Move Our ClearML Server From A Self-Hosted One To The SaaS-Based Server, In Tandem With A Local Agent. We'd Like To Validate Two Things Before We Migrate Though -

Hey John,
thanks for 1!
Regarding 2: your detailed explanation answered my question, thanks!

10 months ago

I can

one year ago

0 Hello Everyone! We've Been Using ClearML For A Couple Weeks Now With Everything Working Out Fine, But In Recent Days We've Run Into An Issue With Parameter Optimizer Tasks - Without Any Apparent Change On Our Side, All Child Tasks Of The Optimizer Are Abo

Hey @<1523701087100473344:profile|SuccessfulKoala55> , 1.8.1

one year ago

0 Hey! Has Anyone Had Experience With Setting Up ClearML K8s-Based Agents To Create K8s Jobs Connected To The Node's GPU? Running K3s Over A Local Server. Thanks, As This Is Currently Blocking Us

Hi @<1523701070390366208:profile|CostlyOstrich36>,
I tried setting requests & limits under the k8sGlue configuration in the clearml-agent Helm chart values, in order to force the pods to pick up the GPU from the server, while of course choosing a GPU-capable pod image for the k8s jobs (we're using nvidia/cuda:12.4.1 for testing).

The job is created, but it simply can't detect a GPU. Attaching the value overrides I'm using for the chart -

agentk8sglue:
          apiServer...

11 months ago

@<1523701087100473344:profile|SuccessfulKoala55> Thanks, I'll check around that!

one year ago

0 Hey! We've Set A Hyperparameter Optimization Task On One Of Our Experiments, And It Seems As Though The Optimization Itself Is Running - Yet We Don't See The Iterations Table In The 'Plots' Section In The UI. Providing Some Screenshots To How We've Set I

Redacted log attached.

one year ago

Hi @<1523701087100473344:profile|SuccessfulKoala55> - Each worker uses its own venv-builds folder

one year ago

It would definitely seem that way, although when I look at the error logs, the failure is actually in creating the venv-build folder (under the task_repository subfolder):

Repository cloning failed: [WinError 183] Cannot create a file when that file already exists: 'C:\\Users\\clearml-admin\\.clearml\\venvs-builds\\3.1\\task_repository\\<repo>.git'

one year ago
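
For context on the error above: `[WinError 183]` is what Python surfaces as `FileExistsError` on Windows when a target path is already there. The following is only a minimal sketch of clearing a stale folder before it is recreated, assuming nothing else holds it open. `reset_dir` is a hypothetical helper name, not part of ClearML or clearml-agent, and it will not help if another process still has the directory locked.

```python
import os
import shutil
import stat


def reset_dir(path):
    """Delete `path` if it exists, then recreate it empty (hypothetical helper)."""
    if os.path.isdir(path):
        # Files under a .git folder are often read-only, which makes
        # deletion fail on Windows, so make every file writable first.
        for root, _dirs, files in os.walk(path):
            for name in files:
                os.chmod(os.path.join(root, name), stat.S_IREAD | stat.S_IWRITE)
        shutil.rmtree(path)
    # The folder is gone (or never existed), so this cannot raise FileExistsError.
    os.makedirs(path)
    return path
```

Calling `reset_dir` on a leftover `task_repository` folder would give the next clone an empty target, as long as no running process is still using it.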

Hey!
I see in my agent debug logs that it's constantly dropping the connection with the ClearML Server. I also see my tasks being aborted as User aborted (3) - Just at the point where the (post requirements) venv is added into the local venv cache. Could there be any connection? And if not, does anyone have any clue as to where to continue my debugging?

![image](https://clearml-web-assets.s3.am...

one year ago

But again, the system denies my deletion request since it deems the venv-builds dir to be in use.

one year ago