I would love some advice on that though - should I be using services mode + Docker with some max number of instances to spin up multiple tasks instead?
My thinking was to avoid some of the Docker overhead, but I did try this approach previously and found that the container limit wasn't exactly respected.
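For reference, the earlier attempt looked roughly like this - a sketch only; the queue name, docker image, and the numeric cap after --services-mode are placeholders, so worth checking against clearml-agent daemon --help for your version:

```bash
# sketch of the earlier attempt: one agent in services mode, capped at N concurrent containers
# (queue name, docker image, and the numeric cap are placeholders)
clearml-agent daemon \
  --queue services \
  --docker python:3.10 \
  --services-mode 4 \
  --detached
```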
I am definitely not seeing it persist after upgrading; previously this wasn't a problem on other upgrades.
thank you!
out of curiosity: how come the clearml-webserver upgrades weren't included in this release? was it just to patch the api part of the codebase?
yeah, that's how I've been generating credentials for agents as well as for my dev environment.
App Credentials now persist (I upgraded 1.15.1 -> 1.16.1 and the same keys exist!)
thanks!
damn, it just happened again... steps shown as "queued" in the viz are actually complete. the pipeline task disappeared again without completing, and the logs cut off mid-stream.
thanks!
I've been experiencing enough weird behavior on my new deployment that I need to stick to 1.15.1 for a bit to get work done. The graphs show up there just fine, and it feels like (since I no longer need auth) it's the more stable choice right now.
When clearml-web receives the updates that are on the main branch now, I'll definitely be rushing to upgrade our images and test the latest again. (for now I'm still running a sidecar container hosting the built version of the web app o...
it happens consistently with this one task that really should be fully cached.
I disabled cache in the final step and it seems to run now.
trying to run the experiment that kept failing right now, watching the logs (they go by fast)... will try to spot anything anomalous
nothing came up in the logs. all 200s
it's happening pretty reliably, but the logs are just not informative; it just stops midway
N/A (still shows as running despite Abort being sent)
I have tried other queues, they're all running the same container.
so far the only thing reliable is pipe.start_locally()
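To spell out the launch modes I've been comparing (illustrative only; pipe is the PipelineController instance):

```python
# illustrative comparison of the launch modes
pipe.start(queue="default")    # enqueue the controller itself to run on an agent -- this is where it dies on me
pipe.start_locally()           # run the controller in the local process, steps still go to agents -- reliable so far
# pipe.start_locally(run_pipeline_steps_locally=True)  # also runs the steps locally, handy for debugging
```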
that's the final screenshot. it just shows a bunch of normal "launching ..." steps, and then stops all of a sudden.
let me downgrade my install of clearml and try again.
yeah, this problem seems to happen on 1.15.1 and 1.16.2 as well; prior runs were even on the same version. It just feels like it happens absolutely randomly (but often).
just happened again to me.
The pipeline is constructed from tasks; it basically does map/reduce: prepare data -> model training + evaluation -> backtesting performance summary.
It figures out how wide to fan out by parsing the date range supplied as an input parameter. Been running stuff like this for months but only recently did ...
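A rough sketch of the shape of the pipeline, in case it helps - project/task names and the hard-coded month list are made up; the real controller derives the fan-out from the date-range parameter:

```python
from clearml import PipelineController

# rough sketch of the structure (names are placeholders)
pipe = PipelineController(name="backtest pipeline", project="examples", version="0.0.1")

months = ["2024-01", "2024-02", "2024-03"]  # in reality parsed from the input date range

train_steps = []
for m in months:
    prep = f"prepare_{m}"
    train = f"train_eval_{m}"
    # "map": clone a template task per month for data prep, then training + evaluation
    pipe.add_step(
        name=prep,
        base_task_project="examples",
        base_task_name="prepare data",
        parameter_override={"Args/month": m},
        execution_queue="default",
    )
    pipe.add_step(
        name=train,
        parents=[prep],
        base_task_project="examples",
        base_task_name="train + evaluate",
        parameter_override={"Args/month": m},
        execution_queue="default",
    )
    train_steps.append(train)

# "reduce": summarize backtesting performance across all branches
pipe.add_step(
    name="backtest_summary",
    parents=train_steps,
    base_task_project="examples",
    base_task_name="backtest summary",
    execution_queue="default",
)

pipe.start(queue="default")  # enqueue the controller -- this is the task that keeps disappearing
```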
ugh. again. it launched all these tasks and then just died. logs go silent.
the workers connect to the clearml server via ssh-tunnels, so they all talk to "localhost" despite being deployed in different places. each task creates artifacts and metrics that are used downstream
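Roughly what the tunneling amounts to on each worker - a sketch; the host alias is a placeholder and the ports assume the default docker-compose deployment:

```bash
# sketch of the connect.sh idea (ports: 8008 api, 8080 web, 8081 fileserver by default)
ssh -N -f \
  -L 8008:localhost:8008 \
  -L 8080:localhost:8080 \
  -L 8081:localhost:8081 \
  clearml-server-host

# point the SDK / agent at the tunneled endpoints
export CLEARML_API_HOST=http://localhost:8008
export CLEARML_WEB_HOST=http://localhost:8080
export CLEARML_FILES_HOST=http://localhost:8081
```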
I really can't provide a script that matches exactly (though I do plan to publish something like this soon enough), but here's one that's quite close / similar in style:
None where I tried function-steps out instead, but it's a similar architecture for the pipeline (the point of the example was to show how to do a dynamic pipeline)
it's odd... I really don't see any tasks dying except the controller one
enqueuing it: pipe.start("default")
but I think it's picking up on my local clearml install instead of what I told it to use.
my tasks have this in them... what's the equivalent for pipeline controllers?
did you take a look at my connect.sh script? I don't think it's the culprit, since only the controller task is affected.
Is there some sort of culling procedure that kills tasks by any chance? the lack of logs makes me think it's something like that.
I can also try different agent versions.
would it be on the pipeline task itself then, since that's what's disappearing?
I will do some experiment comparisons and see if there are package diffs. thanks for the tip.
thank you very much.
for remote workers, would this env variable get parsed correctly? CLEARML_API_HTTP_RETRIES_BACKOFF_FACTOR=0.1
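If the raw env var isn't picked up, I assume the same knob can be set in clearml.conf on the worker - the key path below is my guess, so worth double-checking against the SDK defaults:

```
# clearml.conf on the worker -- assumed equivalent of the env variable (key path unverified)
api {
    http {
        retries {
            backoff_factor: 0.1
        }
    }
}
```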
so, I got around this with env vars; in my worker entrypoint script, I do
export CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL=1
export CLEARML_AGENT_SKIP_PIP_VENV_INSTALL=$(which python)
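For completeness, that part of the entrypoint looks roughly like this (simplified sketch; the queue name is a placeholder):

```bash
#!/bin/bash
# worker entrypoint (relevant excerpt, simplified)

# don't build a fresh python environment -- reuse what's already in the image
export CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL=1
# point the agent at that interpreter instead of creating a pip venv
export CLEARML_AGENT_SKIP_PIP_VENV_INSTALL=$(which python)

exec clearml-agent daemon --queue default --foreground
```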
and for what it's worth, it seems I don't have anything special for agent cloning
I did find agent.vcs_cache.clone_on_pull_fail to be helpful, but yeah, updating the agent was the biggest fix
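In clearml.conf terms that's something like this (sketch: enabled is the default, clone_on_pull_fail is the bit I found helpful):

```
# clearml.conf on the agent machine
agent {
    vcs_cache {
        enabled: true
        # fall back to a fresh clone when a cached repo fails to pull
        clone_on_pull_fail: true
    }
}
```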
not quite seeing that one. hoping these views help
took me a while to deliver enough functionality to my team to justify working on open source... but I finally got back around to investigating this to write a proper issue, and ended up figuring it out myself and opening a PR:
None