I think I've narrowed this down to the SSH connection approach.
regarding the container that runs the pipeline:
- when I stopped routing it through autossh tunnels and instead put it on the same machine as the ClearML server (using Docker host network mode), the problematic pipeline suddenly started completing.
It's just so odd that the pipeline controller task is the only one with an issue; the modeling / data-creation tasks all seem to complete consistently just fine.
so yeah, best guess n...
The pipeline is there to orchestrate tasks into more complex functionality and to take advantage of caching, yes.
Here I run backtesting (how well did I predict the future?) and can control the frequency: every week, every month, etc.
So if I increase the frequency, I don't need to rerun certain branches of the pipeline; they get served from cache. Another example: if I change something that impacts layer 3 but not layers 1-2, then about half my tasks are cached.
the pictured pipeline is: "create data...
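To give a feel for the shape (a rough sketch, not my actual code - the step functions, names, and project are placeholders):

from clearml import PipelineController

# placeholder step functions standing in for the real layers
def create_data(frequency: str):
    return {"freq": frequency}

def train(dataset):
    return {"trained_on": dataset}

def backtest(model):
    return {"score": 0.0}

pipe = PipelineController(name="backtest-pipeline", project="examples", version="0.0.1")
pipe.add_parameter(name="frequency", default="monthly")

# layer 1: build the dataset
pipe.add_function_step(
    name="create_data",
    function=create_data,
    function_kwargs={"frequency": "${pipeline.frequency}"},
    function_return=["dataset"],
    cache_executed_step=True,  # unchanged code + inputs => reuse the cached run
)

# layer 2: depends only on layer 1's output
pipe.add_function_step(
    name="train",
    function=train,
    function_kwargs={"dataset": "${create_data.dataset}"},
    function_return=["model"],
    cache_executed_step=True,
)

# layer 3: changing only this layer leaves layers 1-2 cached
pipe.add_function_step(
    name="backtest",
    function=backtest,
    function_kwargs={"model": "${train.model}"},
    cache_executed_step=True,
)

# (the launch call - pipe.start / start_locally - is the part discussed below)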
I understood that part, but noticed that when I put in the code to start remotely, the DAG computation seems to happen twice: once on my machine as it runs, and then again remotely (this is at least part of why it's slower). If I put pipe.start earlier in the code, the pipeline fails to execute the actual steps.
This is unlike tasks, which somehow are smart enough to register themselves in draft form when task.execute_remotely is at the top.
Do I just leave off pipe.start?
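For reference, the two launch patterns I'm comparing (continuing the sketch above; this is my understanding, so take it with a grain of salt):

from clearml import PipelineController

pipe = PipelineController(name="backtest-pipeline", project="examples", version="0.0.1")
# ... add_function_step calls as in the sketch above ...

# Option A: keep the controller logic on this machine and only enqueue the steps
# pipe.start_locally(run_pipeline_steps_locally=False)

# Option B: enqueue the controller task itself to the services queue,
# which (as observed) rebuilds the DAG remotely
pipe.start(queue="services")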
I did manage to figure this out with
docker compose stop agent-services
docker compose up --force-recreate --no-deps -d agent-services
and running an export for the newly generated key.
Still, though, I'm noticing that restarts cause the App Credentials to be lost.
I really can't provide a script that matches exactly (though I do plan to publish something like this soon enough), but here's one that's quite close / similar in style:
None where I tried function-steps out instead, but it's a similar architecture for the pipeline (the point of the example was to show how to do a dynamic pipeline)
It's odd... I really don't see any tasks dying except the controller one.
thank you!
out of curiosity: how come the clearml-webserver upgrades weren't included in this release? was it just to patch the api part of the codebase?
@<1523701868901961728:profile|ReassuredTiger98> I'd suggest trying to set up different queues for different repos then, each with a read-only token. The issue really only arises when you want a single token to grant access to many repos. A little inconvenient, but definitely possible.
I'll also say I've had an easy time forking and modifying the agent code for custom logic changes, so you can always consider that option as well. It's easy enough to read, to be honest.
I would assume a lot of them are logs streaming? So you can try reducing printouts / progress bars. That seems to help for me.
For context: I have noticed the large number of API calls can be a problem when networking is unreliable. It causes a cascade of slow retries and can really hold up task execution. So do be cautious of where work is occurring relative to where the server is, and what connects the two.
ah, I'm self-hosting.
Progress bars could easily account for several thousand calls, since they update with each batch.
I'd love to know if the number of API calls decreases substantially when turning off auto_connect_streams. Please post an update when you have one!
For me, the fix was to raise the log level and reduce the number of prints my code was doing. Since I was using a logger instead of prints, it was pretty easy.
If you're using some framework that spits out its own progress bars, then I'd look into whatever options it has for disabling them.
As for turning off logs entirely, I don't know; I'll let the ClearML folks respond to that.
For sure, though, the CPU-monitoring and epoch-monitoring comms will lead to a lot of calls... but I'll agree 80k seems exce...
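For what it's worth, the two knobs being discussed look roughly like this (a sketch; as far as I know auto_connect_streams also accepts a dict if you only want to mute stdout):

import logging
from clearml import Task

task = Task.init(
    project_name="examples",
    task_name="quiet-run",
    # stop capturing stdout/stderr console streams (progress bars updating every
    # batch are a big source of the reported API traffic)
    auto_connect_streams=False,
)

# and/or raise the log level so routine progress messages are never emitted
logging.getLogger().setLevel(logging.WARNING)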
It sounds like you understand the limitations correctly.
As far as I know, it'd be up to you to write your own code that computes the delta between old and new and only re-process the new entries.
The API would let you search through prior experimental results.
So you could load up the prior task, check the IDs that showed up in its output (maybe save these as a separate artifact for faster load times), and only evaluate the new inputs. Perhaps you'd copy the old outputs over to the new task...
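Something in this direction (a sketch only - the project, task, and artifact names are made up):

from clearml import Task

# load the previous run and the set of input IDs it already processed,
# assuming that run saved them as an artifact named "processed_ids"
prev_task = Task.get_task(project_name="examples", task_name="evaluate-inputs")
seen_ids = set(prev_task.artifacts["processed_ids"].get())

current = Task.init(project_name="examples", task_name="evaluate-inputs")

new_inputs = {"id-1": "payload-1", "id-2": "payload-2"}  # whatever the current inputs are
todo = {k: v for k, v in new_inputs.items() if k not in seen_ids}

# ... evaluate only `todo` here, optionally copying the old outputs across ...

# save the updated ID set so the next run can diff against it
current.upload_artifact("processed_ids", list(seen_ids | set(todo)))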
FWIW, I'm starting to wonder if there's a difference between me "resetting" the task vs. cloning it.
Update: ever since turning off git caching, I've had much more stability. I can't tell whether it's causing a slowdown in task execution though - is the clone a shallow one by default?
Pipeline step caching matches on inputs and task status. If your task points to the latest commit, ClearML can't know what that is until runtime, so it can't cache. With a fixed tag or commit, it sees that no code has changed, and so if the inputs match (hashable, all parameters serializable), then it caches.
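Concretely, that means pinning the step's code reference, e.g. (assuming your clearml version exposes repo / repo_commit on add_function_step; the repo URL and tag here are placeholders):

from clearml import PipelineController

def create_data(frequency: str):
    return {"freq": frequency}

pipe = PipelineController(name="cached-pipeline", project="examples", version="0.0.1")

pipe.add_function_step(
    name="create_data",
    function=create_data,
    function_kwargs={"frequency": "monthly"},
    cache_executed_step=True,                           # a cache hit needs matching code + inputs
    repo="https://github.com/your-org/your-repo.git",   # placeholder
    repo_commit="v1.2.0",                               # fixed tag/commit => code hash known up front
)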
It's happening pretty reliably, but the logs are just not informative. It just stops midway.
For digitalocean:
host: "(region).digitaloceanspaces.com:443"
bucket: "(bucket name)"
key: "(key)"
secret: "(secret)"
multipart: false
secure: true
(verify commented out entirely)
So for you: make sure to add credentials that have the right scope (read/write), and try specifying the bucket.
Then in the ClearML tasks themselves, you tell the task where to write using output_uri="s3://(region).digitaloceanspaces.com:443/clearml/"
(I import this as a constant from a _constants.py file...
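Roughly like this (the constant name and project/task names are just my convention, nothing ClearML-specific):

from clearml import Task

# in my case this lives in _constants.py and gets imported wherever tasks are created
DO_SPACES_OUTPUT_URI = "s3://(region).digitaloceanspaces.com:443/clearml/"

task = Task.init(
    project_name="examples",
    task_name="train-model",
    output_uri=DO_SPACES_OUTPUT_URI,  # artifacts and models get uploaded to the bucket
)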
Basically the git hash of the executed experiment plus a hash of the inputs to the task.
But pretty reliably, some proportion of tasks still just take much longer. A variance of 1m to 10m is something I'd really like to understand.
Everything I just said comes from the screenshotted webpage and concerns the CLEARML_API_ACCESS_KEY and CLEARML_API_SECRET_KEY env vars.
When I restart the ClearML server, the keys disappear. This was not the case before upgrading.
When I do a docker compose down; docker compose up -d ... these disappear.
To be clear... this was not happening before I upgraded to the latest version. That is why I am asking about this.

Damn, I can't believe it. It disappeared again, despite the task's clearml version being 1.15.1.
I'm going to try running the pipeline locally.
What if the preexisting venv is just the system Python? My base image is python:3.10.10 and I just pip install all requirements into that image. Does that still not avoid creating a venv?
It's good to know that in theory there's a path forward with almost zero overhead. That's what I want.
Is it reasonable to expect that, with sufficient workers, I can get 50 tasks to run in the same time it takes to run a single one? I can't imagine the apiserver being a noticeable bottleneck.
I ran into this recently.
It's a small thing, but double-check the port: it should be 443, not 433 as in the docs (typo?) - seems you got this right in the screenshot.
No region should be set.
I don't use Backblaze, but if it helps I can show my DigitalOcean Spaces config; it should be comparable.
So, I got around this with env vars.
In my worker entrypoint script, I do
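# skip building a venv and point the agent at the interpreter already in the image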
export CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL=1
export CLEARML_AGENT_SKIP_PIP_VENV_INSTALL=$(which python)
N/A (still shows as running despite Abort being sent)
It happens consistently with this one task, which really should be entirely cached.
I disabled cache in the final step and it seems to run now.
Mind-blowing... but somehow, later the same day, I got the same pipeline to create its DAG and start running in under a minute.
I don't know what exactly I changed. The pipeline task was run locally (which I've never done before), then cloned to run remotely in my services queue. And then it just flew through the experiment at the pace I expected.
So there's hope. I'll keep stress-testing it and see what causes the differences. I was right to suspect that such a simple DAG should not take...
okay that's a similar setup to mine... that's interesting.
much more in line with my expectation.
