SmallTurkey79
Moderator
10 Questions, 118 Answers
  Active since 12 April 2024
  Last activity 5 months ago

Reputation

0

Badges 1

103 × Eureka!
0 Votes
7 Answers
919 Views
Thread re: Pipelines and how they're meant to be used / how long they take to orchestrate. @AgitatedDove14 I appreciated your a...
8 months ago
0 Votes
1 Answers
492 Views
in case anyone else ever comes across mongo issues using the docker compose clearml stack (in case of a messy shutdown), I have found this script to be a lif...
6 months ago
0 Votes
13 Answers
474 Views
any tips on debugging worker graphs not showing up? seems to be some js errors in the console that may be related. running localhost against 1.16.1 images
5 months ago
0 Votes
2 Answers
706 Views
9 months ago
0 Votes
1 Answers
565 Views
Hi everyone! I just wanted to bring to your attention that ClearML 1.16.0 introduced authentication for the self-hosted fileserver by default. If any of...
6 months ago
0 Votes
14 Answers
691 Views
8 months ago
0 Votes
9 Answers
695 Views
why does clearml still waste time on requirement analysis when I provide them? any tips for how I can reduce clearml overhead ... (the time before work actua...
6 months ago
0 Votes
31 Answers
20K Views
I noticed after upgrading to the latest clearml that App Credentials now disappear on restart. Is this an intentional design choice? I'm in a bit of a chicke...
6 months ago
0 Votes
43 Answers
21K Views
5 months ago
0 Votes
54 Answers
26K Views
I have set export CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL=true export CLEARML_AGENT_SKIP_PIP_VENV_INSTALL=true in my entrypoint.sh (which runs clearml-agent da...
8 months ago
0 I Have Set

i was having a ton of git clone issues - disabled caching entirely... wonder if that may help too.

tysm for your help! will report back soon.

8 months ago
0 I Have Set

minute of silence between first two msgs and then two more mins until a flood of logs. Basically 3 mins total before this task (which does almost nothing - just using it for testing) starts.

8 months ago
0 I Have Set

fwiw - i'm starting to wonder if there's a difference between me "resetting the task" vs cloning it.

8 months ago
0 Thread Re: Pipelines And How They're Meant To Be Used / How Long They Take To Orchestrate.

mind-blowing... but somehow just later in the same day I got the same pipeline to create its DAG and start running in under a minute.

I don't know what exactly I changed. The pipeline task was run locally (which I've never done before), then cloned to run remotely in my services queue. And then it just flew through the experiment at the pace I expected.

so there's hope. i'll keep stress-testing it and see what causes differences. I was right to suspect that such a simple DAG should not take...

8 months ago
0 I Have Set

oooh thank you, i was hoping for some sort of debugging tips like that. will do.

from a speed-of-clearing-a-queue perspective, is a services-mode queue better or worse than having many workers "always up"?

8 months ago
0 I Have Set

I'm just working on speeding up the time from "queue experiment" to "my code actually runs remotely" - as of yesterday things would sit for many minutes at a time. trying to see if venv is the culprit.

8 months ago
0 I Have Set

thank you!
i'll take that design into consideration.

re: CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL in "docker venv mode" i'm still not quite sure I understand correctly - since the agent is running in a container, as far as it is concerned it may as well be on bare-metal.

is it just that there's no way for that worker to avoid venv? (i.e. the only way to bypass venv is to use docker-mode?)

8 months ago
0 I'm Having A Hard Time With Git Cloning + Cache For A Private Repo Accessed Via Personal Access Token. This Happens 100% Of The Time, Across Both Bitbucket + Github. I Have A Simple "Hello World" Task In A Private Repo. The Worker Is Running In A Docker

I can see agent.vcs_cache.enabled = true as a printout in the Console, but cannot find docs on how to set this via environment variable, since I'm trying to keep these containers from needing a clearml.conf file (though I can generate one in the entrypoint script if need be with <<EOF )

8 months ago
0 Thread Re: Pipelines And How They're Meant To Be Used / How Long They Take To Orchestrate.
# imports

...

if __name__ == "__main__":

    pipe = PipelineController(...)

    # after instantiation, before "the code" that creates the pipeline.
    # normal tasks can handle task.execute_remotely() at this stage...
    pipe = add_steps_to_pipe(pipe)
    ...

    # after the pipeline is defined. best I can tell, *has* to be last thing in code.
    pipe.start_locally()  # or just .start()
8 months ago
0 I Have Set

is there a way for me to toggle CLEARML's log level? I'm doing some manual task-debugging in ipython and think it would be helpful to see network requests and timeouts if they're occurring.
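
One generic way to surface network activity while debugging in ipython is to raise the level of the standard-library loggers used by the HTTP stack. This is a minimal sketch, not a documented ClearML setting: it assumes the SDK sends its requests through `requests`/`urllib3`, and the helper name `enable_http_debug_logging` is made up for illustration.

```python
import logging


def enable_http_debug_logging(logger_names=("urllib3", "requests")):
    """Set the named loggers to DEBUG so connection and retry messages are printed.

    Assumption: the client library talks HTTP via requests/urllib3,
    which log through the standard `logging` module under these names.
    """
    # Install a root handler (only if none exists yet) so records are shown.
    logging.basicConfig(format="%(asctime)s %(name)s %(levelname)s %(message)s")
    loggers = [logging.getLogger(name) for name in logger_names]
    for logger in loggers:
        logger.setLevel(logging.DEBUG)
    return loggers


loggers = enable_http_debug_logging()
```

Calling the same helper with `logging.WARNING` substituted for `logging.DEBUG` would quiet the output again once the slow request has been found.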

8 months ago
0 Question About Pipeline : My Setup Is As Follow:

I think of draft tasks as "class definitions" that the pipeline uses to create task "objects" out of.

6 months ago
0 I Have Set

oh it's there, before running task.

from task pick-up to "git clone" is now ~30s, much better.

though as far as I understand, the recommendation is still to not run workers-in-docker like this:

export CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL=1
export CLEARML_AGENT_SKIP_PIP_VENV_INSTALL=$(which python)

(and fwiw I have this in my entrypoint.sh )

cat <<EOF > ~/clearml.conf
agent {
    vcs_cache {
        enabled: true
    }

    package_manager: {
        type: pip,
      ...
8 months ago
0 Could I Get Some Feedback From People With Experience Using Clearml Pipelines On The Best Way To Handle Caching? My Team Is Working On Configuring Clearml Pipelines For Our Data Processing Workflow. We Currently Have An Experimental Pipeline Configured F

It sounds like you understand the limitations correctly.

As far as I know, it'd be up to you to write your own code that computes the delta between old and new and only re-process the new entries.

The API would let you search through prior experimental results.

so you could load up the prior task, check the ids that showed up in output (maybe you save these as a separate artifact for faster load times), and only evaluate the new inputs. perhaps you copy over the old outputs to the new task...
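
The delta idea above can be sketched in plain Python. This is only an illustration under assumptions: `previous_outputs` stands in for whatever was loaded from the prior task's saved artifact (the loading itself is not shown), and the function names are hypothetical.

```python
def split_new_and_cached(current_ids, previous_outputs):
    """Return (ids that still need processing, outputs reusable from the prior run)."""
    reusable = {i: previous_outputs[i] for i in current_ids if i in previous_outputs}
    todo = [i for i in current_ids if i not in previous_outputs]
    return todo, reusable


def process_incrementally(current_ids, previous_outputs, process_fn):
    """Reuse old outputs where possible and run process_fn only on new ids."""
    todo, results = split_new_and_cached(current_ids, previous_outputs)
    for item_id in todo:
        results[item_id] = process_fn(item_id)
    return results, todo


# Hypothetical data: "a" and "b" were handled by the prior task, "c" is new.
previous = {"a": 1, "b": 2}
results, processed = process_incrementally(["a", "b", "c"], previous, process_fn=len)
```

Saving `results` as an artifact on the new task would then make it the "prior task" for the next run.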

5 months ago
0 Thread Re: Pipelines And How They're Meant To Be Used / How Long They Take To Orchestrate.

i understood that part, but noticed that when putting in the code to start remotely, the consequence seems to be that the dag computation happens twice - once on my machine as it runs, and then again remotely (this is at least part of why it's slower). if i put pipe.start earlier in the code, the pipeline fails to execute the actual steps.

this is unlike tasks, which somehow are smart enough to publish in draft form when task.execute_remotely is up top.

do i just leave off pipe.start?

8 months ago
0 I Noticed After Upgrading To The Latest Clearml That App Credentials Now Disappear On Restart. Is This An Intentional Design Choice? I'm In A Bit Of A Chicken-And-Egg Situation: Trying To Generate Valid Keys For

when I do a docker compose down; docker compose up -d ... these disappear.
to be clear... this was not happening before I upgraded to the latest version. That is why I am asking about this.

6 months ago
0 I Have Set

ha! yup. that was it exactly. I posted about it too lol

8 months ago
0 I Dont Exactly Know How To Ask For Help On This... Nor Have A Reproducible Minimal Example... I Downgraded Back To 1.15.1 From 1.16.2 And Have The Same Issue There. I Have A Pipeline That's Repeatedly Failing To Complete. It Correctly Marks Things As Cach

yeah locally it did run. I then ran another via UI spawned from the successful one, it showed cached steps and then refused to run the bottom one, disappearing again. No status message, no status reason. (not running... actually dead)

5 months ago
0 I'm Having A Hard Time With Git Cloning + Cache For A Private Repo Accessed Via Personal Access Token. This Happens 100% Of The Time, Across Both Bitbucket + Github. I Have A Simple "Hello World" Task In A Private Repo. The Worker Is Running In A Docker

and for what it's worth it seems I don't have anything special for agent cloning

i did find agent.vcs_cache.clone_on_pull_fail to be helpful. but yah, updating the agent was the biggest fix

6 months ago
0 Hello, Are There Any Resources For Trying To Reduce The Number Of Api Calls? I Am Trying Out Clear Ml And With Just 20 Epochs It Says There Have Been 80K Api Calls

I would assume a lot of them are logs streaming? So you can try reducing printouts / progress bars. That seems to help for me.

For context: I have noticed the large number of API calls can be a problem when networking is unreliable. It causes a cascade of slow retries and can really hold up task execution. So do be cautious of where work is occurring relative to where the server is, and what connects the two.
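
As a rough illustration of "reducing printouts", a training loop can rate-limit its own progress lines so that fewer console lines are streamed. This is a sketch under assumptions: `ThrottledReporter` is a hypothetical helper, not part of ClearML, and whether it lowers the API call count depends on how much of the traffic really is console output.

```python
import time


class ThrottledReporter:
    """Emit a progress line at most once per `min_interval` seconds."""

    def __init__(self, min_interval=30.0, clock=time.monotonic, emit=print):
        self._min_interval = min_interval
        self._clock = clock
        self._emit = emit
        self._last = None  # time of the last emitted line, None before the first

    def report(self, message, force=False):
        """Emit `message` if enough time has passed; return True if it was emitted."""
        now = self._clock()
        if force or self._last is None or now - self._last >= self._min_interval:
            self._emit(message)
            self._last = now
            return True
        return False


# Demo with a fake clock so the behaviour is deterministic.
fake_time = iter([0.0, 1.0, 31.0])
lines = []
reporter = ThrottledReporter(min_interval=30.0, clock=lambda: next(fake_time), emit=lines.append)
for step in range(3):
    reporter.report(f"step {step}")
```

In real use the defaults (`time.monotonic` and `print`) apply, and `force=True` can be passed for the final summary line so it is never dropped.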

6 months ago