SoreSparrow36

Moderator

4 Questions, 47 Answers

Active since 21 July 2023

Last activity 2 months ago

Reputation

Badges 1

22 × Eureka!

0 Votes

5 Answers

1K Views

How Can I Control The

How can I control the ~/clearml.conf file being used by agent-services in the docker-compose stack for clearml-server? Namely, if I enqueue a task, I notice...

clearml

one year ago

0 Votes

3 Answers

480 Views

Is There Somewhere I Can Track Upcoming Releases By Any Chance? Trying To Plan An Upgrade Of Our Services. Namely I'm Wondering If I Need To Continue Using My Own Forked Image Of

is there somewhere I can track upcoming releases by any chance? Trying to plan an upgrade of our services. namely I'm wondering if I need to continue using m...

clearml

5 months ago

0 Votes

19 Answers

1K Views

I Just Encountered A Really Frightening Bug. Best I Can Explain What Happened Was This: Data Scientist Created New Venv, Installed Clearml==1.11.0 Instead Of Clearml[S3]==1.11.1, And Upon Re-Running A Pipeline From Cli, The Entire Project "Disappeared" (W

I just encountered a really frightening bug. Best I can explain what happened was this: Data scientist created new venv, installed clearml==1.11.0 instead of...

clearml

one year ago

0 Votes

10 Answers

272 Views

Does Anyone Have Experience With Integrating Clearml And Slurm? If So, What Pattern Did You Use? (Did You Submit Tasks And Just Use Clearml As Tracker, Or Did You Start Agents With Slurm?) Would Love To Hear From The Community Before Trying To Diy

does anyone have experience with integrating clearml and slurm? if so, what pattern did you use? (did you submit tasks and just use clearml as tracker, or di...

clearml

2 months ago

0 Is There Somewhere I Can Track Upcoming Releases By Any Chance? Trying To Plan An Upgrade Of Our Services. Namely I'm Wondering If I Need To Continue Using My Own Forked Image Of

namely, I'm very interested in testing this unmerged feature, will be trying to leverage it as soon as possible

5 months ago

0 Hello! I Created A

credentials for the server to do things with s3 will be in /opt/clearml/apiserver.conf.

one year ago

0 Hello! I Created A

Might be under examples

one year ago

0 Hi All

oh i see. you're talking about the agent-services, not a separate agent in a container.
yup, I've got the same thing going there.
fwiw...
for me, HOST_IP is 0.0.0.0 and the other "HOSTS" env vars don't contain "http" in them.
and my server is publicly reachable, not sure if that matters either.

one year ago

0 Hey All, Very New To Clearml! I Am Trying To Design An Hpo Setup Using The Optuna Configuration, And I'm Working On Getting My Template Trainer Set Up. The Issue I'm Having Is It's Unclear To Me How To Define One Of My Hyperparameters Whose Size Is Dynami

you could also take the route of NOT specifying num_layers, and instead write your own code to create a set of viable layer designs to choose from and pass that as a parameter, so optuna selects from a countable set instead of suggesting integer values.

the downside of this is the lack of gradient information in the optimization process
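
A minimal sketch of that idea in plain Python. Everything here is an assumption made for illustration: the names `viable_layer_designs` and `parse_design`, the width choices, the "widths never increase" rule, and the `"128-64-32"` string encoding. Nothing is taken from the ClearML or Optuna APIs. It enumerates every allowed stack of layer widths up to a maximum depth, so the optimizer only has to pick one item from a list:

```python
from itertools import product


def viable_layer_designs(widths=(32, 64, 128), max_layers=3):
    """Enumerate layer stacks whose widths never increase with depth."""
    designs = []
    for depth in range(1, max_layers + 1):
        for combo in product(widths, repeat=depth):
            # Hypothetical viability rule: keep only "funnel" shapes.
            if all(a >= b for a, b in zip(combo, combo[1:])):
                designs.append("-".join(str(w) for w in combo))
    return designs


def parse_design(design):
    """Turn the string chosen by the optimizer back into a list of widths."""
    return [int(w) for w in design.split("-")]


designs = viable_layer_designs()
```

The resulting list could then be handed to the optimizer as a single discrete parameter, with `parse_design` applied inside the training script.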

2 months ago

0 Hi Everyone! I Use Clearml Pipelines And I Have Too Much Parameters In It So I Want To Use Configuration File. How Could I Connect Configuration File (Like `Task.Connect_Configuration_File`) But In Pipeline With Ui Interface?

I believe pipe.connect_configuration is what you're looking for?

one year ago

0 Hello! I Created A

I think you’d have to run the cleanup service. That seems to be what controls deletion, based on archived status and some other temporal filters

one year ago

0 How Can I Control The

I tried mounting a config file (in the structure of the one on github but with just the relevant s3 section) into the agent-services container at /root/clearml.conf and after restarting the container, it seems to have had an impact. thank you!

When I inspect the console of the task I'm trying to run, I see there's a call to cp /tmp/clearml.conf ~/default_clearml.conf in the docker command and that the volume /tmp/clearml.conf is picked up from the host at some custom-named file ...

one year ago

0 Is There Any Documentation From Clearml On Best Practices For Mounting/Using External Ebs Volumes For The Clearml Server? We Would Like To Mount An External Ebs Volume To The

@BattyCrocodile47 put together [link]

one year ago

0 I Am Still Going Through All The Docs And Intro Videos … But: Is The Only Way To Create A New Experiment To Run The Script That Contains The Experiment At Least Once? I Wonder About This B.C. Most Of What I Want To Run Are Quite Long Jobs, So Even Running

you can put task.execute_remotely() in the script to create it in draft mode. I've taken to configuring defaults to run things very quickly just in case I forget though (e.g. placeholder string for dataset, bail out early if not changed… or just do one epoch on a small subset of samples, etc).
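
A rough sketch of the "quick by default" pattern described above, in plain Python with no ClearML calls. The sentinel `PLACEHOLDER_DATASET`, the function `plan_run`, and the default values are hypothetical names chosen for illustration:

```python
PLACEHOLDER_DATASET = "CHANGE_ME"  # hypothetical sentinel default


def plan_run(dataset_name=PLACEHOLDER_DATASET, epochs=1, max_samples=100):
    """Decide what a run should do given its (possibly default) parameters."""
    if not dataset_name or dataset_name == PLACEHOLDER_DATASET:
        # Defaults untouched: bail out before any expensive work starts.
        return {"action": "skip", "reason": "dataset name was not overridden"}
    return {
        "action": "train",
        "dataset": dataset_name,
        "epochs": epochs,            # default: a single epoch
        "max_samples": max_samples,  # default: a small subset of samples
    }
```

With defaults like these, a first run that only exists to register the task finishes almost immediately, and real values are supplied when the draft is edited or cloned.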

one year ago

Yup if you scroll through the logs in the console, near the top (post config dump), you’ll see a git clone and checkout to the specific hash.

PS You can actually change this parameter in an experiment’s configuration if it is in draft mode.

one year ago

For reproducibility, it kind of makes sense though. The existence of the file is contingent on the worker cloning the source code. I'm sure things can be done to maintain state differently but I personally adapted to the git-based workflow for managing files pretty quickly.

though yes I will admit I had the same thought first: why must I run it each time?

Beware: squash merges will ruin the ability to reproduce the experiment at that time since the git commit will be lost (presuming th...

one year ago

0 Can Anyone Recommend A Good Workflow For

I'm guessing this is done through code-server?

I'm currently rolling a JupyterHub instance (multiuser, with codeserver inside) on the same machine as clearml-server. That’s where tasks are executed etc. so, all browser dev env.

It sounds like there’s an option to basically bypass this latter step and just use clearml’s credentialing to accomplish much the same thing? Am I understanding clearml-session correctly?

one year ago

0 I Just Encountered A Really Frightening Bug. Best I Can Explain What Happened Was This: Data Scientist Created New Venv, Installed Clearml==1.11.0 Instead Of Clearml[S3]==1.11.1, And Upon Re-Running A Pipeline From Cli, The Entire Project "Disappeared" (W

i think we may have found the frankenbug?

the argument to the dataset name was not being overridden correctly (mistyped), so the default value of an empty string (instead of a placeholder like "CHANGE_ME") in the parent task caused the dataset to basically get created with an empty name, and somehow that hid the whole project, despite hundreds of existing tasks in it.

and no way to un-hide it as far as I can tell?

one year ago

didn't disappear this time.

one year ago

0 Hello! I Created A

the clearml github, search for a file named cleanup service dot py (or something to that effect)

one year ago

0 Hi All

I ran into something similar during deployment. Hopefully this helps with your debugging: if the agent was launched separately from the rest of the stack, it may not have proper docker-DNS resolution to [link]. (e.g. if in the same docker-compose, perhaps you didn't add the backend network field, or if it was launched separately through docker run without an explicit external network defined)

if the agent's on the same machine, try docker network connect to add...

one year ago

you're basically asking to sample from a distribution where not all parameters are mutually independent.

the short answer is no: this is not directly supported. optuna needs each hyperparam to be independent, so it's up to you to handle the dependencies between parameters yourself unfortunately.

your solution of defining them independently and then using num_layers to potentially ignore other parameters is a valid one.
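
A minimal sketch of that workaround, assuming hypothetical parameter names (`num_layers`, `layer_size_1` … `layer_size_4`) and a hypothetical cap `MAX_LAYERS`. Every layer size is sampled independently, and the training code reads only as many as `num_layers` asks for:

```python
MAX_LAYERS = 4  # hypothetical upper bound on depth


def active_layer_sizes(params):
    """Return only the layer sizes that num_layers says are in use."""
    num_layers = params["num_layers"]
    if not 1 <= num_layers <= MAX_LAYERS:
        raise ValueError(f"num_layers must be in 1..{MAX_LAYERS}")
    # layer_size_1..layer_size_MAX_LAYERS are all sampled independently;
    # entries beyond num_layers are simply ignored.
    return [params[f"layer_size_{i}"] for i in range(1, num_layers + 1)]


sampled = {
    "num_layers": 2,
    "layer_size_1": 128,
    "layer_size_2": 64,
    "layer_size_3": 32,  # ignored because num_layers == 2
    "layer_size_4": 16,  # ignored because num_layers == 2
}
```

One caveat with this design: trials that differ only in the ignored parameters train identical models, so part of the search budget goes to duplicates.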

2 months ago

maybe an important note: I mounted the same cache directory for the agents.

one year ago

0 Hello

👀 following.
I have much the same issue, and it's mission-critical that I resolve it soon.

one year ago

0 How Can I Control The

thank you!
I'll add a volume mount to the services-agent container, and from what I understand that will become the template it uses?

is this the structure of the file?

or is it the "dot" syntax (like what shows up in the console when the task executes / your snippet)?

one year ago

probably, but the syntax would be that of a git diff, so it’d be a touch clunky if you ask me
Are you trying to avoid local development?

one year ago

0 Is There Somewhere I Can Track Upcoming Releases By Any Chance? Trying To Plan An Upgrade Of Our Services. Namely I'm Wondering If I Need To Continue Using My Own Forked Image Of

ah, thank you for the clarity. A quarterly release schedule makes sense, it's about what I've observed.
Let me know if I can be of any assistance in early testing!

5 months ago

0 Hi Guys, I'm Trying To Deploy An Image Segmentation Model, So I Expect That The Front-End Of The Endpoint Will Allow Users To Upload Images, Get Their Segmented Images & Option To Annotate The Images If The Results Are Not Good Enough. My Question Is: How

If you can hit the endpoint with curl, you for sure can hook it up to many frontend frameworks.

Personal recs: gradio, streamlit

Abstract the interaction into a function call, and wrap it all in some UI elements using python.
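
As a sketch of what "abstract the interaction into a function call" could look like, here is a standard-library version. The endpoint URL, the raw-bytes payload, and the JSON response are all assumptions; the real serving endpoint may expect a different encoding. The function names are hypothetical:

```python
import json
import urllib.request


def build_segmentation_request(image_bytes, endpoint_url):
    """Build (but do not send) a POST request carrying the raw image bytes."""
    return urllib.request.Request(
        endpoint_url,
        data=image_bytes,
        headers={"Content-Type": "application/octet-stream"},
        method="POST",
    )


def segment(image_bytes, endpoint_url):
    """Send the image to the endpoint and return the decoded JSON response."""
    request = build_segmentation_request(image_bytes, endpoint_url)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())
```

A UI framework such as gradio or streamlit would then only need to call `segment` from its upload handler.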

one year ago

and now it's gone.

one year ago

i will attempt to start that now.

one year ago

0 Hey, the api reference (https://clear.ml/docs/latest/docs/references/api/#request-format) says that the url should be `https://<base_url>/auth.login` but to make it actually work I have to do `https://<base_url>/api/v1.0/auth.login` Th

Weird. I recently implemented a function that talked to this exact endpoint and found it had to exclude the version and api paths. Is there some sort of redirect that happens?

11 months ago

0 How Would Y'all Approach Backing Up The Elastic-Search/Redis/Etc. Data In Self-Hosted Clearml? Any Drawbacks/Risks Of Doing A Simple Process That Periodically Zips Up The

Can vouch, this works well. Had my server hard reboot (maybe bc of clearml? maybe bc of hardware, maybe both… haven’t figured it out), and busy remote workers still managed to update the backend once it came back up.

Re: backups… what would happen if zipped while running but no work was being performed? Still an issue potentially?

and what happens if docker compose down is run while there’s work in the services queue? Will it be restored? What are the implications if a backup is perform...

one year ago

one note is that it happened after I tried deploying a set of workers to a new queue, which she tried to use to run the tasks in parallel instead of our default queue which is only serviced by one worker (a container i built)

one year ago

0 Does Anyone Have Experience With Integrating Clearml And Slurm? If So, What Pattern Did You Use? (Did You Submit Tasks And Just Use Clearml As Tracker, Or Did You Start Agents With Slurm?) Would Love To Hear From The Community Before Trying To Diy

but isn't that just the same as running the agent in daemon mode? that's what i was hoping James could do

2 months ago
