Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
BattyCrocodile47
Moderator
34 Questions, 145 Answers
  Active since 02 March 2023
  Last activity one month ago

Reputation

0

Badges 1

127 × Eureka!
0 Votes
0 Answers
952 Views
0 Votes 0 Answers 952 Views
I gave a demo of ClearML to our data engineering team (is also the ML infra team) and it went over really well! 🎉 We have two data science teams that we are...
one year ago
0 Votes
9 Answers
1K Views
0 Votes 9 Answers 1K Views
Security question: in my journey of running ClearML the "hard way" (self-hosted), one problem I haven't solved is security. Some discussion here...
one year ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
Hey friends, how do you configure ClearML to use an S3 bucket? Specifically: does every data scientist have to have hard-coded AWS credentials with read/writ...
one year ago
0 Votes
7 Answers
924 Views
0 Votes 7 Answers 924 Views
Hi friends, we got on a sales call with ClearML yesterday and a discussion about webhooks came up. ClearML seems to not natively implement webhooks It seems ...
one year ago
0 Votes
1 Answers
901 Views
0 Votes 1 Answers 901 Views
Is there a command line interface that lets you query and download models from the ClearML model registry the way you can with MLFlow? Example: # search for ...
one year ago
0 Votes
9 Answers
114 Views
0 Votes 9 Answers 114 Views
one month ago
0 Votes
5 Answers
887 Views
0 Votes 5 Answers 887 Views
ClearML tracks the executed .py file, so you can re-run the experiment. But... - With Metaflow DAGs, you pass a run argument to your training script, e.g. py...
one year ago
0 Votes
0 Answers
656 Views
0 Votes 0 Answers 656 Views
The ClearML Session VS Code extension is live! 🎉 🎉 If you start a ClearML session using the clearml-session CLI tool, this will display it and help you con...
one year ago
0 Votes
2 Answers
979 Views
0 Votes 2 Answers 979 Views
AWS autoscale question: can the autoscaler use the IAM role of the EC2 instance it’s running on rather than needing to be provided AWS keys?
one year ago
0 Votes
5 Answers
871 Views
0 Votes 5 Answers 871 Views
Whelp. Here's our hackathon demo submission for a ClearML VS Code extension @<1523701205467926528:profile|AgitatedDove14> and @<1523701087100473344:profile|S...
one year ago
0 Votes
2 Answers
982 Views
0 Votes 2 Answers 982 Views
Well, we accidentally leaked some super powerful credentials today. Is there a way to not have the UI or console logs show our AWS access keys, git credentia...
one year ago
0 Votes
19 Answers
865 Views
0 Votes 19 Answers 865 Views
Hey
Hey @<1523701205467926528:profile|AgitatedDove14> , I saw this SO answer you gave about ClearML's docker-compose.yaml . You described getting a secret key pa...
one year ago
0 Votes
10 Answers
1K Views
0 Votes 10 Answers 1K Views
I’m working on an automated deployment of ClearML with IaC. I’ve got a script to start an EC2 instance that runs the docker compose file. Separately, I’ve go...
one year ago
0 Votes
11 Answers
1K Views
0 Votes 11 Answers 1K Views
More of pushing ClearML to it's data engineering limits 😅 . Could you use ClearML in a event-driven system? That would be so sick! I'm wondering if we could...
one year ago
0 Votes
2 Answers
1K Views
0 Votes 2 Answers 1K Views
Does clearML have a good story for offline/batch inference in production? I worked in the Airflow world for 2 years and these are the general features we use...
one year ago
0 Votes
7 Answers
690 Views
0 Votes 7 Answers 690 Views
Working on the VS Code extension. Pretty stumped on this one...
one year ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
Can you help me make the case for ClearML pipelines/tasks vs Metaflow? Context within...
one year ago
0 Votes
21 Answers
817 Views
0 Votes 21 Answers 817 Views
Crazy idea: what if ClearML had a VS Code extension? It could help you start and join ClearML sessions! It could use your local ~/clearml.conf file for read ...
one year ago
0 Votes
0 Answers
62 Views
0 Votes 0 Answers 62 Views
Is anyone from ClearML going to be in Austin for GenAI / MLOps world?? https://www.linkedin.com/posts/eric-riddoch_5th-ann-mlops-world-and-generative-ai-wor[...
one month ago
0 Votes
18 Answers
903 Views
0 Votes 18 Answers 903 Views
Can anyone recommend a good workflow for clearml-session ? I like to code in the desktop app of VS Code. Right now, I - run clearml-session ... to start a se...
one year ago
0 Votes
13 Answers
1K Views
0 Votes 13 Answers 1K Views
one year ago
0 Votes
2 Answers
1K Views
0 Votes 2 Answers 1K Views
one year ago
0 Votes
3 Answers
981 Views
0 Votes 3 Answers 981 Views
one year ago
0 Votes
4 Answers
995 Views
0 Votes 4 Answers 995 Views
Hi,
Hi, @<1565509803839590400:profile|MoodyBear54> and I are trying to deploy the ClearML autoscaler to deploy the EC2 instances into a pre-existing VPC/subnet. ...
one year ago
0 Votes
3 Answers
1K Views
0 Votes 3 Answers 1K Views
one year ago
0 Votes
1 Answers
1K Views
0 Votes 1 Answers 1K Views
You guys, thank you so much. I have super positive things to say about ClearML and I'm very excited to try it out at work (still selling it) https://www.link...
one year ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
I'm getting some weird ClearML behavior. I've deployed it to an EC2 instance. When I access :8080 , I see the full ClaerML app as expected (first screenshot)...
one year ago
0 Votes
20 Answers
1K Views
0 Votes 20 Answers 1K Views
How would ya'll approach backing up the elastic-search/redis/etc. data in self-hosted ClearML? Any drawbacks/risks of doing a simple process that periodicall...
one year ago
0 Votes
6 Answers
738 Views
0 Votes 6 Answers 738 Views
Sorry for always posting such cryptic problems. I managed to create a docker-compose file that runs ClearML and it sets up 2 agents, one on the services queu...
one year ago
0 Votes
4 Answers
724 Views
0 Votes 4 Answers 724 Views
Hey
Hey @<1523701205467926528:profile|AgitatedDove14> ! Don't know if you're up but we're working on the VS Code extension at the hackathon rn!
one year ago
Show more results questions
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

So here's a snippet from my aws_autoscaler.yaml file

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

So I get output with this one, but the console only shows me the output from my machine. For example, the SSH key is present, and whoami results in ericriddoch

one year ago
one year ago
0 Whelp. Here'S Our Hackathon Demo Submission For A Clearml Vs Code Extension

How it works / what we finished:

  • We used the SaaS ClearML, started an EC2 instance, and manually installed and ran the clearml-agent daemon on it
  • We ran clearml-init on our laptops to generate the clearml.conf file.
  • The extension is in TypeScript, so...
  • We started trying to write code with the Python SDK to list sessions, but realized calling that from the extension would be hard, so we opted to have the TypeScript code make calls to the ClearML API server directly, e.g. ...
one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Haha, that was a total gotcha for me. Yeah, a lot just wasn't even getting run due to the #!/bin/bash part.

Anyway, wow! I finally got the precious console logs you thought to find, here they are:

2023-05-06 00:19:21
User aborted: stopping task (3)
2023-05-06 00:19:21
Successfully installed PyYAML-6.0 attrs-22.2.0 certifi-2022.12.7 charset-normalizer-3.1.0 clearml-agent-1.5.2 distlib-0.3.6 filelock-3.12.0 furl-2.1.3 idna-3.4 jsonschema-4.17.3 orderedmultidict-1.0.1 pathlib2-2.3.7....
one year ago
0 Hi Friends, We Got On A Sales Call With Clearml Yesterday And A Discussion About Webhooks Came Up.

It seems you have a specific workflow in mind, but I'm not sure I follow it. Can you give a specific example ?

Absolutely. So, let's say a DS tags a model in ClearML with "release candidate". It'd be great to have that trigger a number of processes, each with their own retry logic:

  • A fairness/bias evaluation, potentially as a task in ClearML itself. This would load the model and run some sample datasets through it. The
  • Pipeline to prepare for deployment. Trigger a GitHub Actions ...
one year ago
0 Hi Friends, We Got On A Sales Call With Clearml Yesterday And A Discussion About Webhooks Came Up.

Thanks for the response @<1523701205467926528:profile|AgitatedDove14> !

What would you consider an event?

I was thinking of the TriggerScheduler 's definition of an event. Pretty much, any thing the TriggerSchedule allows you to react to, it'd be great to be able to publish those events to a queue external to ClearML, e.g. a tag added to a model (or removed), a state in a task changing, etc. We'd want as much metadata about that event as possible. So if the event is due to a task...

one year ago
0 Hi Friends, We Got On A Sales Call With Clearml Yesterday And A Discussion About Webhooks Came Up.

I could imagine other useful automations for reacting to failed tasks that have certain tags, including alerting.

I realize we could move a lot of this logic into ClearML itself: make handler functions that run within the services queue. That would work for logic that is implemented in Python. But I believe it would be harder for our team to detect and respond to failures in the event handler functions if they were placed there because it seems unclear how we could use our existing systems a...

one year ago
0 Hi Friends, We Got On A Sales Call With Clearml Yesterday And A Discussion About Webhooks Came Up.

Hi. Yes that totally makes sense. It’s just that we don’t want the logic that does the Jenkins trigger to be in a ClearML handler or task, but rather as a handler that acts as a subscriber in a pub-sub system.

This is because we have a pub-sub architecture that we already use, it can handle retries, etc. also we will likely want multiple systems to react to notifications in the pub sub system. We already have a lot of setup for this.

I guess the conclusion is: I realize it’s possible...

one year ago
0 Clearml Tracks The Executed

Oh duh, thanks. What about non standard entrypoints (as opposed to arguments) like accelerate launch train.py ?

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

cc: @<1565509803839590400:profile|MoodyBear54>

one year ago
0 Can You Help Me Make The Case For Clearml Pipelines/Tasks Vs Metaflow? Context Within...

Thanks for replying Martin! (as always)

Do you think ClearML is a strong option for running event-based training and batch inference jobs in production? That’d include monitoring and alerting. I’m afraid that Metaflow will look far more compelling to our teams for that reason.

Since it deploys onto step functions, the scheduling is managed for you and I believe alerts for failing jobs can be set up without adding custom code to every pipeline.

If that’s the case, then we’d probably only...

one year ago
0 Clearml Tracks The Executed

Hi! Yes, it just logs the .py file.

one year ago
0 Whelp. Here'S Our Hackathon Demo Submission For A Clearml Vs Code Extension

Thank you! For now, it's kind of nice that it just picks up your credentials from your conf file. No extra setup required beyond the onboarding ClearML has you do 😄

And look! It's working, assuming you start the clearml session up yourself:

one year ago
0 Whelp. Here'S Our Hackathon Demo Submission For A Clearml Vs Code Extension

Here's the repo: I've recorded a few update videos documenting how we learned about authoring VS Code extensions and how we got it to it's current state. Linked to those in order in the README.

ChatGPT has made working with TypeScript and the VSCode extension framework really nice! None

one year ago
0 Hey

Oh I wasn’t aware of that. I don’t think it’d work for this use case though. We’re trying to test the behavior you can see here in this extension https://share.descript.com/view/g0SLQTN6kAk so basically the examples I said in that earlier message

one year ago
0 On A Related Line But More Complicated: How Can We Ask The Autoscaler To Queue, Say, N Jobs On An N-Gpu Machine, Please? For Example, On Aws, Nvidia A100 Gpus Are Only Available On Instances With 8X A100, Which Is Overkill For A Single-Gpu Job, So Might A

My understanding may be bad. Say I have a single EC2 instance. Is that instance only able to handle one task at a time?

Or can I start multiple instances of the clearml-agent process on it and then have one task per agent?

And if that's the case, can we have multiple agents on the EC2 instance listening to the same queue, e.g. default . Or would this only work if they were listening to different queues?

one year ago
0 More Of Pushing Clearml To It'S Data Engineering Limits

The dark theme you have

It's this chrome extension ! I forget it's even on sometimes. It gives you a keyboard shortcut to toggle dark mode on any website. I love it.

Success! Wow, so this means I can use ClearML training/inference pipelines as part of AWS StepFunctions!

My plan is to have a AWS Step Functions state machine (DAG) that treats running a ClearML job as one step (task) in t...

one year ago
0 Hey

Oh interesting. Is the hope that doing that would somehow result in being able to use those credentials to make authenticated API calls?

one year ago
0 Hey

Aren't they two different auth systems? One for humans and one for machines?

one year ago
0 Hi Community! I'M Facing An Issue With A Self-Hosted Clearml Server. I Modified The Docker-Compose File So To Have All The Volumes Mounted In A Specific Location (

Here's a docker-compose I've been playing with. It doesn't have the same restart problem you're describing, but I did change the volume mounts: None

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I don't see it as an argument in Task.init or Task.execute_remotely

one year ago
0 I’M

I SOLVED IT, NO NEED TO READ FURTHER 😄

I'm a chump and didn't read the docs: None


Oh, I think I got overexcited and didn't look at this closely. So this ACCESS/SECRET key pair is on the agent-services container.

I can see that agent-services is simply a container running `clearml-agent daemon --queue ser...

one year ago
0 I’M

I did a quick local experiment and observed that credentials created from the UI indeed become invalid if you delete the ClearML volumes.

  • starting docker-compose locally
  • creating a set of credentials from the UI
  • hardcodign those credentials into the docker-compose file
  • restarting
  • the agent-services container started up and successfully became a registered worker
  • I killed the docker-compose and deleted the volume folders
  • restarted the docker-compose (with the same hard-coded...
one year ago
0 I’M

Oh my word, is this it? None

Can you set these to any strings and have them function as API keys?

one year ago
Show more results compactanswers