BattyCrocodile47
Moderator
32 Questions, 142 Answers
  Active since 02 March 2023
  Last activity 8 months ago

Reputation

0

Badges 1

124 × Eureka!
0 Votes 18 Answers 626 Views
Can anyone recommend a good workflow for clearml-session? I like to code in the desktop app of VS Code. Right now, I - run clearml-session ... to start a se...
11 months ago
0 Votes 5 Answers 561 Views
Whelp. Here's our hackathon demo submission for a ClearML VS Code extension @<1523701205467926528:profile|AgitatedDove14> and @<1523701087100473344:profile|S...
8 months ago
0 Votes 0 Answers 549 Views
If anyone wants to join remotely, there’s a remote-first AI/ML hackathon happening tomorrow. Some of the details are out of date right now—I’ll have this pag...
8 months ago
0 Votes 0 Answers 396 Views
The ClearML Session VS Code extension is live! 🎉 🎉 If you start a ClearML session using the clearml-session CLI tool, this will display it and help you con...
8 months ago
0 Votes 13 Answers 854 Views
one year ago
0 Votes 3 Answers 871 Views
If I want to run tensorflow (version 2.10.0 with Python 3.8) with the AWS autoscaler, which AMI and Docker base image should I choose?
one year ago
0 Votes 2 Answers 719 Views
AWS autoscale question: can the autoscaler use the IAM role of the EC2 instance it’s running on rather than needing to be provided AWS keys?
one year ago
0 Votes 2 Answers 859 Views
one year ago
0 Votes 2 Answers 721 Views
Well, we accidentally leaked some super powerful credentials today. Is there a way to not have the UI or console logs show our AWS access keys, git credentia...
one year ago
0 Votes 5 Answers 604 Views
ClearML tracks the executed .py file, so you can re-run the experiment. But... - With Metaflow DAGs, you pass a run argument to your training script, e.g. py...
11 months ago
0 Votes 7 Answers 654 Views
Hi friends, we got on a sales call with ClearML yesterday and a discussion about webhooks came up. ClearML seems to not natively implement webhooks. It seems ...
one year ago
0 Votes 2 Answers 779 Views
Does ClearML have a good story for offline/batch inference in production? I worked in the Airflow world for 2 years and these are the general features we use...
one year ago
0 Votes 4 Answers 458 Views
Hey @<1523701205467926528:profile|AgitatedDove14>! Don't know if you're up but we're working on the VS Code extension at the hackathon rn!
8 months ago
0 Votes 6 Answers 726 Views
Another AWS autoscaler question. The docker-compose.yml automatically adds a ClearML agent to the services queue. When I run python aws_autoscaler.py --remot...
one year ago
0 Votes 0 Answers 709 Views
I gave a demo of ClearML to our data engineering team (which is also the ML infra team) and it went over really well! 🎉 We have two data science teams that we are...
one year ago
0 Votes 6 Answers 461 Views
Sorry for always posting such cryptic problems. I managed to create a docker-compose file that runs ClearML and it sets up 2 agents, one on the services queu...
8 months ago
0 Votes 4 Answers 736 Views
Hi, @<1565509803839590400:profile|MoodyBear54> and I are trying to deploy the ClearML autoscaler to deploy the EC2 instances into a pre-existing VPC/subnet. ...
one year ago
0 Votes 7 Answers 1K Views
Can you help me make the case for ClearML pipelines/tasks vs Metaflow? Context within...
one year ago
0 Votes 11 Answers 907 Views
More of pushing ClearML to its data engineering limits 😅. Could you use ClearML in an event-driven system? That would be so sick! I'm wondering if we could...
one year ago
0 Votes 1 Answers 639 Views
Is there a command line interface that lets you query and download models from the ClearML model registry the way you can with MLFlow? Example: # search for ...
one year ago
0 Votes 5 Answers 884 Views
I'm getting some weird ClearML behavior. I've deployed it to an EC2 instance. When I access :8080, I see the full ClearML app as expected (first screenshot)...
one year ago
0 Votes 10 Answers 780 Views
I’m working on an automated deployment of ClearML with IaC. I’ve got a script to start an EC2 instance that runs the docker compose file. Separately, I’ve go...
one year ago
0 Votes 3 Answers 828 Views
one year ago
0 Votes 3 Answers 751 Views
one year ago
0 Votes 21 Answers 528 Views
Crazy idea: what if ClearML had a VS Code extension? It could help you start and join ClearML sessions! It could use your local ~/clearml.conf file for read ...
9 months ago
0 Votes 9 Answers 826 Views
Security question: in my journey of running ClearML the "hard way" (self-hosted), one problem I haven't solved is security. Some discussion here...
one year ago
0 Votes 7 Answers 942 Views
Hey friends, how do you configure ClearML to use an S3 bucket? Specifically: does every data scientist have to have hard-coded AWS credentials with read/writ...
one year ago
0 Votes 19 Answers 525 Views
Hey @<1523701205467926528:profile|AgitatedDove14>, I saw this SO answer you gave about ClearML's docker-compose.yaml. You described getting a secret key pa...
8 months ago
0 Votes 34 Answers 39K Views
My autoscaled instance fails when running "git clone" on a private repo. I do have the SSH key placed at /root/.ssh/id_rsa on the machine, and when I SSH int...
one year ago
0 Votes 20 Answers 738 Views
How would y'all approach backing up the elastic-search/redis/etc. data in self-hosted ClearML? Any drawbacks/risks of doing a simple process that periodicall...
12 months ago
0 I’M

The question I'm exploring remains: is it possible to acquire that initial set of ClearML API keys programmatically so that the manual steps of 1-4 above can be avoided for an initial deployment?

one year ago
0 I’M

^^^ For my own notes: this is the web request made by the frontend to create a set of credentials

one year ago
0 Does Clearml Have A Good Story For Offline/Batch Inference In Production? I Worked In The Airflow World For 2 Years And These Are The General Features We Used To Accomplish This. Are These Possible With Clearml?

This is totally what I was looking for! Yeah, by "good story for offline batch" I meant, "good feature support for ..."

I bookmarked this comment. I think I'll be doing a POC trying to show this functionality within the next month.

one year ago
0 More Of Pushing Clearml To It'S Data Engineering Limits

If this works, we might be able to fully replace Metaflow with ClearML!

(Referring to the feature where Metaflow creates Step Functions state machines for you, and then you can use those to trigger event-driven batch jobs in the same way described here)

one year ago
0 Hey Friends, How Do You Configure Clearml To Use An S3 Bucket? Specifically: Does

Thanks Vasil! Can you elaborate on what you mean by using boto3? Do you mean writing a script using boto that pulls the credentials down and writes to the user's clearml.conf?

Also, I've been seeing references to "credentials vault" in the docs. I can see this is the problem that it solves.

one year ago
0 Security Question: In My Journey Of Running Clearml The "Hard Way" (Self-Hosted), One Problem I Haven'T Solved Is Security. Some Discussion Here...

Is there a way we can protect a ClearML deployment with a load balancer or API Gateway that is exposed to the whole world, but is protected by authentication so that only authorized clients can get in?

one year ago
0 Hey

I could potentially write a selenium script to make a set of keys, but I'd prefer to avoid that 😅

8 months ago
0 Hey

Oh! System tags! That would definitely have been a better way to do it. We ended up querying for tasks in the "DevOps" project with the name "Interactive Session"
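
A rough sketch of that name-based lookup, assuming the clearml Python package and its Task.get_tasks classmethod. The helper name find_interactive_sessions is hypothetical, and the lookup callable is passed in so the function can be tried without a ClearML server. The project and task names are the ones mentioned above.

```python
def find_interactive_sessions(get_tasks,
                              project_name="DevOps",
                              task_name="Interactive Session"):
    """Return the tasks found by project and task name.

    `get_tasks` is expected to behave like clearml's `Task.get_tasks`,
    which accepts `project_name` and `task_name` keyword arguments.
    """
    return get_tasks(project_name=project_name, task_name=task_name)


# Usage against a real server (assumes clearml is installed and
# ~/clearml.conf holds valid credentials):
#
#   from clearml import Task
#   sessions = find_interactive_sessions(Task.get_tasks)
#   for task in sessions:
#       print(task.id, task.name)
```

Matching on names is fragile if someone renames the project or the task, which is why a tag-based filter would likely be the sturdier choice.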

8 months ago
0 Can Anyone Recommend A Good Workflow For

Oh my goodness. Thank you! I'd seen that before, but for some reason it didn't register I could run that with VS Code...

But this config should almost never need to change!

Host clearml-session
    HostName localhost
    User root
    Port 8022
11 months ago
one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

So, we've been able to run sudo su and then git clone with our private repos a few times now

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I can't think of any changes we might have made on our side to cause that 🤔

one year ago
0 More Of Pushing Clearml To It'S Data Engineering Limits

possibly cheaper on the cloud (Lambda vs EC2 instance)

Whoa, are you saying there's an autoscaler that doesn't use EC2 instances? I may be misunderstanding, but that would be very cool.

Maybe I should have said: my plan is to use AWS Step Functions where a single task in the DAG is an entire ClearML pipeline. The non-ClearML steps would orchestrate putting messages into a queue, doing retry logic, and triggering said pipeline.

I think at some point, there has to be some amount of...

one year ago
0 Clearml Tracks The Executed

Oh duh, thanks. What about non-standard entrypoints (as opposed to arguments) like accelerate launch train.py?

11 months ago
0 I’M

That could work! Is that an option? Something that lets me spin up the ClearML and get a services worker to connect to it without manual steps.

one year ago
0 Working On The Vs Code Extension. Pretty Stumped On This One...

While I'm wishing for things: it'd be awesome if it had a queue already set up. But if there's not a way to do that in the docker compose file, I could potentially write a script that uses the creds to create one using API calls

8 months ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I
configurations:
  extra_clearml_conf: ""
  extra_trains_conf: ""
  extra_vm_bash_script: |
    aws ssm get-parameter --region us-west-2 --name /clearml/github_ssh_private_key --with-decryption --query Parameter.Value --output text > ~/.ssh/id_rsa && chmod 600 ~/.ssh/id_rsa
    source /clearml_agent_venv/bin/activate

hyper_params:
  iam_arn: arn:aws:iam::<my account id>:instance-profile/clearml-2-AutoscaledInstanceProfileAutoScaledEC2InstanceProfile56A5348F-90fmf6H5OUBx
one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

It's an Amazon Linux AMI with the AWS CLI pre-installed on it. It uses the AWS CLI to fetch the key from AWS SSM Parameter Store. It's granted read access to that SSM Parameter via the instance role.
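
For comparison, here is a minimal Python sketch of the same fetch, assuming boto3 is available on the instance and the instance role grants read access to the parameter. The function name write_ssh_key_from_ssm is hypothetical. The parameter name and region in the usage comment are the ones from the autoscaler config earlier in this thread.

```python
import os
from pathlib import Path


def write_ssh_key_from_ssm(ssm_client, parameter_name, dest):
    """Fetch a decrypted SSM parameter and write it to `dest` with 0600 perms.

    `ssm_client` is expected to behave like `boto3.client("ssm")`.
    """
    response = ssm_client.get_parameter(Name=parameter_name, WithDecryption=True)
    key_material = response["Parameter"]["Value"]

    dest = Path(dest).expanduser()
    # Create ~/.ssh if it does not exist yet.
    dest.parent.mkdir(mode=0o700, parents=True, exist_ok=True)
    dest.write_text(key_material)
    # SSH refuses private keys that are readable by other users.
    os.chmod(dest, 0o600)
    return dest


# Usage on the instance (assumes boto3 is installed):
#
#   import boto3
#   client = boto3.client("ssm", region_name="us-west-2")
#   write_ssh_key_from_ssm(client, "/clearml/github_ssh_private_key", "~/.ssh/id_rsa")
```

Passing the client in keeps the function easy to exercise with a stub. It does the same thing as the CLI one-liner, so it would not by itself change the git clone behavior.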

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I'm not seeing an extra_docker_shell_script in my clearml.conf generated by clearml-agent init like in this guide

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I don't see it as an argument in Task.init or Task.execute_remotely

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Actually, dumb question: how do I set the setup script for a task?

one year ago
0 More Of Pushing Clearml To It'S Data Engineering Limits

Man, I owe you lunch sometime @<1523701205467926528:profile|AgitatedDove14>. Thanks for being so detailed in your answers.

Okay! So the pipeline ID is really just a task ID. So cool!

Not sure I fully understand what you mean here...

Sorry, I'll try again. Here's an illustrated example with AWS Step Functions (pretend this is a ClearML pipeline). If the pipeline fails, I'd want to have a chance to do some logic to react to that. Maybe in a step called "on_pipeline_failed" or someth...

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I do agree with your earlier observation that the target of that mount seems wrong. I would think that the volume mount should be -v /root/.ssh:/root/.ssh but instead it's -v /root.ssh:/.ssh

one year ago
0 Crazy Idea:

I took a look

  • I think the Outerbounds extension (the one in my screenshot) is currently closed source. That makes sense to me. A bit sad because it is highly similar.
  • Another example could be the AWS Toolkit extension. But sadly, it's hardly a "minimal example". I was thinking it's relevant because it uses your local ~/.aws/ folder, which is similar to what we'd want to do.
9 months ago
0 Hi Friends, We Got On A Sales Call With Clearml Yesterday And A Discussion About Webhooks Came Up.

Hi. Yes that totally makes sense. It’s just that we don’t want the logic that does the Jenkins trigger to be in a ClearML handler or task, but rather as a handler that acts as a subscriber in a pub-sub system.

This is because we already use a pub-sub architecture that can handle retries, etc. We will also likely want multiple systems to react to notifications in the pub-sub system. We already have a lot of setup for this.

I guess the conclusion is: I realize it’s possible...

one year ago
0 Hi Friends, We Got On A Sales Call With Clearml Yesterday And A Discussion About Webhooks Came Up.

It seems you have a specific workflow in mind, but I'm not sure I follow it. Can you give a specific example?

Absolutely. So, let's say a DS tags a model in ClearML with "release candidate". It'd be great to have that trigger a number of processes, each with their own retry logic:

  • A fairness/bias evaluation, potentially as a task in ClearML itself. This would load the model and run some sample datasets through it. The
  • Pipeline to prepare for deployment. Trigger a GitHub Actions ...
one year ago
0 Hi Friends, We Got On A Sales Call With Clearml Yesterday And A Discussion About Webhooks Came Up.

I could imagine other useful automations for reacting to failed tasks that have certain tags, including alerting.

I realize we could move a lot of this logic into ClearML itself: make handler functions that run within the services queue. That would work for logic that is implemented in Python. But I believe it would be harder for our team to detect and respond to failures in the event handler functions if they were placed there because it seems unclear how we could use our existing systems a...

one year ago
0 Crazy Idea:

Duh! I bet VS Code's Python extensions like the VS Code Black Extension would be a really good starting place. They are small and are wrappers around a Python CLI tool. I bet there's a lot we could adapt for the ClearML CLI

9 months ago
0 Crazy Idea:

Yeah, I believe all VS Code Extensions are in TypeScript. My main point was that this is an example of a VS Code extension that executes a Python CLI.

9 months ago
0 Crazy Idea:

Playing around this weekend to learn the ins and outs of extensions

9 months ago