BattyCrocodile47
Moderator
34 Questions, 145 Answers
  Active since 02 March 2023
  Last activity one month ago

Reputation

0

Badges 1

127 × Eureka!
0 Votes 1 Answers 1K Views
You guys, thank you so much. I have super positive things to say about ClearML and I'm very excited to try it out at work (still selling it) https://www.link...
one year ago
0 Votes 0 Answers 952 Views
I gave a demo of ClearML to our data engineering team (is also the ML infra team) and it went over really well! 🎉 We have two data science teams that we are...
one year ago
0 Votes 6 Answers 737 Views
Sorry for always posting such cryptic problems. I managed to create a docker-compose file that runs ClearML and it sets up 2 agents, one on the services queu...
one year ago
0 Votes 5 Answers 1K Views
I'm getting some weird ClearML behavior. I've deployed it to an EC2 instance. When I access :8080, I see the full ClearML app as expected (first screenshot)...
one year ago
0 Votes 18 Answers 901 Views
Can anyone recommend a good workflow for clearml-session ? I like to code in the desktop app of VS Code. Right now, I - run clearml-session ... to start a se...
one year ago
0 Votes 6 Answers 1K Views
Another AWS autoscaler question. The docker-compose.yml automatically adds a ClearML agent to the services queue. When I run python aws_autoscaler.py --remot...
one year ago
0 Votes 9 Answers 1K Views
Security question: in my journey of running ClearML the "hard way" (self-hosted), one problem I haven't solved is security. Some discussion here...
one year ago
0 Votes 3 Answers 1K Views
one year ago
0 Votes 7 Answers 1K Views
Hey friends, how do you configure ClearML to use an S3 bucket? Specifically: does every data scientist have to have hard-coded AWS credentials with read/writ...
one year ago
0 Votes 7 Answers 924 Views
Hi friends, we got on a sales call with ClearML yesterday and a discussion about webhooks came up. ClearML seems to not natively implement webhooks It seems ...
one year ago
0 Votes 1 Answers 901 Views
Is there a command line interface that lets you query and download models from the ClearML model registry the way you can with MLFlow? Example: # search for ...
one year ago
0 Votes 9 Answers 114 Views
one month ago
0 Votes 20 Answers 1K Views
How would y'all approach backing up the elastic-search/redis/etc. data in self-hosted ClearML? Any drawbacks/risks of doing a simple process that periodicall...
one year ago
0 Votes 5 Answers 887 Views
ClearML tracks the executed .py file, so you can re-run the experiment. But... - With Metaflow DAGs, you pass a run argument to your training script, e.g. py...
one year ago
0 Votes 0 Answers 656 Views
The ClearML Session VS Code extension is live! 🎉 🎉 If you start a ClearML session using the clearml-session CLI tool, this will display it and help you con...
one year ago
0 Votes 3 Answers 1K Views
If I want to run tensorflow (version 2.10.0 with Python 3.8) with the AWS autoscaler, which AMI and Docker base image should I choose?
one year ago
0 Votes 2 Answers 979 Views
AWS autoscale question: can the autoscaler use the IAM role of the EC2 instance it’s running on rather than needing to be provided AWS keys?
one year ago
0 Votes 0 Answers 840 Views
If anyone wants to join remotely, there’s a remote-first AI/ML hackathon happening tomorrow. Some of the details are out of date right now—I’ll have this pag...
one year ago
0 Votes 5 Answers 871 Views
Whelp. Here's our hackathon demo submission for a ClearML VS Code extension @<1523701205467926528:profile|AgitatedDove14> and @<1523701087100473344:profile|S...
one year ago
0 Votes 2 Answers 982 Views
Well, we accidentally leaked some super powerful credentials today. Is there a way to not have the UI or console logs show our AWS access keys, git credentia...
one year ago
0 Votes 19 Answers 865 Views
Hey @<1523701205467926528:profile|AgitatedDove14> , I saw this SO answer you gave about ClearML's docker-compose.yaml . You described getting a secret key pa...
one year ago
0 Votes 10 Answers 1K Views
I’m working on an automated deployment of ClearML with IaC. I’ve got a script to start an EC2 instance that runs the docker compose file. Separately, I’ve go...
one year ago
0 Votes 11 Answers 1K Views
More of pushing ClearML to its data engineering limits 😅. Could you use ClearML in an event-driven system? That would be so sick! I'm wondering if we could...
one year ago
0 Votes 2 Answers 1K Views
Does ClearML have a good story for offline/batch inference in production? I worked in the Airflow world for 2 years and these are the general features we use...
one year ago
0 Votes 4 Answers 994 Views
Hi, @<1565509803839590400:profile|MoodyBear54> and I are trying to deploy the ClearML autoscaler to deploy the EC2 instances into a pre-existing VPC/subnet. ...
one year ago
0 Votes 7 Answers 690 Views
Working on the VS Code extension. Pretty stumped on this one...
one year ago
0 Votes 7 Answers 1K Views
Can you help me make the case for ClearML pipelines/tasks vs Metaflow? Context within...
one year ago
0 Votes 4 Answers 723 Views
Hey @<1523701205467926528:profile|AgitatedDove14> ! Don't know if you're up but we're working on the VS Code extension at the hackathon rn!
one year ago
0 Votes 13 Answers 1K Views
one year ago
0 Votes 2 Answers 1K Views
one year ago
0 Security Question: In My Journey Of Running ClearML The "Hard Way" (Self-Hosted), One Problem I Haven't Solved Is Security. Some Discussion Here...

If the load balancer or API Gateway can do the computation and leverage caching, we’re much safer against DDoS attacks. In general, I’d prefer not to have our EC2 instance directly exposed to the public Internet.

one year ago
0 Security Question: In My Journey Of Running ClearML The "Hard Way" (Self-Hosted), One Problem I Haven't Solved Is Security. Some Discussion Here...

I'm imagining:

  • The EC2 instance would be in a private subnet, accessible only on the VPN (read: VPC)
  • The API Gateway and Load Balancer would also be on the VPC and therefore have access to the private subnet BUT the API Gateway or Load Balancer themselves would be exposed to the public internet.
    That way, to do the JWT authentication, the load balancer or API Gateway could reach out to the EC2 instance on the private network to authenticate any incoming ClearML SDK requests.
one year ago
0 Whelp. Here's Our Hackathon Demo Submission For A ClearML VS Code Extension

This is a low-key open-source project if anyone wanted to contribute. Since the project is early, there are lots of high-impact things, e.g. UI polish, that would be relatively low effort 😄

one year ago
0 Hey

Is there some way we could programmatically list all current ClearML sessions?

We need a way to do that, maybe with the clearml-session CLI in order to populate the VS Code extension menu.

one year ago
0 Can You Help Me Make The Case For ClearML Pipelines/Tasks Vs Metaflow? Context Within...

For these functions, Metaflow offers:

  • Triggering: integration with AWS EventBridge. It's really easy to use Boto3 and AWS access keys to emit events for Metaflow DAGs. It's nice not to have to worry about networking for this.
  • Scheduling: The fact that Metaflow uses Step Functions is reassuring.
  • Observability: this lovely flame graph where you can view the logs and duration of each step in the DAG. It's easy to view all the DAG runs including the ones that have failed. Ideally, we w...
one year ago
0 Hey

Oh wow. If this works, that will be insanely cool. Like, I guess what I'm going for is that if I specify "username: test" and "password: test" in that file, that I can specify "api.access_key: test" and "api.secret_key: test" in the clearml.conf used for CI. I'll give it a try tonight!

one year ago
0 Another AWS Autoscaler Question. The

Sorry, clarifying:

The agent-services entry in the docker-compose file seems to add a single worker to the services queue

one year ago
0 Hi Team! Is There A Way To Make ClearML’s AWS Autoscaler And Queues Resource-Aware Please? I.E. If We Can Say, As We Enqueue Our Job, How Much RAM Or GPU-RAM Or Even GPUs It Needs, Have The Scheduler/Autoscaler Dispatch The Job To Instances That Are Of Th

I'll try to describe the scenario I was thinking would cause ClearML to break down:

Assume:

  • We've got a queue called streaming
  • We've got an S3 bucket with images landing inside
  • When the images land, they go into a queue
  • When there are 100 images in the queue, we trigger a ClearML pipeline to ingest, transform, run inference on the batch, and then write the results somewhere
  • Let's say we get 1,000,000 images in the Bucket per hour. That might be 1,000,000 / 100 = 10,000 batches. ...
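
The arithmetic in this scenario can be sketched as follows. The numbers come from the post; the function name is made up for illustration, and counting only full batches is an assumption based on the "trigger at 100 images" rule above.

```python
def batches_per_hour(images_per_hour: int, batch_size: int) -> int:
    """Number of pipeline runs triggered per hour (only full batches trigger a run)."""
    return images_per_hour // batch_size


runs = batches_per_hour(1_000_000, 100)
print(runs)                   # 10000 pipeline runs per hour
print(round(runs / 3600, 2))  # 2.78 pipeline triggers per second
```

At roughly three pipeline triggers per second, the per-run scheduling overhead matters far more than it would for a handful of long training jobs.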
one year ago
0 Hi Team! Is There A Way To Make ClearML’s AWS Autoscaler And Queues Resource-Aware Please? I.E. If We Can Say, As We Enqueue Our Job, How Much RAM Or GPU-RAM Or Even GPUs It Needs, Have The Scheduler/Autoscaler Dispatch The Job To Instances That Are Of Th

But from your other answer, I think I'm understanding that you can have multiple agents on a single instance listening to the same queue.

So we could maybe initialize 4 instances of the agent on a single EC2 instance which would allow us to handle a higher volume of small batches concurrently without tying up the entire instance.
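
A minimal sketch of that idea, assuming the `clearml-agent` CLI is installed on the instance and that each agent is started with `clearml-agent daemon --queue <name> --detached`. The helper names are made up for illustration, the queue name `streaming` and the count of 4 come from this thread, and the snippet only prints the commands unless `dry_run` is switched off.

```python
import subprocess


def build_agent_commands(queue: str, n_agents: int) -> list[list[str]]:
    """Build one `clearml-agent daemon` command per agent, all on the same queue."""
    return [
        ["clearml-agent", "daemon", "--queue", queue, "--detached"]
        for _ in range(n_agents)
    ]


def launch(commands: list[list[str]], dry_run: bool = True) -> None:
    """Print the commands, or actually start the agents when dry_run is False."""
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)


commands = build_agent_commands("streaming", 4)
launch(commands)  # dry run: prints four identical daemon commands
```

On a GPU instance each agent would probably also need its own GPU assignment so that the agents do not compete for the same device; that part is not shown here.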

one year ago
0 Hey Friends, How Do You Configure ClearML To Use An S3 Bucket? Specifically: Does

Thanks Vasil! Can you elaborate on what you mean by using boto3? Do you mean writing a script using boto that pulls the credentials down and writes to the user's clearml.conf

Also, I've been seeing references to "credentials vault" in the docs. I can see this is the problem that it solves.
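
One possible reading of the boto3 suggestion, sketched under assumptions: a script fetches the keys from a central store (here AWS SSM Parameter Store via `boto3`, with hypothetical parameter names) and renders the `sdk.aws.s3` section that would go into a user's clearml.conf. Whether this is what was meant is not confirmed in the thread, and the function names are illustrative only.

```python
def fetch_parameter(name: str, region: str) -> str:
    """Read a decrypted parameter from SSM Parameter Store (needs boto3 and AWS access)."""
    import boto3  # imported lazily so the rendering helper below works without it

    ssm = boto3.client("ssm", region_name=region)
    response = ssm.get_parameter(Name=name, WithDecryption=True)
    return response["Parameter"]["Value"]


def render_s3_section(key: str, secret: str, region: str) -> str:
    """Render the sdk.aws.s3 block of clearml.conf with the given credentials."""
    return (
        "sdk {\n"
        "  aws {\n"
        "    s3 {\n"
        f'      key: "{key}"\n'
        f'      secret: "{secret}"\n'
        f'      region: "{region}"\n'
        "    }\n"
        "  }\n"
        "}\n"
    )


# Placeholder values; in practice they might come from something like
# fetch_parameter("/clearml/s3_access_key", "us-west-2")  (hypothetical name).
print(render_s3_section("EXAMPLE_KEY", "EXAMPLE_SECRET", "us-west-2"))
```

This still leaves long-lived keys on each user's machine, which is the problem the "credentials vault" mentioned above appears to address.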

one year ago
0 Security Question: In My Journey Of Running ClearML The "Hard Way" (Self-Hosted), One Problem I Haven't Solved Is Security. Some Discussion Here...

When you run the docker-compose.yml on an EC2 instance, you can configure user login for the ClearML webserver. But the files API is still open to the world, right? (and same with the backend?)

We could solve this by placing the EC2 instance into a VPN.

One disadvantage to that approach is it becomes annoying to reach the model registry from outside the VPN, like if you have a deployment pipeline based in GitHub Actions. Or if you wanted to trigger a ClearML pipeline from a VPC that isn...

one year ago
0 Hi,

Disclaimer: I'm not familiar enough with the ClearML codebase to vouch for the quality of this PR, although it is short, which is typically good. The feature we're interested in is the ability to specify the subnet_id.

one year ago
0 Hi,

I may be able to prepare a PR that only allows specifying the subnet ID. Can you help me brainstorm scenarios you’d want to see tested? Also, do these need to be automated tests?

one year ago
0 Hi Team! Is There A Way To Make ClearML’s AWS Autoscaler And Queues Resource-Aware Please? I.E. If We Can Say, As We Enqueue Our Job, How Much RAM Or GPU-RAM Or Even GPUs It Needs, Have The Scheduler/Autoscaler Dispatch The Job To Instances That Are Of Th

Thank you! I think it does. It’s just now dawning on me that, because a pipeline is composed of multiple tasks, different tasks in the pipeline could run on different machines. Or more specifically, they could run on different queues, and as you said in your other response, we could have a queue for smaller CPU-based instances and another queue for larger GPU-based instances.

I like the idea of having a queue dedicated to CPU-based instances that has multiple agents running on it simultaneously....

one year ago
0 I’m

One idea: is it possible to store usable credentials in advance and place them in a volume that the ClearML containers can access and then use?

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Actually, dumb question: how do I set the setup script for a task?

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Here's a screenshot of a session where I first try to clone as ssm-user, but it fails, then I change to root and it succeeds

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Well wow, I figured it out. You equipped me with a solid debugging tool, AKA running bash commands within the docker container.

I had to pre-add GitHub and Bitbucket to known hosts by adding keyscan commands

configurations:
  extra_clearml_conf: ""
  extra_trains_conf: ""
  extra_vm_bash_script: |
    echo "fetching github key" && (aws ssm get-parameter --region us-west-2 --name /clearml/github_ssh_private_key --with-decryption --query Parameter.Value --output text > ~/.ssh/id_rsa &...
one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I can't think of any changes we might have made on our side to cause that 🤔

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I
configurations:
  extra_clearml_conf: ""
  extra_trains_conf: ""
  extra_vm_bash_script: |
    aws ssm get-parameter --region us-west-2 --name /clearml/github_ssh_private_key --with-decryption --query Parameter.Value --output text > ~/.ssh/id_rsa && chmod 600 ~/.ssh/id_rsa
    source /clearml_agent_venv/bin/activate

hyper_params:
  iam_arn: arn:aws:iam::<my account id>:instance-profile/clearml-2-AutoscaledInstanceProfileAutoScaledEC2InstanceProfile56A5348F-90fmf6H5OUBx
one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I do agree with your earlier observation that the target of that mount seems wrong. I would think that the volume mount should be -v /root/.ssh:/root/.ssh but instead it's -v /root.ssh:/.ssh

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Let's see. The screenshots above are me running on the host, not attaching to a running container. So I believe I do want the keys to be mounted into the running containers.

one year ago
0 Security Question: In My Journey Of Running ClearML The "Hard Way" (Self-Hosted), One Problem I Haven't Solved Is Security. Some Discussion Here...

OOooh, excellent. So the file server isn't necessary at all if you're using some other object storage? That's slick!

Is there a way I could move the JWT authentication (not authorization) logic into an API Gateway or Load Balancer? For example, if ClearML is following OAuth 2.0, then the load balancer or API Gateway could reach out to its "issuer URL" (probably available on the EC2 instance where ClearML is running) like this example here.
![image](https://clearml-web-assets.s3.amazonaws.c...

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

So, we've been able to run sudo su and then git clone with our private repos a few times now

one year ago