Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
BattyCrocodile47
Moderator
34 Questions, 145 Answers
  Active since 02 March 2023
  Last activity one month ago

Reputation

0

Badges 1

127 × Eureka!
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
Another AWS autoscaler question. The docker-compose.yml automatically adds a ClearML agent to the services queue. When I run python aws_autoscaler.py --remot...
one year ago
0 Votes
0 Answers
851 Views
0 Votes 0 Answers 851 Views
If anyone wants to join remotely, there’s a remote-first AI/ML hackathon happening tomorrow. Some of the details are out of date right now—I’ll have this pag...
one year ago
0 Votes
9 Answers
122 Views
0 Votes 9 Answers 122 Views
one month ago
0 Votes
34 Answers
63K Views
0 Votes 34 Answers 63K Views
My autoscaled instance fails when running "git clone" on a private repo. I do have the SSH key placed at /root/.ssh/id_rsa on the machine, and when I SSH int...
one year ago
Show more results questions
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I do agree with your earlier observation that the target of that mount seems wrong. I would think that the volume mount should be -v /root/.ssh:/root/.ssh but instead it's -v /root.ssh:/.ssh

one year ago
0 Another Aws Autoscaler Question. The

Sorry, clarifying:

The agent-services entry in the docker-compose file seems to add a single worker to the services queue
image

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Actually that's wrong: really this is the current volume mount

'-v', '/tmp/clearml_agent.ssh.cbvchse1:/.ssh',

Could changing these values to /root/.ssh work? Do you know what use within the docker image ClearML is using?
image

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Haha, that was a total gotcha for me. Yeah, a lot just wasn't even getting run due to the #!/bin/bash part.

Anyway, wow! I finally got the precious console logs you thought to find, here they are:

2023-05-06 00:19:21
User aborted: stopping task (3)
2023-05-06 00:19:21
Successfully installed PyYAML-6.0 attrs-22.2.0 certifi-2022.12.7 charset-normalizer-3.1.0 clearml-agent-1.5.2 distlib-0.3.6 filelock-3.12.0 furl-2.1.3 idna-3.4 jsonschema-4.17.3 orderedmultidict-1.0.1 pathlib2-2.3.7....
one year ago
one year ago
0 Sorry For Always Posting Such Cryptic Problems. I Managed To Create A Docker-Compose File That Runs Clearml

And for the session

clearml-session --queue sessions --docker python:3.9
one year ago
0 Hi,

Disclaimer: I'm not familiar enouch with the ClearML codebase to vouch for the quality of this PR, although it is short which is typically good . The feature we're interested in is the ability to specify the subnet_id .

one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

Here's a screenshot if a session where I first try to clone as ssm-user , but it fails, then I change to root and it succeeds
image

one year ago
0 Hey

Oh! System tags! That would definitely have been a better way to do it. We ended up querying for tasks in the "DevOps" project with the name "Interactive Session"

one year ago
0 Hey! Starting An Mlops Director Position In 2 Weeks. I'M Thinking About Architecture. Has Anyone Ever Tried To Use Clearml As An Experiment Tracker, But Used A Different Orchestrator Like Metaflow, Airflow, Prefect, Etc.? I'M Struggling To Find Guides Or

Hey @<1523701482157772800:profile|AnxiousSeal95> ! I think ClearML's orchestrator is a great fit for ad-hoc experimentation, but not for (event-triggered) batch inference jobs that need to be relied on in production.

I'd only feel comfortable supporting pipelines that serve end users on a tool that is known for that, e.g. Metaflow, Dagster, or Airflow--mainly because those tools emphasize good monitoring and integration with the wider data ecosystem.

one month ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I don't see it as an argument in Task.init or Task.execute_remotely

one year ago
0 Hey

Is there some way we could programmatically list all current ClearML sessions?

We need a way to do that, maybe with the clearml-session CLI in order to populate the VS Code extension menu.

one year ago
0 Sorry For Always Posting Such Cryptic Problems. I Managed To Create A Docker-Compose File That Runs Clearml

I've also tried running a clearml-agent daemon directly on my mac (not in docker) serving the sessions queue for the ClearML server that is running in docker. When I do that, it consistently fails with a different error. Something to do with mounting a volume.

one year ago
0 Hey

But I actually wish the interface were more like the apiserver.conf file--specifically, that you can define hard-coded credentials in this file in advance. Except, I wish that you could define API keys this way (or some other way)

auth {
    # Fixed users login credentials
    # No other user will be able to login
    fixed_users {
        enabled: true
        pass_hashed: false
        users: [
            {
                username: "test"
                password: "test"
      ...
one year ago
0 How Would Ya'Ll Approach Backing Up The Elastic-Search/Redis/Etc. Data In Self-Hosted Clearml? Any Drawbacks/Risks Of Doing A Simple Process That Periodically Zips Up The

@<1523701070390366208:profile|CostlyOstrich36> Oh that’s smart. Is that to make sure no transactions happen during the backup? Would there be a risk of ongoing or pending tasks somehow getting corrupted if you shut the server down?

one year ago
0 I’M

^^^ For my own notes: this is the web request made by the frontend to create a set of credentials
image

one year ago
0 Can Anyone Recommend A Good Workflow For

Oh my goodness. Thank you! I'd seen that before, but for some reason it didn't register I could run that with VS Code...

But this config should almost never need to change!

Host clearml-session
    HostName localhost
    User root
    Port 8022
one year ago
0 Hey

I ultimately resorted to creating a selenium script combined with docker-compose. Not a beautiful solution but I can confirm that it works 😕 None

one year ago
0 Hey

Aren't they two different auth systems? One for humans and one for machines?

one year ago
0 I’M

I did a quick local experiment and observed that credentials created from the UI indeed become invalid if you delete the ClearML volumes.

  • starting docker-compose locally
  • creating a set of credentials from the UI
  • hardcodign those credentials into the docker-compose file
  • restarting
  • the agent-services container started up and successfully became a registered worker
  • I killed the docker-compose and deleted the volume folders
  • restarted the docker-compose (with the same hard-coded...
one year ago
0 Hey

Exactly

one year ago
0 How Would Ya'Ll Approach Backing Up The Elastic-Search/Redis/Etc. Data In Self-Hosted Clearml? Any Drawbacks/Risks Of Doing A Simple Process That Periodically Zips Up The

You know, you could probably add some immortal containers to the docker-compose.yml that use images with mongodump and the ES equivalent installed.

The container(s) could have a bash script with a while loop in it that sleeps for 30 minutes and then does a backup. If you installed the AWS CLI inside, it could even take care of uploading to S3.

I like this idea, because docker-compose.yml could make sure that if the backup container ever dies, it would be restarted.

one year ago
0 More Of Pushing Clearml To It'S Data Engineering Limits

I took a stab at writing an automated trigger to handle this. The goal is: anytime a pipeline succeeds or fails, let AWS know so that the input records can be placed onto a retry queue (or not)

I'm trying to get a trigger to work in general, and then I'll add the more complex AWS logic. But I seem to be missing a step somewhere:

I wrote a file called set_triggers.py

from clearml.automation.trigger import TriggerScheduler

TRIGGER_SCHEDULER = TriggerScheduler()

from pprint import...
one year ago
0 My Autoscaled Instance Fails When Running "Git Clone" On A Private Repo. I

I'm not seeing a extra_docker_shell_script in my clearml.conf generated by clearml-agent init like in this guide

one year ago
Show more results compactanswers