Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
After Presenting Clearml To My Team, I Got The Question "We'Re Already On Aws, Why Not Use Sagemaker?" Tbh, I'Ve Never Gone Through The Ml Workflow With Sagemaker. The Only Advantage I Could Think Of Is That We Can Use Our On-Prem Machines For Training,

After presenting ClearML to my team, I got the question "we're already on AWS, why not use SageMaker?"

TBH, I've never gone through the ML workflow with SageMaker. The only advantage I could think of is that we can use our on-prem machines for training, but we only have 3 of those.

Can anyone make the case for ClearML over SageMaker for teams already on AWS?

[EDIT] For context, we're a medium sized AI team with 6 data scientists and 9 data engineers, putting new models into products fairly often, both in offline/batch and online/real-time settings.

  
  
Posted one year ago
Votes Newest

Answers 3


Hi @<1541954607595393024:profile|BattyCrocodile47> and @<1523701225533476864:profile|ObedientDolphin41>

"we're already on AWS, why not use SageMaker?"

TBH, I've never gone through the ML workflow with SageMaker.

LOL I'm assuming this is why you are asking šŸ™‚

  • First, you can use SageMaker and still log everything to ClearML (2 lines integration). At least you will have visibility to everything that is running/failing šŸ™‚
  • SageMaker job is a container, which means for Every job (that in a lot of cases is a one time test) users need to build containers push them into the registry, and then of course forget to remove them. This means it is hard to move from writing code to launching and the management costs are high (tons of containers no one is using and everyone is afraid of deleting)
  • As mentioned, SageMaker does not support on-prem/hybrid resources
  • SageMaker costs extra on top of the compute
  • There is no good dashboard for monitoring jobs and launching them from sagemaker. Basically it was designed for devops for monitoring long lasting servers, not ephemeral jobs constantly changing, and it shows ...
  • Multi step pipelines are not supported in sagemaker (I mean you can hack it, but go figure later what really happened)
  • Sagemaker does not have caching mechnisms (i.e. rerunning the same job with the same data/args should be reused)
  • Sagemaker outputs by default are just more files in S3 bucket, which is a mess to manage
    I probably forgot a few, but you get the gist, SageMaker was built to launch containers on EC2, not to manager ML workflows. So other than launching containers (that it does very nicely), everything else is missing.
    (just my 2 cents, but I might be a bit biased after having to work with it for a while šŸ˜‰ )
  
  
Posted one year ago

@<1523701205467926528:profile|AgitatedDove14> you beautiful person, this is terrific! I do believe SageMaker has some nice monitoring/data drift capabilities that seem interesting, but these points you have here will be a fantastic starting point for my team's analysis of the products. I think this will help balance some of the over-enthusiasm towards using the native AWS solution.

  
  
Posted one year ago

Iā€™m curious what the opinions are on this! I asked myself the same question. In my limited experience, going through a workflow with SageMaker was a painful process, and one that required a ton of AWS-specific code and configuration. Compared to this, ClearML was easy and quick to set up, and provides a dashboard where everything from experiments to models to output is organised, queryable and comparable. Way less hassle for way more benefits.

  
  
Posted one year ago
1K Views
3 Answers
one year ago
one year ago
Tags