Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hello Clearml Ppl

Hello clearml ppl 😄
very general question, does clearml as any API for any feature store or are there any plan on implementing a feature store inside clearml?

Thanks 🙂

  
  
Posted 2 years ago
Votes Newest

Answers 9


Hi SmoggyGoat53
What do you mean by "feature store" ? (These days the definition is quite broad, hence my question)

  
  
Posted 2 years ago

I know I drop a very general question, so I am sorry for the confusion.

However, I was reading online how other companies were dealing with data pipeline for handling data both in training & inference in a very standard, and monitorable way.

I saw people refer to something called feature store , the first was probably uber ( source https://eng.uber.com/michelangelo-machine-learning-platform/ ) ... I know tecton is one provider of it (source https://www.tecton.ai/ ), but there might be others.

Does this clarify a bit more?

  
  
Posted 2 years ago

Sure, something like : https://github.com/feast-dev/feast

  
  
Posted 2 years ago

Exactly! so do you guys have any sort of API for anything like that?

  
  
Posted 2 years ago

Actually it would be interesting to combine the two, feast is fully open-source and supported by the linux foundation, so I cannot see the harm in that.
wdyt?

  
  
Posted 2 years ago

I think it would be amazing Martin 😃 . You guys are providing an amazing service at clearml!

However if I could give a feedback for you to even improve it, I would suggest to start think of implementing some feature store API. you know better then me that for productionization of ML data pipelines are very critical.

And I really hope I don't sound arrogant by giving this feedback 🙂

  
  
Posted 2 years ago

Not at all, we love ideas on improving ClearML.
I do not think there is a need to replace feast, it seems to do a lot, I'm just thinking on integrating it into the ClearML workflow. Do you have a specific use case we can start to work on? Or maybe a workflow that would make sense to implment?

  
  
Posted 2 years ago

I am completely honest, in the sense I am starting to get familiar with feature store in general.

For use case I have something specific to my data pipeline, but I think it can be easily generalized. I could start to look into feast and see from there what we need!

  
  
Posted 2 years ago

Actually AgitatedDove14 I think that Feast is great, but is missing the part of really transform the data with feature engineering ... so you may want either to develop that part on your side or look at other provider ...

is a feature-store-as-a-service. A big difference between Feast and Tecton is that Tecton supports transformations, so feature pipelines can be managed end-to-end within Tecton. Tecton is a managed offering, and a great feature store choice if you need production SLAs, hosting, advanced collaboration, managed transformations (batch/streaming/real-time), and/or enterprise capabilities.
from

  
  
Posted 2 years ago