Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi All! I'M Trying To Easily Show And Query (With Sql) Parquet Files Stored By Clearml In An S3 Storage. I'Ve Tried To Integrate Duckdb, But For Large Chunked Files Of Several Gb, It'S Really Slow Compared To The Efficiency Duckdb Has In Directly Querying

Hi All!
I'm trying to easily show and query (with SQL) Parquet files stored by ClearML in an S3 storage.
I've tried to integrate DuckDB, but for large chunked files of several GB, it's really slow compared to the efficiency DuckDB has in directly querying standard Parquet files (Maybe because in ClearML storage, they are zipped?).
Do you know if an official integration exists, or if there is another tool/method to do that?
Thanks!

  
  
Posted one day ago
Votes Newest

Answers 2


Hi @<1874989039501709312:profile|LividDragonfly0> , are you simply reading the files from your code? There is not specific integration, but simply accessing downloaded files should work

  
  
Posted 20 hours ago

Thanks for the reply, so I have to download the file before, I hoped it would be read on the fly.

  
  
Posted 19 hours ago
12 Views
2 Answers
one day ago
8 hours ago
Tags