Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi Again, I Was Wondering What Would Be A Good Practice With Respect To Saving Different Datasets (While Preprocessing It In Several Steps/Stages). Mainly With The Use Of Remove_Files(). Is It Ok To Delete Raw Data After Preprocessing For Example? In That

Hi again, I was wondering what would be a good practice with respect to saving different datasets (while preprocessing it in several steps/stages). Mainly with the use of remove_files(). Is it ok to delete raw data after preprocessing for example? in that case remove_files() wont work if we got the initial dataset with get_local_copy(), is that correct; and I should use get_mutable_local_copy() ? thanks!
The idea is to avoid downloading all dataset (with parents) for each stage, I see that the option "part" in get_mutable_local_copy() could also allow this.

  
  
Posted one year ago
Votes Newest

Answers


Hi CostlyElephant1
What do you mean by "delete raw data"? Data is always fetched to cached folders and clearml takes care of cache cleanup
That said notice that get mutable copy is a target you specify, in this case you should definetly delete after usage. Wdyt ?

  
  
Posted one year ago
687 Views
1 Answer
one year ago
one year ago
Tags