Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hello, I Have A General Question About Data Versioning Using Clearml. When Lets Say That My Parent Dataset Has 100 Files, And That I Create A Child Dataset From It By Adding An Extra 50 Files To The Original 100. Will My 100 Files Be Duplicated On My Serv

hello, i have a general question about data versioning using ClearML.
When lets say that my parent dataset has 100 files, and that I create a child dataset from it by adding an extra 50 files to the original 100. Will my 100 files be duplicated on my server?

  
  
Posted 12 months ago
Votes Newest

Answers 5


No, it should be just the amount of files remaining

  
  
Posted 11 months ago

is this still true if the child dataset is smaller than the parent? If the parent dataset is 1Tb and I delete half the files, I will still be pushing 2Tb of data to the server?

  
  
Posted 11 months ago

Hi @<1547028031053238272:profile|MassiveGoldfish6> , yes, every new version contains all included files

  
  
Posted 12 months ago

so if my parent dataset is 1Tb and I add a single file to create a child dataset. There will now be 2Tb of data on the server. The parent dataset is duplicated on the server?

  
  
Posted 12 months ago

Yes. Differential Datasets are part of the ClearML Scale and Enterprise solution 😞

  
  
Posted 12 months ago