Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi Everyone! I Have Started Using The

Hi everyone!
I have started using the StorageManager as a utility for my training code.
Before training starts, I use it to download the training data from S3, with its built in automatic local caching, which is great because it saves me the time of downloading the data for every single experiment.
I was wondering however, suppose the cache is empty, I launch a new training session and it starts downloading the data. Then, a minute later I launch a second training session that uses the same data.
The second script would observe the a "partial" file in the cache dir, would it start overwriting it on its own? will it wait for the first training session to finish the download? Is it safe at all to use it this way, in terms of race conditions?

  
  
Posted 4 days ago
Votes Newest

Answers 4


It's handled by a separate process, my guess that it will start downloading other chunks of the data or just wait for the original process.

  
  
Posted 4 days ago

CostlyOstrich36 OK. Thanks again!

  
  
Posted 4 days ago

CostlyOstrich36
Thanks! Do you have some details you can share about what would actually happen in the situation I described?

  
  
Posted 4 days ago

Hi IrritableHippopotamus34 , I think it should be safe.

  
  
Posted 4 days ago
21 Views
4 Answers
4 days ago
3 days ago
Tags
Similar posts