Answered
Hi everyone, I’m trying to create a pipeline from tasks without uploading the data into ClearML server because

Hi everyone,

I’m trying to create a pipeline from tasks without uploading the data to the ClearML server, because the dataset is large and I run into memory issues. Instead, I want to use TensorFlow prefetch to load small batches while training. Is this possible? If so, is there an available example that uses tf prefetch?

  
  
Posted 10 months ago

Answers 6


Hey Yasir, to use TensorFlow prefetch your data needs to be (1) chunked and (2) stored on some server/bucket/network-attached FS. Unless both conditions are satisfied, TF prefetch won't help you.
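For illustration, here is a minimal sketch of what a prefetching input pipeline could look like with `tf.data`. It assumes the images are individual JPEG files matched by a glob pattern and resizes them to 640x640, the size mentioned elsewhere in this thread. The names `load_image` and `make_dataset` and the bucket path in the comment are hypothetical, and reading `gs://` paths directly depends on your TensorFlow build having GCS filesystem support. Labels are left out to keep it short.

```python
import tensorflow as tf

IMAGE_SIZE = (640, 640)  # assumption: target size taken from this thread


def load_image(path):
    """Read one JPEG file and return a float32 tensor scaled to [0, 1]."""
    raw = tf.io.read_file(path)
    image = tf.io.decode_jpeg(raw, channels=3)
    image = tf.image.resize(image, IMAGE_SIZE)  # returns float32
    return image / 255.0


def make_dataset(file_pattern, batch_size=8):
    """Stream images matching file_pattern in small, prefetched batches."""
    # Only file names are listed up front; pixels are read lazily per batch.
    files = tf.data.Dataset.list_files(file_pattern, shuffle=True)
    images = files.map(load_image, num_parallel_calls=tf.data.AUTOTUNE)
    # prefetch overlaps loading of the next batch with the current training step.
    return images.batch(batch_size).prefetch(tf.data.AUTOTUNE)


# Hypothetical usage; the bucket path is a placeholder:
# dataset = make_dataset("gs://my-bucket/images/*.jpg", batch_size=8)
# for batch in dataset:
#     ...  # run one training step on the batch
```

Because files are read only as batches are consumed, memory use stays roughly proportional to the batch size rather than to the whole dataset.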

How large is the dataset we're talking about?

  
  
Posted 10 months ago

Thank you

  
  
Posted 10 months ago

That's not that much. You can use the AWS autoscaler and provision a spot g4dn GPU instance with a bit more disk. This should cost you less than 50 cents an hour.

  
  
Posted 10 months ago

Yes, works with GCP too

  
  
Posted 10 months ago

Can’t I use GCP?

  
  
Posted 10 months ago

Thank you. The data is stored in a GCP bucket and it’s about 4k images of 640x640. I’m also using the hosted ClearML service.

  
  
Posted 10 months ago