Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi Everyone, I’M Trying To Create A Pipeline From Tasks Without Uploading The Data Into Clearml Server Because

Hi everyone,

I’m trying to create a pipeline from tasks without uploading the data into clearml server because it’s large and I get into memory issues. Instead I want to use tensorflow prefetch to get small batches while training. Is this possible? If so, is there any available example uses tf prefetch ?

  
  
Posted 2 months ago
Votes Newest

Answers 6


Hey Yasir, to use tensorflow prefetch your data needs to be (1) chunked and (2) stored on some server/bucket/network-attached FS. If both conditions are not satisfied, TF prefetch won't help you.

How large is the dataset we're talking about?

  
  
Posted 2 months ago

Yes, works with GCP too

  
  
Posted 2 months ago

Can’t I use GCP?

  
  
Posted 2 months ago

That's not that much. You can use the AWS autoscaler and provision a spot g4dn GPU instance with a bit more disk. This should cost you less than 50 cents an hour

  
  
Posted 2 months ago

Thank you. The data is stored in GCP bucket and it’s about 4k images of 640x640. I’m also using host service clearml.

  
  
Posted 2 months ago

Thank you

  
  
Posted 2 months ago
187 Views
6 Answers
2 months ago
2 months ago
Tags