Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hello, For Some Reason My Upload Speed To S3 Is Insanely Slow, I Noticed In Logs That It Upoads To /Tmp Folder. What Does That Mean? Why Tmp?

Hello, for some reason my upload speed to S3 is insanely slow, I noticed in logs that it upoads to /tmp folder. What does that mean? why tmp?
image

  
  
Posted 10 months ago
Votes Newest

Answers 5


Hi AmiableSeaturtle81 ! What function are you using to upload the data?

  
  
Posted 10 months ago

No, i specify where to upload
I see the data on S3 bucket is beeing uploaded. Just the log messages are really confusing
image

  
  
Posted 10 months ago

image

  
  
Posted 10 months ago

im also batch uploading, maybe thats the problem?

  • The dataset is about 1TB containing 1 million files
  • I dont have the SSD space locally to do the upload
  • So i download a part of the dataset, use add_files() and then upload() to that batch
  • Upload the dataset
    I noticed that each batch is slower and slower
  
  
Posted 10 months ago

AmiableSeaturtle81 note that we zip the files before uploading them as artifacts to the dataset task. Any chance you are specifying the default output uri as being a local path, such as /tmp ?

  
  
Posted 10 months ago
755 Views
5 Answers
10 months ago
10 months ago
Tags