Answered
Hello, for some reason my upload speed to S3 is insanely slow. I noticed in the logs that it uploads to the /tmp folder. What does that mean? Why /tmp?
image

  
  
Posted 6 months ago

Answers 5


No, I specify where to upload.
I can see the data being uploaded to the S3 bucket. It's just that the log messages are really confusing.
image

  
  
Posted 6 months ago

Hi @<1590514584836378624:profile|AmiableSeaturtle81> ! What function are you using to upload the data?

  
  
Posted 6 months ago

@<1590514584836378624:profile|AmiableSeaturtle81> Note that we zip the files before uploading them as artifacts to the dataset task. Any chance you are specifying the default output URI as a local path, such as /tmp?
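To illustrate why a /tmp path can show up in the logs even when the final destination is S3, here is a minimal sketch of a "zip locally, then upload" staging step. It is not ClearML's actual implementation; `stage_archive` is a hypothetical helper, and the only assumption is that the archive is written to the system temp directory (usually /tmp on Linux) before anything is sent to the bucket.

```python
import os
import tempfile
import zipfile
from pathlib import Path


def stage_archive(files, staging_dir=None):
    """Zip `files` into a temporary archive and return the archive path.

    Hypothetical helper: it only illustrates the local staging step that
    happens before an upload, not any real library's internals.
    """
    # tempfile.gettempdir() typically resolves to /tmp on Linux.
    staging_dir = Path(staging_dir or tempfile.gettempdir())
    fd, archive_path = tempfile.mkstemp(suffix=".zip", dir=staging_dir)
    os.close(fd)  # mkstemp returns an open descriptor; zipfile reopens the path

    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for f in files:
            # Store each file under its base name (name clashes are not handled).
            zf.write(f, arcname=Path(f).name)
    return Path(archive_path)
```

In a flow like this, a log line mentioning /tmp refers to the intermediate archive, while the upload target remains whatever bucket was configured.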

  
  
Posted 6 months ago

I'm also batch uploading, maybe that's the problem?

  • The dataset is about 1 TB and contains 1 million files
  • I don't have the SSD space locally to do the upload
  • So I download a part of the dataset, then call add_files() and upload() on that batch
  • Upload the dataset

I noticed that each batch is slower and slower.
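For anyone trying to reproduce the slowdown, the batching workflow described above can be sketched roughly as follows. This is only an outline under stated assumptions: `batches_by_size` and `run_batches` are hypothetical helpers, file sizes are assumed to be known up front as `(name, size_in_bytes)` pairs, and `process_batch` stands in for whatever downloads the batch and then calls add_files() and upload() on it. Timing each batch makes it easy to check whether later batches really take longer.

```python
import time
from typing import Callable, Iterable, Iterator, List, Tuple


def batches_by_size(
    items: Iterable[Tuple[str, int]], max_bytes: int
) -> Iterator[List[str]]:
    """Group (name, size) pairs into batches of at most `max_bytes` each.

    A single item larger than `max_bytes` still gets a batch of its own.
    """
    batch: List[str] = []
    batch_bytes = 0
    for name, size in items:
        # Close the current batch if adding this item would exceed the budget.
        if batch and batch_bytes + size > max_bytes:
            yield batch
            batch, batch_bytes = [], 0
        batch.append(name)
        batch_bytes += size
    if batch:
        yield batch


def run_batches(
    items: Iterable[Tuple[str, int]],
    max_bytes: int,
    process_batch: Callable[[List[str]], None],
) -> List[float]:
    """Process every batch and return the wall-clock seconds each one took."""
    durations = []
    for batch in batches_by_size(items, max_bytes):
        start = time.perf_counter()
        # Placeholder: download the files, add them to the dataset, upload them.
        process_batch(batch)
        durations.append(time.perf_counter() - start)
    return durations
```

If the recorded durations grow steadily while the batch sizes stay constant, that points at per-batch overhead on the dataset side rather than at raw S3 bandwidth.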
  
  
Posted 6 months ago

image

  
  
Posted 6 months ago