Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hello, Is There A Way To Disable Dataset Caching So That When

Hello,
is there a way to disable dataset caching so that when dataset.get_mutable_local_copy('/my-folder') is called, a copy of the dataset is not saved to /home/<name>/.clearml/cache/storage_manager but only saved to the '/my-folder' folder

  
  
Posted one year ago
Votes Newest

Answers 6


All datasets contain sensitive data, and I wish there was some way to use a remote network drive as a cache (sounds weird, but is there a better way?)

  
  
Posted one year ago

FreshParrot56 we could add this capability, but the main caveat is that f your version depends on multiple parent versions you still need to download and extract all the parent versions, which means that when you clear them you might hurt later performance. Does that make sense? What is the use-case / scenario for you?

  
  
Posted one year ago

I want to get a copy of the dataset with sensitive data on a remote network drive, and I don't want this data or any part of it to remain on the computer from which I am executing the request (even sacrificing performance)

  
  
Posted one year ago

Hi FreshParrot56 ! This is currently not supported 🙁

  
  
Posted one year ago

FreshParrot56 You could modify this entry in your clearml.conf to point to your drive: sdk.storage.cache.default_base_dir .
Or, if you don't want to touch your conf file, you could set the env var CLEARML_CACHE_DIR to your remote drive before you call get_local_copy. See this example:
dataset = Dataset.get(DATASET_ID) os.environ["CLEARML_CACHE_DIR"] = "/mnt/remote/drive" # change the clearml cache, make it point to your remote drive copy_path = dataset.get_local_copy() print(copy_path) # the path will point to your remote drive del os.environ["CLEARML_CACHE_DIR"] # delete the env var, now clearml will once again cache data to your local machine copy_path = dataset.get_local_copy() print(copy_path)

  
  
Posted one year ago

thank you!

  
  
Posted one year ago
1K Views
6 Answers
one year ago
one year ago
Tags