Answered
I'M New To Using Datasets, If My Git Project Root Is

I'm new to using datasets. If my git project root is myProject and I expect file.json to be at the root level, how do I accomplish this?

  
  
Posted 2 years ago

Answers 18


I assumed I would need to upload it and then reference it somehow?

  
  
Posted 2 years ago

You can save it as a dataset and then fetch it at runtime. Or am I missing something?

  
  
Posted 2 years ago

Do I have to fetch it via code? I was hoping not to modify my scripts.

  
  
Posted 2 years ago

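ClearML downloads and caches datasets under the ~/.clearml/ folder, so yes, you need to modify your code.
import os
from clearml import Dataset
# Fill in your own project, dataset name and version in place of the placeholders
dataset_folder = Dataset.get(dataset_project="<project>", dataset_name="<dataset>", dataset_version="<version>").get_local_copy()
file_json_path = os.path.join(dataset_folder, "file.json")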

  
  
Posted 2 years ago

Thanks!

  
  
Posted 2 years ago

After proving we can run our training, I would then advise we update our code base

  
  
Posted 2 years ago

Can you please elaborate on what you mean?

  
  
Posted 2 years ago

Or is there an easier way?

  
  
Posted 2 years ago

Could I simply reference the files by name and pass in a string such as ~/.clearml/my_file.json?

  
  
Posted 2 years ago

This would be a short term solution as we build a proof of concept

  
  
Posted 2 years ago

ok good to know

  
  
Posted 2 years ago

You could, but I’d advise against it, since that is not the intended way.

  
  
Posted 2 years ago

ok, but if you were to run it from a different machine (or a different user!) it wouldn’t work

  
  
Posted 2 years ago

so it caches to ~/.clearml/ any files that are under the same project name?

  
  
Posted 2 years ago

I wouldn't be able to pass in ~/.clearml/cache/storage_manager/datasets/ds_{ds_id}/my_file.json as an argument?

  
  
Posted 2 years ago

I’m afraid I don’t think there is a way around this without modifying your code.
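
One possible workaround for a proof of concept, sketched here under assumptions rather than as the intended ClearML workflow: keep the training script untouched and add a small launcher that fetches the dataset, copies file.json into the project root, and then starts the script. The names stage_file, run_with_dataset and train.py are hypothetical, and the "<project>" and "<dataset>" values are placeholders.

```python
import shutil
import subprocess
import sys
from pathlib import Path


def stage_file(dataset_folder, filename, project_root):
    """Copy one file from a local dataset copy into the project root."""
    src = Path(dataset_folder) / filename
    dst = Path(project_root) / filename
    shutil.copy2(src, dst)
    return dst


def run_with_dataset():
    # Imported here so stage_file can be used without clearml installed.
    from clearml import Dataset

    # Placeholder values: replace with your own project and dataset names.
    dataset_folder = Dataset.get(
        dataset_project="<project>", dataset_name="<dataset>"
    ).get_local_copy()
    stage_file(dataset_folder, "file.json", Path.cwd())
    # train.py is a hypothetical name for the unmodified training script.
    subprocess.run([sys.executable, "train.py"], check=True)
```

Calling run_with_dataset() from the project root would place file.json next to the script before it runs. The launcher itself is new code, so this only avoids changes to the existing scripts.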

  
  
Posted 2 years ago

Sure. My git repo myProject.git does not have file.json checked into VCS. I'd like to add this file at experiment runtime or equivalent.

  
  
Posted 2 years ago

It is not cached directly in the ~/.clearml folder. There are several directories inside it (one for storage, one for pip, another for venvs, etc.).

So in your case it would be stored in ~/.clearml/cache/storage_manager/datasets/ds_{ds_id}/my_file.json
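
Because the ds_{ds_id} part of that path differs per dataset, a hardcoded string is fragile. A minimal sketch, assuming you already have the folder returned by get_local_copy(), is to look the file up by name inside it. The helper name find_in_dataset is hypothetical.

```python
from pathlib import Path


def find_in_dataset(dataset_folder, filename):
    """Return the first path named `filename` under the dataset's local copy."""
    # rglob searches the folder and all of its subfolders.
    matches = sorted(Path(dataset_folder).rglob(filename))
    if not matches:
        raise FileNotFoundError(f"{filename} not found under {dataset_folder}")
    return matches[0]
```

The returned path can then be passed to the script as an argument, and it stays valid on another machine or for another user because it is resolved at runtime.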

  
  
Posted 2 years ago