Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hello Everyone, I Have A Question Regarding Datasets. I Writing A Python Script Where It Takes As Inputs A Project Name And Returns All Datasets That Exist Within That Project. I Am Using

Hello everyone,

I have a question regarding datasets. I writing a python script where it takes as inputs a project name and returns all datasets that exist within that project. I am using Dataset.list_datasets . The issue I am running into is that this command does not give me the dataset version number that shows up in the UI. Is there a python command I can run that returns all the dataset names along with their versions ?

  
  
Posted 11 months ago
Votes Newest

Answers 8


Any chance @<1578918150261444608:profile|RoundJellyfish71> you can open a GitHub issue so that we can track it? (I think this is indeed a good idea)

  
  
Posted 10 months ago

Yes, will do

  
  
Posted 10 months ago

we would really need this in the CLI (e.g. in the clearml-data search)

  
  
Posted 10 months ago

Hi @<1547028031053238272:profile|MassiveGoldfish6>

The issue I am running into is that this command does not give me the dataset version number that shows up in the UI.

Oh no, I think you are correct, it will not return the version per dataset 😞 (I will make sure we add it)
But with the dataset ID you can grab all the properties:
Dataset.get(dataset_id="aabbcc").version
wdyt

  
  
Posted 11 months ago

Thank you for your response but I dont think that would solve the problem.
I'm imagining a case where all you know is the Project name and you want to pull the 2nd version out of 10 and you dont know it's id.

  
  
Posted 11 months ago

I have to assume that I do not know the dataset ID

  
  
Posted 11 months ago

I have to assume that I do not know the dataset ID

Sorry I mean:

datasets = Dataset.list_datasets(dataset_project="some_project") 
for d in datasets:
  d["version"] = Dataset.get(dataset_id=d["id"]).version

wdyt?

  
  
Posted 11 months ago

ohh, that is really clever!! I did not think about that! Thank you very much 😄

  
  
Posted 11 months ago