Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hello Everyone, I Have A Question Regarding Datasets. I Writing A Python Script Where It Takes As Inputs A Project Name And Returns All Datasets That Exist Within That Project. I Am Using

Hello everyone,

I have a question regarding datasets. I writing a python script where it takes as inputs a project name and returns all datasets that exist within that project. I am using Dataset.list_datasets . The issue I am running into is that this command does not give me the dataset version number that shows up in the UI. Is there a python command I can run that returns all the dataset names along with their versions ?

  
  
Posted one year ago
Votes Newest

Answers 8


Yes, will do

  
  
Posted one year ago

we would really need this in the CLI (e.g. in the clearml-data search)

  
  
Posted one year ago

I have to assume that I do not know the dataset ID

  
  
Posted one year ago

Thank you for your response but I dont think that would solve the problem.
I'm imagining a case where all you know is the Project name and you want to pull the 2nd version out of 10 and you dont know it's id.

  
  
Posted one year ago

ohh, that is really clever!! I did not think about that! Thank you very much 😄

  
  
Posted one year ago

I have to assume that I do not know the dataset ID

Sorry I mean:

datasets = Dataset.list_datasets(dataset_project="some_project") 
for d in datasets:
  d["version"] = Dataset.get(dataset_id=d["id"]).version

wdyt?

  
  
Posted one year ago

Any chance @<1578918150261444608:profile|RoundJellyfish71> you can open a GitHub issue so that we can track it? (I think this is indeed a good idea)

  
  
Posted one year ago

Hi @<1547028031053238272:profile|MassiveGoldfish6>

The issue I am running into is that this command does not give me the dataset version number that shows up in the UI.

Oh no, I think you are correct, it will not return the version per dataset 😞 (I will make sure we add it)
But with the dataset ID you can grab all the properties:
Dataset.get(dataset_id="aabbcc").version
wdyt

  
  
Posted one year ago