Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Do I Understand Correctly, That Running

Do I understand correctly, that running
Dataset.list_datasets(only_completed=False)should return all datasets from all projects?

  
  
Posted 3 years ago
Votes Newest

Answers 19


Basically, my problem is that it returns empty result. In the same code I can get dataset by its ID and I can get the task (which created the dataset) using
Task.get_tasks()(without mentioning th ID explicitly)

  
  
Posted 3 years ago

I create the dataset like this:
` project_name = "Sandbox"
task_name = "get_raw_data"
task = Task.init(project_name=project_name,
task_name=task_name,
task_type=Task.TaskTypes.data_processing,

             )

dataset = Dataset.create(use_current_task=True)

adding some files here

dataset.upload(verbose=True)

dataset.finalize(verbose=True) `

  
  
Posted 3 years ago

I do see the “Data Processing” type task in UI together with all other dataset-related features, like lineage plot

  
  
Posted 3 years ago

Hmmm are you saying the Dataset Tasks do not have the "dataset" system_tag as well as the type ?

  
  
Posted 3 years ago

Hi FiercePenguin76

should return all datasets from all projects?

Correct 🙂

  
  
Posted 3 years ago

I guess so, lemme double-check

  
  
Posted 3 years ago

system tags cannot be seen from UI. right?

  
  
Posted 3 years ago

Task.get_task(..).system_tags

  
  
Posted 3 years ago

true 😞

  
  
Posted 3 years ago

"system_tags": [ "development" ],

  
  
Posted 3 years ago

but I don’t get to this line, because my task is already of type data_processing

Ohh I see now, it should have added the Tag regardless, you are correct.

  
  
Posted 3 years ago

but I don’t get to this line, because my task is already of type data_processing

  
  
Posted 3 years ago

and the problem is with system_tags=[“dataset”]

  
  
Posted 3 years ago

when I go into Dataset.list_datasets with the debugger and remove system_tags=[“dataset”] from api call params - I get the correct response back

  
  
Posted 3 years ago

I think the reason is that the "original" task is already the right type. I'll make sure we fix it, and always set the system tag

  
  
Posted 3 years ago

also - line 77 which sets (non-system) tags is not invoked for me, thus if I define different tags for both task and dataset - then latter is being lost

  
  
Posted 3 years ago

I see that in the end, both query functions are calling Task._query_tasks

  
  
Posted 3 years ago

if the task is of wrong type (not data_processing) - then it’ll get both correct type and correct system tag

  
  
Posted 3 years ago