Answered
Hey I Hope Everyone Is Having A Good Day, Two Quick Questions About Datasets:

Hey I hope everyone is having a good day, two quick questions about datasets:
Does squashing two datasets delete the two original datasets? Is it possible to edit tags on a finalized dataset using the SDK?

  
  
Posted 2 years ago

Answers 7


Hi FierceHamster54 ,

Does squashing two datasets delete the two original datasets?

I don't think so. Should just create a new one.

Is it possible to edit tags using the SDK on a finalized dataset ?

I think so. I don't see a dedicated method for this in the Dataset module, but Datasets are basically tasks, so you can fetch the dataset task and then use Task.add_tags():
https://clear.ml/docs/latest/docs/references/sdk/task#add_tags
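
A minimal sketch of that suggestion, assuming the dataset ID can be passed to Task.get_task() as a task ID (which is what "Datasets are basically tasks" implies, but I haven't verified it on a finalized dataset). The add_dataset_tags helper and its get_task parameter are hypothetical names used for illustration, not part of the ClearML SDK:

```python
from typing import Callable, Iterable, Optional


def add_dataset_tags(dataset_id: str, tags: Iterable[str],
                     get_task: Optional[Callable] = None):
    """Add tags to the task that backs a dataset (hypothetical helper)."""
    if get_task is None:
        # Imported lazily so the helper can be tried without a ClearML server.
        from clearml import Task
        get_task = Task.get_task
    # Assumption: the dataset ID is also the ID of its underlying task.
    task = get_task(task_id=dataset_id)
    # add_tags() only appends; tags already on the task are kept.
    task.add_tags(list(tags))
    return task
```

Usage would be something like add_dataset_tags("<dataset id>", ["used_in_last_training"]), run locally rather than from a remote worker, given the "no effect" note in the add_tags() doc.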

  
  
Posted 2 years ago

"Old tags are not deleted. When executing a Task (experiment) remotely, this method has no effect."

This description in the add_tags() doc intrigues me tho, I would like to remove a tag from a dataset and add it to another version (eg: a used_in_last_training tag) and this method seems to only add new tags.

  
  
Posted 2 years ago

And additionally, does the "When executing a Task (experiment) remotely, this method has no effect" part mean that if it is executed on a remote worker inside a pipeline, without the dataset downloaded, the method will have no effect?

I think this means that add_tags() specifically will have no effect.

  
  
Posted 2 years ago

And additionally, does the "When executing a Task (experiment) remotely, this method has no effect" part mean that if it is executed on a remote worker inside a pipeline, without the dataset downloaded, the method will have no effect?

  
  
Posted 2 years ago

Okay, thank you for the explanations!

  
  
Posted 2 years ago

This description in the add_tags() doc intrigues me tho, I would like to remove a tag from a dataset and add it to another version (eg: a used_in_last_training tag) and this method seems to only add new tags.

I see. Then I think you would need to do this via the API:
https://clear.ml/docs/latest/docs/references/api/tasks#post-tasksupdate
or
https://clear.ml/docs/latest/docs/references/api/tasks#post-tasksupdate_batch
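
A rough sketch of moving a tag between two dataset versions through tasks.update, under a few assumptions I haven't verified: that the call replaces the task's whole tag list with the one you send, that it is accepted for finalized datasets, and that APIClient exposes it as client.tasks.update(task=..., tags=...). The move_tag and clearml_backend helpers are hypothetical names, not ClearML functions:

```python
from typing import Callable, List


def move_tag(tag: str, from_id: str, to_id: str,
             read_tags: Callable[[str], List[str]],
             write_tags: Callable[[str, List[str]], None]) -> None:
    """Remove `tag` from one task and add it to another (hypothetical helper)."""
    # Full tag list for the old version, minus the tag being moved.
    old_tags = [t for t in read_tags(from_id) if t != tag]
    # Full tag list for the new version, with the tag appended once.
    new_tags = list(read_tags(to_id))
    if tag not in new_tags:
        new_tags.append(tag)
    write_tags(from_id, old_tags)
    write_tags(to_id, new_tags)


def clearml_backend():
    """Build read/write callables on top of ClearML (assumed calls, see above)."""
    from clearml import Task
    from clearml.backend_api.session.client import APIClient

    client = APIClient()

    def read_tags(task_id: str) -> List[str]:
        return list(Task.get_task(task_id=task_id).get_tags() or [])

    def write_tags(task_id: str, tags: List[str]) -> None:
        # Assumption: tasks.update overwrites the tag list with `tags`.
        client.tasks.update(task=task_id, tags=tags)

    return read_tags, write_tags
```

With real IDs this would be called as move_tag("used_in_last_training", old_id, new_id, *clearml_backend()). Keeping the tag arithmetic separate from the server calls makes it easy to check the resulting lists before anything is written.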

  
  
Posted 2 years ago

I mean, if deleting tags on finalized datasets is possible in the GUI, it should be possible in the SDK too, but I don't see the method.

  
  
Posted 2 years ago