Answered

I know at least one other person has posted about this previously, but whenever I interact with Datasets I keep getting repeated log messages like this:

2023-04-06 13:22:41,649 - clearml - INFO - Dataset.get() did not specify alias. Dataset information will not be automatically logged in ClearML Server.

Except that I have set an alias. My dataset code, which lives inside a pipeline.component function, looks like:

from pathlib import Path
from tempfile import TemporaryDirectory

from clearml import Dataset

# parent_dataset_meta, train_size and stratify_by are defined earlier in the component

# fetch local copy of dataset
parent_dataset = Dataset.get(dataset_id=parent_dataset_meta["id"], alias="parent_dataset")
with TemporaryDirectory() as tmp_dir:
    # writable copy of the parent dataset inside the temporary directory
    local_dataset = Path(parent_dataset.get_mutable_local_copy(Path(tmp_dir)))
    new_dataset = Dataset.create(
        dataset_name=parent_dataset.name,
        dataset_project=parent_dataset.project,
        parent_datasets=[parent_dataset.id],
        dataset_tags=["train_val_data", "labels", "images", "intermediate"],
        description=f"train_size: {train_size}, stratify_by: {stratify_by}",
    )
    # ... do some things with the local copy of the data ...
    new_dataset.sync_folder(local_dataset)
    new_dataset.finalize(auto_upload=True)

What am I missing?

  
  
Posted one year ago

Answers 6


Hi @NuttyLobster9, what version of the clearml SDK are you using, and is it a self-hosted server or not?

  
  
Posted one year ago

I’m using SDK version 1.10.2 and yes, it’s self-hosted. Here is the version info for the server:

 WebApp: 1.9.1-312 • Server: 1.9.1-312 • API: 2.23 

Thanks!

  
  
Posted one year ago

Hi @NuttyLobster9, we are likely printing the warning by mistake. We will look into it soon and handle it properly.

  
  
Posted one year ago

Thanks very much! Yeah, it tends to fill up the console

  
  
Posted one year ago

It happens because of an internal use of Dataset.get; the larger the dataset, the more often the message is printed. We’ll fix this in an upcoming release.
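Until a fixed release is available, one possible stopgap is to filter the message out on the client side. The sketch below is not an official ClearML recommendation. It assumes the warning is emitted through Python's standard logging module on a logger named "clearml" (inferred from the "- clearml - INFO -" part of the log line above). The names DropAliasWarning and silence_alias_warning are hypothetical helpers made up for this illustration.

```python
import logging

# Fragment of the message quoted in the question above.
ALIAS_WARNING_FRAGMENT = "Dataset.get() did not specify alias"


class DropAliasWarning(logging.Filter):
    """Hypothetical helper: reject log records that contain the alias warning."""

    def filter(self, record: logging.LogRecord) -> bool:
        # Returning False drops the record; every other message passes through.
        return ALIAS_WARNING_FRAGMENT not in record.getMessage()


def silence_alias_warning(logger_name: str = "clearml") -> logging.Filter:
    """Attach the filter to the named logger ("clearml" is an assumption)."""
    alias_filter = DropAliasWarning()
    logging.getLogger(logger_name).addFilter(alias_filter)
    return alias_filter


# Call once, before the code that triggers Dataset.get().
alias_filter = silence_alias_warning()
```

A filter attached to a logger only sees records logged directly on that logger, so if the message comes from a differently named logger, pass that name to silence_alias_warning instead. To undo the change, call logging.getLogger("clearml").removeFilter(alias_filter).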

  
  
Posted one year ago

I ran into this issue as well. In my case the alias is defined in a task rather than in a pipeline, and I get the same message. I'm not hosting my own server; I'm using the free web service at the moment.

  
  
Posted one year ago