Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi. I Am Experimenting With

Hi. I am experimenting with clearml.Dataset and encountering an error.

LockException: [Errno 11] Resource temporarily unavailable

In my experiment, I make a chain of datasets where each new dataset adds new files to an existing (finalized parent) dataset.

Here is code I use to make the chain of datasets:
` from clearml import Dataset
import os
from pathlib import Path

def make_batch(i_batch, files_per_batch=10):
"""make a batch of mock data files in a folder"""
batch_path = Path(f"./batch_{i_batch}")
os.makedirs(batch_path, exist_ok=True)
for i in range(files_per_batch):
with open(batch_path / f"file_{i_batch}{i}.txt", "w") as fp:
fp.write(f"file
{i_batch}_{i} contents\n")
return batch_path

def make_dataset_chain(num_datasets=3):
"""make a set of chained datasets"""
project = "testing-datasets"
parent_datasets = None
for i_batch in range(num_datasets):
batch_path = make_batch(i_batch)

    new_dataset_name = f"chained_dataset_{i_batch}"
    the_dataset = Dataset.create(
        dataset_name=new_dataset_name,
        dataset_project=project,
        parent_datasets=parent_datasets,
    )
    the_dataset.add_files(batch_path)
    the_dataset.upload()
    the_dataset.finalize()
    parent_datasets = [the_dataset.id]

make_dataset_chain() If I then try to download dataset_1 or dataset_2 : the_dataset = Dataset.get(dataset_project="testing-datasets", dataset_name="dataset_1")
ds_path = the_dataset.get_local_copy() The get_local_copy() waits 300 seconds and then throws that error LockException `

If I try to get a local copy of dataset_0 :
the_dataset = Dataset.get(dataset_project="testing-datasets", dataset_name="dataset_1") ds_path = the_dataset.get_local_copy()The dataset_0 is downloaded without issue

Attaching an image of what dataset_2 looks like in the dataset viewer for clarity.

Am I doing something wrong? Is there some other way of making new datasets by adding to older (finalized) ones?

  
  
Posted 2 years ago
Votes Newest

Answers 7


TimelyPenguin76 , this turned out to be the reason I was having locking issues https://clearml.slack.com/archives/CTK20V944/p1658761943458649 :
SweetBadger76 , CostlyOstrich36 : I've attempted essentially the same thing before https://clearml.slack.com/archives/CTK20V944/p1657124102133519 and I thought it had worked in the past so I'm not sure why it is failing me now.

  
  
Posted 2 years ago

I'm on clearml==1.6.3rc1

  
  
Posted 2 years ago

Hi PanickyMoth78 ,

I've just recreated your example and it works for me on clearml==1.6.2 but indeed not on clearml==1.6.3rc1 which means we have some work to do before the full release 🙂 Can you try on clearml==1.6.2 to check that it does work there?

  
  
Posted 2 years ago

Also a big thank you for so thoroughly testing the system and providing this amount of feedback, it really does help us make the tool better for everyone! 😄

  
  
Posted 2 years ago

PanickyMoth78

LockException: [Errno 11] Resource temporarily unavailable

I'm not sure I understand how you got to this error (obviously creating datasets and getting them back works), what is unique in the setup/flow itself ?

  
  
Posted 2 years ago

AgitatedDove14 I was able to recreate the error. Simply by running Lavi's example on clearml==1.6.3rc1 in a fresh env. I don't know what is unique to the flow itself, but it does seem reproducible

  
  
Posted 2 years ago

Hi PanickyMoth78 we've just released an RC with a deadlock fix which might resolve your issue - 1.6.3.rc2 - please take a look and see if it works for you 🙂

  
  
Posted 2 years ago
998 Views
7 Answers
2 years ago
one year ago
Tags