Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi, I Wanted To Ask Whether Using Clearml

Hi,

I wanted to ask whether using ClearML StorageManager class downloads the whole dataset from the remote_url or uses incremental downloading?

I've created a dataset from an S3 bucket, but when I'm trying to train a YOLOV8 model I get:

TypeError: expected str, bytes or os.PathLike object, not NoneType

MWE:

from ultralytics import YOLO
from clearml import StorageManager

# Load the model
model = YOLO(model="yolov8n.pt")

# Get dataset from S3
data_path = StorageManager.get_local_copy(remote_url="
")
# Pass the dataset path to YOLO training
results = model.train(data=data_path, epochs=100, imgsz=640, batch=1, pretrained=True, classes=1)
  
  
Posted one month ago
Votes Newest

Answers 4


I don't understand what is "the DataOps feature" in this context.

My question is simple: is it possible to use clearml to train a YOLOV8 model without downloading the whole dataset. The example I provided is not required for answering, just for convenience and reproducibility of the issue.

  
  
Posted one month ago

The DataOps feature will abstract your usage of data - None

  
  
Posted one month ago

Can you please explain?

  
  
Posted one month ago

Hi @<1750327614469312512:profile|CrabbyParrot75> , why use the StorageManager module and not the Datasets to manage your data?

  
  
Posted one month ago
94 Views
4 Answers
one month ago
one month ago
Tags