Hi, AnxiousSeal95 suppose we have uploaded a csv file, and want to get the csv file to do some pre processing tasks, then is there a way to get that csv file to that script to do further tasks on the dataset like we normally do?
Hi GrittyHawk31 , you can use Dataset.get(). If you're using a file you can call Dataset.get_local_copy() to download it.
You can check https://clear.ml/docs/latest/docs/clearml_data/data_management_examples/data_man_python#data-ingestion documentation out or an https://github.com/allegroai/clearml/blob/master/examples/datasets/data_ingestion.py that uses it
What is the format of data we get from using dataset.get() AnxiousSeal95 , I just wanted to visualize the columns of that csv file and plot some details. Can I do that directly from the output from Dataset.get() ?
Hi,
You may want to consider to do the visualizing while creating the Datasets - see https://github.com/thepycoder/asteroid_example/blob/main/get_data.py#L34 logging the head()
of the dataframe
Hi GrittyHawk31 , maybe I'm missing something, but what stops you from using Dataset.get() in the preprocessing script? Is there a limitation on it?