
Yes, it's a single file from the local machine to a remote one.
Oh okay, in that case I can use the output_url param in the upload files method!
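(For context, a minimal sketch of that flow, assuming "upload files method" refers to Dataset.upload() and its output_url parameter; the dataset name, project, file path, and bucket below are placeholders:)

```python
from clearml import Dataset

# Create a dataset version and register the single local file
ds = Dataset.create(dataset_name="my-dataset", dataset_project="examples")
ds.add_files(path="/path/to/local/file.csv")

# output_url points the uploaded content at your own object storage
# instead of the default ClearML file server
ds.upload(output_url="s3://my-bucket/datasets")
ds.finalize()
```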
CostlyOstrich36 could you explain the workflow with the MinIO server setup you just described? Should the data stored in MinIO be treated as a local folder, and then maybe I can use this method? https://clear.ml/docs/latest/docs/clearml_data/clearml_data_sdk#syncing-local-storage
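(A sketch of how that workflow might look, assuming MinIO is reachable as S3-compatible storage configured in clearml.conf; the endpoint, bucket, and folder paths are placeholders:)

```python
from clearml import Dataset

# MinIO is addressed like S3; the endpoint and credentials are assumed
# to be set in clearml.conf under sdk.aws.s3.credentials
ds = Dataset.create(dataset_name="my-dataset", dataset_project="examples")

# Treat a local working folder as the source of truth and mirror it
# into the dataset version (the "syncing local storage" method linked above)
ds.sync_folder(local_path="/data/working_copy")

# Store the dataset contents on the MinIO bucket instead of the file server
ds.upload(output_url="s3://my-minio:9000/datasets-bucket")
ds.finalize()
```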
CostlyOstrich36 Thanks for the comment. Is there an issue already open that tracks the status?
There should be a method called read_remote_copy(str: dataset_id, str: dataset_tag, bool: mutable)
and it should return the path of the remote data.
Feature request for this: https://clearml.slack.com/archives/CTK20V944/p1629407988075800?thread_ts=1629373886.064600&cid=CTK20V944
and this path should follow a Linux folder structure, not be a single file like the current .zip.
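(To make the request concrete, a hypothetical usage sketch; read_remote_copy() does not exist in the ClearML SDK, and the IDs/tags below are placeholders:)

```python
from clearml import Dataset

# Proposed API (hypothetical): return a remote path to the dataset's
# original folder structure, without downloading or zipping it
remote_path = Dataset.read_remote_copy(
    dataset_id="abc123",    # placeholder dataset ID
    dataset_tag="v1.0",     # placeholder tag
    mutable=False,          # non-mutable, read-only access
)

# remote_path would behave like a normal Linux directory tree,
# not a single .zip archive as get_local_copy() returns today
```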
Thanks. Let me try it and get back to you.
TimelyPenguin76 Could you please give more clarification about the process? I cannot find this in the docs. How do I create a parent-child Dataset with the same dataset_id and only access the child?
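(For reference, a sketch of how parent-child dataset versions are typically created with the SDK, assuming the parent_datasets argument of Dataset.create(); note that each version gets its own dataset_id rather than sharing one. Names and IDs are placeholders:)

```python
from clearml import Dataset

# Child version inheriting all files from an existing parent dataset
child = Dataset.create(
    dataset_name="my-dataset",
    dataset_project="examples",
    parent_datasets=["parent_dataset_id"],  # placeholder parent ID
)
child.add_files(path="/data/new_samples")  # only the delta is stored
child.upload()
child.finalize()

# Later, fetch just the child version by its own ID
ds = Dataset.get(dataset_id=child.id)
```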
Thank you for clarifying the parent-child thing. When I say accessing, I mean I want to use the data for training (without actually getting a local copy of it). The whole dataset (both large and small) could be created and uploaded by an admin. As a researcher, I normally work with a smaller dataset, similar to what SucculentBeetle7 has stated. You should also note that this whole training happens on a remote server. So this situation applies: https://clear.ml/docs/latest/docs/getting_started/...
This get_local_copy()
method is only useful for applications whose datasets are in the range of < 10 GB and where the training machine is the same as the dev machine. For most of us (researchers) that's not the case; we share GPU time, and this is where ClearML comes in.
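(For reference, the flow being discussed, roughly; the dataset ID is a placeholder:)

```python
from clearml import Dataset

# Downloads and extracts the full dataset archive to a local cache folder;
# fine for small datasets on a dev machine, costly for large remote ones
ds = Dataset.get(dataset_id="dataset_id_here")
local_path = ds.get_local_copy()  # read-only cached copy
print(local_path)
```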
Requirements: the large dataset should exist as only a single copy, preserving the original folder structure, and is presumed to be available remotely; non-mutable access should be provided via dataset_id. This solves everything, or at leas...
Yes, a structure similar to a shared folder should be the optimal solution. But I don't understand what you mean by "warm"!
shared "warm" folder without having to download the dataset locally.
Let's say that this small dataset has an ID and I can use the get_local_copy()
method to cache it locally, and then I can use the remote servers to train on it. But I would like to have the same flow without downloading the full dataset, which is stored remotely.
So for this, should I create a proper issue on GitHub? Or is this being picked up internally, AgitatedDove14?
Anyone who is using a small dataset can afford to go with the get_local_copy() method.
Because this would again cause the problems I asked about yesterday. Are there any ways to access the parent dataset (assuming it's large and I don't want to download it) without using get_local_copy(),
as that would solve a lot of problems? If so, where can I find them in the docs?
Seems to work, thanks. But it's not as handy as the .get_local_copy() method. I will try to raise a feature request, since this again returns a .zip path. I would like to receive a local path which is easily parsable, like the method described above.
Yes, but for the dataset located on the server, so that I can parse it like a normal local copy.