Hi, All I Have Some Issues Uploading A Big (100Gb) Dataset To Self-Hosted Clearml Server. Is There Any Tricks I Should Be Aware When Launching The Server? Maybe Configuring Timeout Or Giving More Resources? Right Now The Upload Freezes And In The Web-Int

Answered

Hi, all

I have some issues uploading a big (100gb) dataset to self-hosted clearml server. Is there any tricks I should be aware when launching the server? Maybe configuring timeout or giving more resources? Right now the upload freezes and in the web-interface the dataset is marked as Aborted with status message in info tab Forced stop (non-responsive) . I created issue on github . before thinking of slack. Let me know if I should delete it.

Kind regards

  				
Posted 
	one year ago

					More
				  		
  Report
		
					StaleElk72
				
					0
					 × 1

Votes Newest

Answers 6

its a directory (sha generation step actually successfull:

Generating SHA2 hash for 1136604 files

as in github issue). given previous experience, i would expect it to be uploaded as multiple zip files.

yes, I dont use s3. i have a dedicated machine with raid configured, were clearml server is running.

  				
Posted 
	one year ago

					More
				  		
  Report
		
					StaleElk72
				
					0
					 × 1

I end up using dvc for the dataset management. It doesnt have fancy UI, but works flawlessly with large datasets

  				
Posted 
	one year ago

					More
				  		
  Report
		
					StaleElk72
				
					0
					 × 1

I am not sure about that. I have another dataset of similar structure which is smaller (40gb) and which succeeded to be uploaded. Seems like the how it works - first it computes sha for all the files, but during uploading - aggregates small files in to zip archives approx 512 mb each.

  				
Posted 
	one year ago

					More
				  		
  Report
		
					StaleElk72
				
					0
					 × 1

Hi @<1547390422483996672:profile|StaleElk72> , are you getting an error at any point? This is indeed a large file, and I assume you're uploading it to eh ClearML fileserver, and not to some object storage like S3?

  				
Posted 
	one year ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

  				
Posted 
	one year ago

					More
				  		
  Report
		
					StaleElk72
				
					0
					 × 1

In that case I assume this is just a series of a lot of small (?) uploads which take a lot of time

  				
Posted 
	one year ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

Write your answer

1K Views

6 Answers

one year ago