Hi, What Would Be The Best Way To Save A Pandas.Dataframe As An

Answered

Hi,
What would be the best way to save a pandas.DataFrame as an https://allegro.ai/clearml/docs/rst/references/clearml_python_ref/task_module/task_task.html?highlight=upload_artifact#clearml.task.Task.upload_artifact in a parquet format ?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					OutrageousSheep60
				
					0
					 × 1

Votes Newest

Answers 8

TimelyPenguin76 - I'm I mistaken?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					OutrageousSheep60
				
					0
					 × 1

Well - that will convert it to a binary pickle format but not as parquet -
since the artifact will be accessed from other platforms we want to use parquet

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					OutrageousSheep60
				
					0
					 × 1

Hi OutrageousSheep60 , can you try with auto_pickle=True when uploading the artifact?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					TimelyPenguin76
				
					0
					 Administrator

CostlyOstrich36 - but we will use any method that will allow us to save the files as parquet.
We are not yet using clearml Dataset - i'm not sure if this is a solution

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					OutrageousSheep60
				
					0
					 × 1

Using the https://allegro.ai/clearml/docs/rst/references/clearml_python_ref/task_module/task_task.html?highlight=upload_artifact#clearml.task.Task.upload_artifact method. It works well, but only saves it as a csv (which is very problematic since when loading the artifact none of the data types of the columns are preserved...)

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					OutrageousSheep60
				
					0
					 × 1

How do you currently save artifacts now?

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					CostlyOstrich36
				
					0

Thx CostlyOstrich36 for your reply
Can't see the reverence to parquet . we are currently using the above functionality , but the pd.DataFrame is only saved as csv compressed by gz

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					OutrageousSheep60
				
					0
					 × 1

Hi!
I think the example here should help you.
https://github.com/allegroai/clearml/blob/master/examples/reporting/pandas_reporting.py#L19
Together with this
https://github.com/allegroai/clearml/blob/master/examples/reporting/artifacts.py
Tell me if it helped 🙂

  				
Posted 
	2 years ago

					More
				  		
  Report
		
					CostlyOstrich36
				
					0

Write your answer

1K Views

8 Answers

2 years ago

one year ago