Hi. I'M Running This Little Pipeline:

Unanswered

Hi there,

PanickyMoth78
I am having the same issue.
Some steps of the pipeline create huge datasets (some GBs) that I don’t want to upload or save.
Wrap the returns in a dict could be a solution, but honestly, I don’t like it.

AgitatedDove14 Is there any better way to avoid the upload of some artifacts of pipeline steps?

The image above shows an example of the first step of a training pipeline, that queries data from a feature store.
It gets the DataFrame, zip and upload it (this one is very small, but in practice they are really big)
How to avoid this?

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					IrritableGiraffe81
				
					0
					 × 1

332 Views

0 Answers

3 years ago

2 years ago