You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TL;DR
This post will expose you how to convert in a very convenient and fast way 🚀 some Apache Parquet
files to CSV, and vice-versa, using either DuckDB 🦆 or Pandas 🐍 for a baseline comparison
As a quick bonus, we will embedded this tool in a small convient CLI script, easily triggered from your favorite
shell 👨💻
Let’s go !
Intro
Recently, I’ve been working a little bit more on Data Engineering tasks (setup a Datalake, convert data,
design pipelines, make cleanup of some data). 📊
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Lightening fast, Parquet to CSV
TL;DR
This post will expose you how to convert in a very convenient and fast way 🚀 some Apache Parquet
files to CSV, and vice-versa, using either DuckDB 🦆 or Pandas 🐍 for a baseline comparison
As a quick bonus, we will embedded this tool in a small convient CLI script, easily triggered from your favorite
shell 👨💻
Let’s go !
Intro
Recently, I’ve been working a little bit more on Data Engineering tasks (setup a Datalake, convert data,
design pipelines, make cleanup of some data). 📊
https://emilien-foissotte.github.io/posts/2023/08/fast-convert/
Beta Was this translation helpful? Give feedback.
All reactions