The most insightful stories about Parquet - Medium

Parquet

Topic · 47 Followers · 476 Stories

Recommended stories

Alejandro Perez

Parquet Files Everywhere

For many analysts working with big data, the Parquet file format has become the gold standard. It is ubiquitous in our workflows — our Data…

3d ago

In Towards Data Science by Christopher Ariza

Faster DataFrame Serialization

Read and Write DataFrames Up to Ten Times Faster than Parquet with StaticFrame NPZ

Feb 3

In Towards Data Science by Mike Clayton

Saving Pandas DataFrames Efficiently and Quickly — Parquet vs Feather vs ORC vs CSV

Speed, RAM, size and convenience. Which storage method is best?

Nov 27

Edilson Athayde Junior

Data Transformation: From Bronze to Silver Layer in AdventureWorks

The Silver layer is where data starts to gain a more refined structure and added value. At this stage, we move from the initial…

5d ago

In Towards Data Science by Nikola Ilic

Parquet File Format: Everything You Need to Know

New data flavors require new ways of storing them. Learn all you need to know about the Parquet file format

Jul 18

Najma Bader

Creating Tables with Cumulative Design

My notes from Zach Wilson’s Free YouTube BootCamp: how cumulative tables can save you!

5d ago

pratik domadiya

A comparative analysis: Parquet Vs. Delta File…

Hello Folks,

Jan 27

Ritam Mukherjee

Zstd vs Snappy vs Gzip: The Compression King for Parquet Has Arrived

For years, Snappy has been the go-to choice, but its dominance is being challenged

Dec 7