ArliinTowards Data ScienceParquet Best Practices: Discover your Data without loading itMetadata, Statistics on Row Groups, Partitions discovery, and RepartitioningJan 3, 20232Jan 3, 20232
Russell JurneyinData Syndrome BlogPython and Parquet PerformanceIn Pandas, PyArrow, fastparquet, AWS Data Wrangler, PySpark and DaskNov 1, 20204Nov 1, 20204
Thomas SpicerinOpenbridgeApache Parquet: How to be a hero with the open-source columnar data formatApache Parquet file format for Google BigQuery, Azure Data Lakes, Amazon Athena, and Redshift Spectrum.Jun 14, 20176Jun 14, 20176