Whispering Data
Scaling AWS Redshift Concurrency with PostgreSQL
The most efficient way to move data between an analytics warehouse and an OLTP data store!
Paul Singman
Feb 10, 2023
The State of Data Engineering 2022
All the latest tools and trends in data engineering.
Paul Singman
Jun 27, 2022
5 Tips For a Tidy Data Warehouse
Spark joy in your data warehouse by following these data modeling best practices!
Paul Singman
Jun 12, 2022
Towards Effective DataOps
Gain the confidence to mess with your data without making a mess of your data.
Paul Singman
May 6, 2022
Building a Personal Data Stack to Alert on Crypto Price Fluctuations — Trying Out Hex and Delta…
If you’re like me, you bought your first cryptocurrency in the past year or so, right when it stopped going up in price and making random…
Paul Singman
Mar 21, 2022
Level Up Your Data Lake
Take your data lake game to new heights with these two architecture improvements.
Paul Singman
Feb 22, 2022
How Easy It Is to Re-use Old Pandas Code in Spark 3.2?
In October, it was announced that the Pandas API was being integrated with Spark. This is particularly exciting news for a Pandas-baby like…
Paul Singman
Feb 7, 2022
The Everything Bagel II: Versioned Data Lake Tables with lakeFS and Trino
Let’s put the bagel to use by querying branched lakeFS data from Trino’s distributed engine.
Paul Singman
Jan 29, 2022
The Guide to Data Versioning
Already familiar with versioning code with git? A look at how it works to version data using the same abstractions.
Paul Singman
Dec 13, 2021
Thoughts on the Future of the Databricks Ecosystem
Databricks has come a long way since growing out of a Berkeley Lab in 2013 with an open-source distributed computing framework called…
Paul Singman
Oct 21, 2021
The Docker Everything Bagel™ — Spin Up A Local Data Stack
Use docker compose to create local replicas of a modern data stack with one command.
Paul Singman
Oct 11, 2021
Hive Metastore — It Didn’t Age Well
Second in a series about Hive Metastore. In the last post, Einat covered its history, the problems that it solves — and questioned whether…
Paul Singman
Sep 21, 2021
Hive Metastore — Why It’s Still Here and What Can Replace it?
A majority of data architectures still feature Hive Metastore. Why has it survived and what can finally replace it in the future?
Paul Singman
Sep 6, 2021
The 4 Phases of Data Lifecycle Management
Useful datasets are created through a process that involves several predictable steps. Learn how to organize your workflows in this…
Paul Singman
Aug 23, 2021
Guarantee Consistency in Your Delta Lake Table(s)
Learn how to integrate lakeFS hooks to validate data on commits.
Paul Singman
Aug 15, 2021