Homepage
Open in app
Sign inGet started

Whispering Data

  • ARCHIVE
  • ABOUT
  • Newsletter
  • Scaling AWS Redshift Concurrency with PostgreSQL

    Scaling AWS Redshift Concurrency with PostgreSQL

    The most efficient way to move data between an analytics warehouse and an OLTP data store!
    Go to the profile of Paul Singman
    Paul Singman
    Feb 10, 2023
    The State of Data Engineering 2022

    The State of Data Engineering 2022

    All the latest tools and trends in data engineering.
    Go to the profile of Paul Singman
    Paul Singman
    Jun 27, 2022
    5 Tips For a Tidy Data Warehouse

    5 Tips For a Tidy Data Warehouse

    Spark joy in your data warehouse by following these data modeling best practices!
    Go to the profile of Paul Singman
    Paul Singman
    Jun 12, 2022
    Towards Effective DataOps

    Towards Effective DataOps

    Gain the confidence to mess with your data without making a mess of your data.
    Go to the profile of Paul Singman
    Paul Singman
    May 6, 2022
    * To receive the latest posts *
    Building a Personal Data Stack to Alert on Crypto Price Fluctuations — Trying Out Hex and Delta…

    Building a Personal Data Stack to Alert on Crypto Price Fluctuations — Trying Out Hex and Delta…

    If you’re like me, you bought your first cryptocurrency in the past year or so, right when it stopped going up in price and making random…
    Go to the profile of Paul Singman
    Paul Singman
    Mar 21, 2022
    Level Up Your Data Lake

    Level Up Your Data Lake

    Take your data lake game to new heights with these two architecture improvements.
    Go to the profile of Paul Singman
    Paul Singman
    Feb 22, 2022
    How Easy It Is to Re-use Old Pandas Code in Spark 3.2?

    How Easy It Is to Re-use Old Pandas Code in Spark 3.2?

    In October, it was announced that the Pandas API was being integrated with Spark. This is particularly exciting news for a Pandas-baby like…
    Go to the profile of Paul Singman
    Paul Singman
    Feb 7, 2022
    The Everything Bagel II: Versioned Data Lake Tables with lakeFS and Trino

    The Everything Bagel II: Versioned Data Lake Tables with lakeFS and Trino

    Let’s put the bagel to use by querying branched lakeFS data from Trino’s distributed engine.
    Go to the profile of Paul Singman
    Paul Singman
    Jan 29, 2022
    The Guide to Data Versioning

    The Guide to Data Versioning

    Already familiar with versioning code with git? A look at how it works to version data using the same abstractions.
    Go to the profile of Paul Singman
    Paul Singman
    Dec 13, 2021
    Thoughts on the Future of the Databricks Ecosystem

    Thoughts on the Future of the Databricks Ecosystem

    Databricks has come a long way since growing out of a Berkeley Lab in 2013 with an open-source distributed computing framework called…
    Go to the profile of Paul Singman
    Paul Singman
    Oct 21, 2021
    The Docker Everything Bagel™ — Spin Up A Local Data Stack

    The Docker Everything Bagel™ — Spin Up A Local Data Stack

    Use docker compose to create local replicas of a modern data stack with one command.
    Go to the profile of Paul Singman
    Paul Singman
    Oct 11, 2021
    Hive Metastore — It Didn’t Age Well

    Hive Metastore — It Didn’t Age Well

    Second in a series about Hive Metastore. In the last post, Einat covered its history, the problems that it solves — and questioned whether…
    Go to the profile of Paul Singman
    Paul Singman
    Sep 21, 2021
    Hive Metastore — Why It’s Still Here and What Can Replace it?

    Hive Metastore — Why It’s Still Here and What Can Replace it?

    A majority of data architectures still feature Hive Metastore. Why has it survived and what can finally replace it in the future?
    Go to the profile of Paul Singman
    Paul Singman
    Sep 6, 2021
    The 4 Phases of Data Lifecycle Management

    The 4 Phases of Data Lifecycle Management

    Useful datasets are created through a process that involves several predictable steps. Learn how to organize your workflows in this…
    Go to the profile of Paul Singman
    Paul Singman
    Aug 23, 2021
    Guarantee Consistency in Your Delta Lake Table(s)

    Guarantee Consistency in Your Delta Lake Table(s)

    Learn how to integrate lakeFS hooks to validate data on commits.
    Go to the profile of Paul Singman
    Paul Singman
    Aug 15, 2021
    About Whispering DataLatest StoriesArchiveAbout MediumTermsPrivacyTeams