GauravIntegrating Reddit Data with Snowflake Using Docker, Airflow, and AWSThis project showcases an end-to-end pipeline that extracts data from Reddit, processes it using Python, and seamlessly integrates it into…11h ago
💡Mike ShakhomirovinTowards Data ScienceThe Top 10 Data Lifecycle Problems that Data Engineering SolvesClear strategies for addressing key pain pointsAug 22
Vu TrinhinData Engineer ThingsHow does Notion handle 200 billion data entities?From PostgreSQL → Data Lake4d ago4d ago
Dirk SteynbergSetting Up HDFS on Raspberry Pi: A Fun Home Project Adventure!Ever wondered what to do with those Raspberry Pis gathering dust in your drawer? Well, buckle up, because we’re about to turn them into a…22h ago22h ago
Vu TrinhinData Engineer ThingsApache Kafka — OverviewThe terminology and the architecture.Jul 64Jul 64
GauravIntegrating Reddit Data with Snowflake Using Docker, Airflow, and AWSThis project showcases an end-to-end pipeline that extracts data from Reddit, processes it using Python, and seamlessly integrates it into…11h ago
💡Mike ShakhomirovinTowards Data ScienceThe Top 10 Data Lifecycle Problems that Data Engineering SolvesClear strategies for addressing key pain pointsAug 22
Vu TrinhinData Engineer ThingsHow does Notion handle 200 billion data entities?From PostgreSQL → Data Lake4d ago
Dirk SteynbergSetting Up HDFS on Raspberry Pi: A Fun Home Project Adventure!Ever wondered what to do with those Raspberry Pis gathering dust in your drawer? Well, buckle up, because we’re about to turn them into a…22h ago
Vu TrinhinData Engineer ThingsHow Twitter processes 4 billion events in real-time dailyFrom Lambda to KappaMay 255
Henri-Joseph BASSAMastering SQL: Advanced SELECT tips for senior data engineers (All examples are in French)16h ago