Plumbers Of Data Science

Arockia Nirmal Amala Doss

Fortifying Your Database Migration: A Journey Towards Secure Transitions!

As a database developer/architect, one of my most critical challenges is ensuring data security during database migrations. These…

Mar 21

Taranjit Kaur

Inner Join and Intersect: Bridging Data in SQL

Exploring INNER JOIN and INTERSECT in SQL

Oct 27, 2023

Mete Can Akar

Introduction to Creating Unit Tests for PySpark Applications Using unittest and pytest Libraries

TL;DR: Software testing, and in particular, unit testing, is a crucial step in modern Data Engineering. Pytest and unittest are great tools…

Oct 22, 2023

Garvit Arya

Building Robust Data Pipelines with Apache Airflow

Applications of Apache Airflow

Oct 16, 2023

Chandrashekar M

Backfilling Data in Big Data: Uncovering the Depths of Data Consistency and Impact

In the dynamic realm of Big Data, ensuring the accuracy and completeness of your datasets is an ongoing challenge. Backfilling data is…

Oct 13, 2023

Ayşegül Yiğit

Database Storage Types

When considering the functions of an Enterprise Data Warehouse (EDW), there is always room for debate regarding how it should be…

Oct 13, 2023

Andrei Tserakhau

CDC from zero to hero

How to master cross-system data transfer with CDC

Oct 13, 2023

Ayşegül Yiğit

📚 What is a Data Warehouse?

A Data Warehouse (DW) is the process of collecting and managing data from various sources to provide meaningful insights about a business…

Sep 27, 2023

Ravish Kumar

Drowning in Data: Why more data doesn’t equal more value

Sometimes quality is better than quantity!

Sep 27, 2023

Vivek Chaudhary

Spark AQE- Dynamic Coalescing

This article aims to explain Adaptive Query Execution (AQE), a feature introduced in Spark 3.0 to enhance Spark…

Sep 27, 2023

Plumbers Of Data Science

The Data Engineering Community: we publish your Data Engineering stories

Editors

Andreas Kretz

Data Engineer and Plumber of Data Science. I write about the platform architecture, tools, and techniques used to build modern data science platforms.

Kristijan Bakaric

Manuela Kretz
