Sitemap
TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Follow publication

Member-only story

Implement Behaviour Driven Development in data pipelines using Mage

7 min readJul 6, 2023

--

Photo by Nick Fewings on Unsplash

In my previous articles, I talked a lot about the importance of testing in data pipelines and how to create data tests and unit tests respectively. While testing plays an essential role, it may not always be the most exciting part of the development cycle. As a result, many modern data stacks have introduced frameworks or plugins to expedite the implementation of data tests. In addition, unit testing frameworks in Python such as Pytest, unittest have been there for a long time, helping engineers efficiently create unit tests for data pipelines and any Python applications.

In this article, I want to introduce a setup that uses two modern techniques: Behaviour Driven Development (BDD) — a business-oriented testing framework, and Mage — a modern data pipeline tool. By combining these two techniques, the objective is to create high-quality unit tests for data pipelines while having a seamless developer experience.

What is Behaviour Driven Development (BDD)?

When building data pipelines for business, it’s highly likely that we will encounter complicated and tricky business logic. One example is to define customer segmentation based…

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Xiaoxu Gao
Xiaoxu Gao

Written by Xiaoxu Gao

I’m a Developer with a focus on Python and Data Engineering. I write stuff to talk to myself and the world. You can find me on linkedin.com/in/xiaoxugao/.