What is a Data Angiogram ? How can it help with Data pipeline sanity & Data Quality

Vishnu Rao
2 min readSep 10, 2024
Data Angiogram

I recently had the honour to present at Smartdata 2024 on an idea i came up, related to data quality.

Simple put: How can we test our data pipelines as part of our Continuous integration (CI) ?

What is an Angiogram ?

An angiogram is a medical procedure which one can undergo to find blocks in ones arteries. A block in the artery can lead to a heart attack.

Taking Inspiration from Angiograms, I came up with a Data Angiogram

A data angiogram is a simple technique to assess data pipeline quality and sanity.

I will let the slides take over from here…

A demonstration of the data angiogram for a taxi-ride based data pipeline -mentioned in the slides, is available in the github repo :

github.com/jaihind213/data-angiogram

Thank you and feel free to reach out to me if you have doubts.

Ps: I am the Chief Inspiration officer at Bytespire.io

happy to help you with your Cost Based Data Engineering needs.

feel to reach out to me at [LinkedIn] or [Twitter] or [X]

Book a free consultation

--

--

Vishnu Rao

Chief Inspiration Officer at bytespire.io, Database Enthusiast, fellow Programmer, was the Database expert@ flipkart.com . Currently at @ cuezen.com