Data Engineer Interview Series: Data Modeling - Part 1

Anju Mercian
WWCode Python
Published in
8 min readAug 1, 2023

--

Well-constructed data architectures reflect the goals and the business logic of the organization that relies on the data.

Photo by Christina @ wocintechchat.com on Unsplash

Data modeling involves deliberatively choosing a coherent structure for data and is a critical step to make data useful for the business.

The rise of data lakes and no-sql allowed engineers to bypass the modeling process thus creating data swamps of redundant, mismatched or wrong data. Thus, data modeling has become an important step for data engineers to architect before proceeding into coding the solution. And has thus become important part of the data engineering interviews.

The monumental data being collected becomes useless if the data is not read and architected properly.

Flow of the article

The goal of this article is to provide a cheatsheet to those preparing for a data engineering interview.

Side Note: This is a cheatsheet I wrote when I was preparing for my interviews.

In this article, I will go over all the definitions one needs to be aware of while preparing for a data engineering interview. In the second article I will go over the theory of the data modeling concepts and will end the article with a data modeling question, the solution to the question and

--

--

Anju Mercian
WWCode Python

ML and Data Enthusiast|Writing about my experience with learning ML/Data