What is Data Modelling?

Nilay Chauhan
Published in Data Stash
3 min read · Jul 20, 2020

What is a Data Model?

A Data Model is an abstract model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities. For instance, a data model may specify that the data element representing a car be composed of a number of other elements which, in turn, represent the colour and size of the car and define its owner. — Wikipedia

The structure of the data is explicitly determined by the data model.
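As a rough illustration of the car example in the definition above, here is a minimal Python sketch. The class names, field names, and sample values are assumptions made up for this example; they are not part of the Wikipedia definition.

```python
from dataclasses import dataclass


@dataclass
class Owner:
    # Hypothetical element describing who owns the car
    name: str


@dataclass
class Car:
    # The "car" data element is composed of other elements
    colour: str
    size: str
    owner: Owner


# Sample values are invented for illustration only
car = Car(colour="red", size="compact", owner=Owner(name="Asha"))
print(car.owner.name)
```

The point is only that the model, not the individual record, decides which elements a car has and how they relate to each other.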

The main use of a data model is as a communication tool between business people and technical people.

Types of Data Models:

There are three types of data models: the conceptual model, the logical model, and the physical model.

Conceptual model:

The conceptual model is concerned with the concepts behind the database. It defines WHAT the system is about. The main purpose of this model is to define the concepts and rules.

Logical model:

The logical model defines the structure of the database, typically as tables, their columns, and the relationships between them. It shows HOW the system should be implemented, regardless of the database system. The main purpose of this model is to develop a technical map of rules and data structures.
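One possible way to write down a logical model without committing to any database system is with plain Python dataclasses. This is only a sketch: the Customer and Order entities, their fields, and the sample values are hypothetical and chosen for an imaginary e-commerce application.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Customer:
    customer_id: int  # primary key
    name: str
    email: str


@dataclass
class Order:
    order_id: int     # primary key
    customer_id: int  # foreign key, refers to Customer.customer_id
    order_date: date
    total: float


# One customer can have many orders; each order belongs to one customer
customer = Customer(customer_id=1, name="Asha", email="asha@example.com")
order = Order(order_id=100, customer_id=customer.customer_id,
              order_date=date(2020, 7, 20), total=49.99)
```

Nothing here says which database will be used, how columns are typed on disk, or which indexes exist. Those decisions belong to the physical model.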

Physical model:

The physical model shows HOW the actual data will be stored in a specific database. The main purpose of this model is the actual implementation of the database.
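A physical model could look something like the sketch below, which uses Python's built-in sqlite3 module with an in-memory database. SQLite is chosen only because it needs no setup; the table names, column types, and index are assumptions for a hypothetical e-commerce application.

```python
import sqlite3

# In-memory SQLite database, used here only for illustration
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")

# Physical details appear here: concrete column types, constraints, an index
conn.executescript("""
CREATE TABLE customer (
    customer_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL,
    email       TEXT NOT NULL UNIQUE
);
CREATE TABLE customer_order (
    order_id    INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customer(customer_id),
    order_date  TEXT NOT NULL,
    total       REAL NOT NULL
);
CREATE INDEX idx_order_customer ON customer_order(customer_id);
""")

conn.execute("INSERT INTO customer VALUES (1, 'Asha', 'asha@example.com')")
conn.execute("INSERT INTO customer_order VALUES (100, 1, '2020-07-20', 49.99)")

# List the tables that now physically exist in the database
tables = [row[0] for row in conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]
print(tables)
```

A different database system would need different types and syntax for the same design, which is why this level is kept separate from the other two.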

What is Data Modelling?

Data Modelling is the process of organizing data in a database management system so that the data is persisted and easily usable by you and your organization. Its purpose is to support business and user applications.

You can think of this process as designing the data and handing it to machine learning engineers, business analysts, data scientists, and others so they can use it for further analysis and predictions.

Important points one should keep in mind while doing Data Modelling:

Data Organization:

Most important of all, the data should be organized properly for your application.

Use Case:

Always keep in mind what you will be using the data model for. If you don't, you end up making simple things complicated, such as needing multiple joins just to get a customer's email ID in an e-commerce database.
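To make the email example concrete, here is a small sketch using Python's built-in sqlite3 module. Both designs, along with every table and column name, are hypothetical and exist only to contrast a model that ignores the use case with one that was designed around it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Design A: the email is spread across extra tables
CREATE TABLE customer_a (customer_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE contact (contact_id INTEGER PRIMARY KEY, customer_id INTEGER);
CREATE TABLE email (email_id INTEGER PRIMARY KEY, contact_id INTEGER, address TEXT);

-- Design B: the email lives directly on the customer row
CREATE TABLE customer_b (customer_id INTEGER PRIMARY KEY, name TEXT, email TEXT);

INSERT INTO customer_a VALUES (1, 'Asha');
INSERT INTO contact VALUES (10, 1);
INSERT INTO email VALUES (100, 10, 'asha@example.com');
INSERT INTO customer_b VALUES (1, 'Asha', 'asha@example.com');
""")

# Design A needs two joins to answer a very common question
email_a = conn.execute("""
    SELECT e.address
    FROM customer_a AS c
    JOIN contact AS ct ON ct.customer_id = c.customer_id
    JOIN email AS e ON e.contact_id = ct.contact_id
    WHERE c.customer_id = ?
""", (1,)).fetchone()[0]

# Design B answers the same question with a single-table lookup
email_b = conn.execute(
    "SELECT email FROM customer_b WHERE customer_id = ?", (1,)).fetchone()[0]

print(email_a, email_b)
```

Design A is not wrong in general; it may suit an application where customers have many contacts and addresses. Which design is appropriate depends on how the data will actually be used.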

Always start early:

Start this work as early as possible. Data modelling is not something you can accomplish at the last moment.

Repetitive Process:

It is not a fixed, one-time process. It is iterative: the model is revisited as new requirements and data are introduced, so building in flexibility helps when new information becomes available.
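As a small sketch of what one such iteration might look like, the example below adds a column to an existing table when a new requirement arrives. It uses Python's built-in sqlite3 module; the customer table and the phone column are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# First version of the model: only what the application needed at the start
conn.execute(
    "CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute("INSERT INTO customer VALUES (1, 'Asha')")

# A new requirement arrives later: store a phone number for each customer
conn.execute("ALTER TABLE customer ADD COLUMN phone TEXT")

# Column names after the change (index 1 of each table_info row is the name)
columns = [row[1] for row in conn.execute("PRAGMA table_info(customer)")]

# Existing rows are kept; the new column is NULL until it is filled in
phone = conn.execute(
    "SELECT phone FROM customer WHERE customer_id = 1").fetchone()[0]

print(columns, phone)
```

Real changes are often larger than adding one nullable column, but the idea is the same: the model is expected to change, so it helps to design it so that changes like this stay cheap.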

This was the second blog of my Data Engineering series. In the next part, I will write about Data Modelling using a relational database.

THANK YOU!!

BE SAFE!!
