TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Member-only story

Exploratory Data Analysis with Python — Part 1

A template to follow to get you started analyzing data with Python and Pandas

Gustavo R Santos
TDS Archive
Published in
6 min readNov 28, 2021

--

Photo by Markus Winkler on Unsplash

Data Science has no recipe. Don’t think that there’s a template you can follow for each and every dataset. There is not.

What I am going to present in this series of posts is just a suggestion, a place to start. From there, obviously, you will be dragged by your data to perform other checks that will fit the needs of your project. It should not be understood as a model to follow or a set of rules, but simply something to get you moving and helping you to extract the first insights from your data.

Summary

Exploratory Data Analysis (EDA) is the art of understanding your dataset and extracting the first insights from it, so you can prepare it for the next phases of the Data Science flow — e.g. data cleaning, data formatting, feature engineering and modeling.

These are the topics included in this series:

  • Libraries import and loading the data
  • Checking data types
  • Looking for null or missing values
  • Descriptive statistics profiling
  • Univariate Analysis
  • Correlations

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Gustavo R Santos
Gustavo R Santos

Written by Gustavo R Santos

Data Scientist | I solve business challenges through the power of data. | Visit my site: https://gustavorsantos.me

No responses yet