TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.


Data Frame EDA Packages Comparison: Pandas Profiling, Sweetviz, and PandasGUI

11 min read · Oct 25, 2020


GIF created by Author

As data scientists, our work always involves exploring data, a process often called Exploratory Data Analysis (EDA). The purpose of exploring data is to know our data better and grasp what we are dealing with.

Previously, exploring data in a pandas data frame was a big hassle because we needed to code every single analysis from scratch. Not only does this take a lot of time, but it also demands our focus.

Take the mpg dataset below as an example.

import pandas as pd
import seaborn as sns
# Load the example mpg dataset that ships with seaborn
mpg = sns.load_dataset('mpg')
# Preview the first five rows
mpg.head()
Image created by Author

While the data looks simple, exploring this dataset would still take a lot of time.
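To give a sense of what coding the analysis from scratch means, here is a minimal sketch of a manual first pass using only pandas. The small data frame is a made-up stand-in with mpg-like column names, not the real dataset, and the helper name `manual_eda` is hypothetical, so treat this as an illustration rather than a complete EDA.

```python
import pandas as pd

# Hypothetical stand-in rows with mpg-like columns (illustrative values only,
# not taken from the real mpg dataset)
df = pd.DataFrame({
    "mpg": [18.0, 15.0, 24.0, 27.0, None],
    "cylinders": [8, 8, 4, 4, 6],
    "horsepower": [130.0, 165.0, 95.0, None, 110.0],
    "origin": ["usa", "usa", "japan", "europe", "usa"],
})


def manual_eda(frame):
    """Collect a few basic checks that we would otherwise write one by one."""
    numeric = frame.select_dtypes(include="number")
    categorical = frame.select_dtypes(exclude="number")
    return {
        "shape": frame.shape,                # number of rows and columns
        "missing": frame.isna().sum(),       # missing values per column
        "summary": numeric.describe(),       # count, mean, std, quartiles
        "correlation": numeric.corr(),       # pairwise Pearson correlation
        "category_counts": {                 # frequency of each category
            col: categorical[col].value_counts() for col in categorical.columns
        },
    }


report = manual_eda(df)
print(report["shape"])
print(report["missing"])
print(report["summary"])
```

Even this short pass leaves out plots, distributions, and interactions between columns, each of which would need its own code.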

Luckily, many great people have already developed great packages that simplify the EDA process. Examples of these packages are Pandas Profiling, Sweetviz, and PandasGUI.

Knowing that there are many EDA packages out there, I am intrigued to see how they compare to each other and which packages are…


Published in TDS Archive



Written by Cornellius Yudha Wijaya

2.6M+ Views | Top 1000 Writer | LinkedIn: Cornellius Yudha Wijaya | Twitter: @CornelliusYW