TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Member-only story

Pandas III: value_counts(), duplicated(), min(), max()

In Pandas II, we began to clean up the Metal Bands by Nation data set. We eliminated an unneeded column and filled some missing values. Now, we are going to examine the other columns, assess their data types and value, and take action as needed.

Christine Egan
TDS Archive
Published in
4 min readApr 4, 2021

--

Image by Nick115 from Pixabay

I. Examining Data Types

  1. Open your project notebook, then click Kernel > Restart and Run All to re-execute our previous work.

2. It would be good to see what other kinds of values we are dealing with. According to what we learned from entering info we have five object (non-numeric) columns, and one numeric column. The numeric column is our target variable — the number of fans (more on that later). The rest of the data is categorical data that informs us about the qualities of the different bands.

II. Value Counts in Python…

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Christine Egan
Christine Egan

No responses yet