EDA On Covid19 India Data

Chilakala Bala Mahesh
Analytics Vidhya
Published in
3 min readApr 26, 2021

Exploratory Data Analysis(EDA) refers to the critical process of performing initial investigations on data so as to discover patterns, to spot anomalies, to test hypothesis and to check assumptions with the help of summary statistics and graphical representations.

At first, you need to download the covid19 India data.

Let’s import all the libraries and read the data.

We can check if the data is successfully imported by displaying the first 5 rows of data frame using head() method.

Now, let’s apply the describe() method over this dataset and see the results. It displays a description of mean, standard deviation, quartiles and maximum & minimum values.

State wise Analysis

Top 10 states with confirmed cases

Top 10 states with cured cases

Top 10 states with deaths

From the above plots we can see that Maharashtra state has highest number of confirmed, recovered and death cases.

Month wise Analysis

Comparison of confirmed cases

Comparison of cured cases

Comparison of death cases

Thus, by using EDA techniques we can get to know the dataset completely and we can bring out meaning information from the dataset and also could figure out if any flaw exists in dataset or not.

Using the visualization charts it is possible to derive several insights into the data. In this article, we covered some of the basic visualizations. A lot more analysis and insights can be derived using this data.

--

--