Covid-19 : The World Threat

Harsita Mav
Butterfly Effect | MetaMorphoSys
7 min readJun 13, 2021

Deadly coronavirus has taken over the world . It has affected the world lot in almost all the fields let it be education, businesses , countries economy , healthcare and much more. How many people have the coronavirus ? This remains the most important question for its people, schools, hospitals, and businesses. It is also still among the hardest to answer.

On looking at the numbers, there are total 17.6 Cr cases all over the world with daily rise in new cases by almost 4–5 L. Total Deaths recorded till now are almost 37.8 L. Though we have good recovery rate as well 16 Cr. Analysis performed reveals that vaccination numbers are also quite good i.e. around 2 billion people worldwide and count is increasing everyday.

Introduction :

This analysis is basically trying to visualize various trends in cases of Covid19 . This report shows various graphs show casing the total confirmed cases, total recovered , total active cases and death rates worldwide. This report also has detailed analysis for Covid-19 cases summary for India.

Exploratory Data Analysis :

In statistics, exploratory data analysis is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.

Basic flow of any Model

The above is the basic approach for any data science project to start with and exploratory analysis is the very basic and most important step .

The very step is Importing Libraries required for the model, followed by reading the dataset and preprocessing it. Here for this analysis I have used two different dataset . These have lot of null values in them, so we need to replace them either with Nan or 0 or we can even delete that rows. This step is must, if not done might affect our visualizations.

Since, we have two datasets there are multiple data frames so for ease we would be combining these into a single data frame and then we would be dropping unwanted columns.

Insights :

Correlation Plot: Correlation plots can be used to quickly find insights. It is used to investigate the dependence between multiple variables at the same time and to highlight the most correlated variables in a data table.

Bar Plots : A bar plot or bar chart is a graph that represents the category of data with rectangular bars with lengths and heights that is proportional to the values which they represent. The bar plots can be plotted horizontally or vertically. One of the axis of the plot represents the specific categories being compared, while the other axis represents the measured values corresponding to those categories.

Bar plot for Total confirmed Cases for Top 20 Countries
Bar plot for Total active Cases for Top 20 Countries
Bar plot for Total Deaths for Top 20 Countries

Scatter Geo Plot : The Plotly Express function px.scatter_geo for a geographical scatter plot. The size argument is used to set the size of markers from a given column of the Data Frame.

Scatter geo plot for death rates
Bar plot for Total Recovered Cases for Top 20 Countries
Bar plot for Comparing between Total confirmed cases and Total recovered cases for Top 20 Countries
Bar plot for Comparing between Total recovered cases and Total active cases for Top 10 Countries

Bar and Line Plots for Daily Rise in Cases:

A line chart or line plot or line graph or curve chart is a type of chart which displays information as a series of data points called ‘markers’ connected by straight line segments.

Bar plot for daily rise in cases
Line plot for daily rise in cases

Vaccination Insights :

Let’s see vaccination trends for top 20 countries . This will have visualizations for number of people vaccinated with one dose and people fully vaccinated.

Top 20 countries
Bar plot for fully vaccinated and vaccinated with one dose for top 20 countries

Vaccine Brands :

There are total 14 brands used by various countries, few of them are :

Choropleth Map : A choropleth map is a type of thematic map in which a set of pre-defined areas is colored or patterned in proportion to a statistical variable that represents an aggregate summary of a geographic characteristic within each area, such as population density or per-capita income.

Choropleth map for Popular vaccine brands used

Analysis for India :

India is witnessing a second wave of COVID-19 pandemic. So far, the country has recorded over 2.65 crore confirmed COVID-19 cases, including 2,99,266 deaths. A total of 2,34,25,467 people have recuperated from COVID-19 so far. There are 28,05,399 active cases in the country as of date, which comprises 10.57 percent of the total caseload, the data stated.

But these trends are reducing after taking several precautionary measures. India’s daily Covid count remained below 1 lakh with the country reporting 84,332 new cases , the lowest in 70 days, according to the data updated by the Union health ministry. With this, the country’s total tally has gone up to 29,359,155. On June 8, India reported 86,498 Covid cases, the lowest in 63 days. Deaths due to Covid, however, are on the rise again. With 4,002 more Covid deaths in the last 24 hours, the death toll has climbed to 3,67,081.

Visualizations for India’s Case Trends :

Plot for Active cases and daily new deaths

The plot above clearly shows that though active cases are decreasing but there is rise in number of deaths due to covid-19. Reason behind increasing death rates is definitely lack of resources due to more number of cases compared to resources.

Daily vaccinations in India :

Dist Plot : distplot() function provides access to several approaches for visualizing the univariate or bivariate distribution of data, including subsets of data defined by semantic mapping and faceting across multiple subplots.

Dist Plot for daily vaccinations
Line Plot for daily vaccinations

The above graphs shows us that vaccination in the April and May month was reduced with quite large number due to unavailability of vaccines but later increased after restoration of vaccines. Vaccine brand used in India is Covishield and Covaxin, highest used is Covishield.

Joint plot : jointplot() function provides a convenient interface to the JointGrid class, with several canned plot kinds. This is intended to be a fairly lightweight wrapper; if you need more flexibility, you should use JointGrid directly.

Joint plot for seeing trends in daily_new_cases and daily_vaccinations

Pair Plot : Pair plots are a powerful tool to quickly explore distributions and relationships in a dataset. Seaborn provides a simple default method for making pair plots that can be customized and extended through the Pair Grid class.

Pair Plot for India cases Analysis

Conclusion :

So with that we completed creating COVID-19 report with Python. If you forgot to follow any above-mentioned steps I have provided the full code for this analysis below. Apart from analysis we can create various predicting models using python, numpy, pandas, matplotlib and many more. You can find many useful resources on the internet based on data science in python.

Hope this article was useful and knowledgeable!!!

References :

  1. Dataset : First and Second
  2. Libraries : seaborn , plotly , matplotlib and colormaps
  3. Code : Github and Kaggle Notebook

--

--

Harsita Mav
Butterfly Effect | MetaMorphoSys

Data science enthusiast, ML Developer, LLM & GenAI Consultant