# Among Us Google Play Store Rating Exploratory Data Analysis using Python

Nov 5, 2020 · 4 min read

Hello everyone! Do you know the game Among Us? This phenomenal game is growing fast on the Google Play and App Store platforms. Some colleagues and I, all graduates of IYKRA Data MBA Batch IV, tried to scrape, analyze, and visualize the comment sections of this game.

In this notebook, we do a simple Exploratory Data Analysis (EDA) of the Google Play Store user ratings and comments for Among Us. The scraping was done beforehand, covering reviews from September 22nd to October 20th, and the results were saved in Microsoft Excel format.

We begin by importing the required libraries and reading the Excel file.

```python
# Import required libs
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import plotly
import plotly.express as px
import plotly.graph_objects as go

# Get the data and show its top 5 data
drive_path = '/path/to/file/'
df = pd.read_excel(drive_path+'AmongUsRating.xlsx',index_col='Unnamed: 0')
df.head()
```

We then inspect the data with `dtypes` to see the type of each column.

```python
# Get the type of each column
df.dtypes
```

```
Star                  object
User                  object
Comment               object
DateReview    datetime64[ns]
Rate                   int64
dtype: object
```
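The date grouping used later only works if `DateReview` is a datetime column and `Rate` is numeric, as in the output above. As a minimal sketch, assuming a file where both columns were read as text, the conversion could look like this. The rows below are made up for illustration and are not taken from the scraped data.

```python
import pandas as pd

# Made-up rows for illustration only; not taken from the scraped data
sample = pd.DataFrame({
    "User": ["user_a", "user_b", "user_c"],
    "Comment": ["Fun game", "Too many ads", "Great with friends"],
    "DateReview": ["2020-09-22", "2020-09-23", "2020-09-23"],
    "Rate": ["5", "2", "4"],
})

# Dates and ratings stored as text show up with a text dtype
# (object in most pandas versions)
print(sample.dtypes)

# Convert them so that date grouping and numeric statistics work
sample["DateReview"] = pd.to_datetime(sample["DateReview"])
sample["Rate"] = sample["Rate"].astype("int64")
print(sample.dtypes)
```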

We then call `describe` to see the count, mean, min, max, and other statistics of each numerical column.

```python
# Get the description of each numerical column
df.describe()
```
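To make the `describe` output easier to read, here is a small sketch of what it reports for a rating column. The five ratings are made up for illustration, so the numbers are not those of the scraped dataset.

```python
import pandas as pd

# Made-up ratings for illustration only
rates = pd.Series([5, 5, 4, 1, 3], name="Rate")

# describe() returns a Series indexed by statistic name
summary = rates.describe()
print(summary)

# Individual statistics can be read by their label
print(summary["mean"], summary["min"], summary["max"])
```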

## Graphical Representation

Based on the dataset, we want to see how many users gave each rating on each day, so that the ratings can be compared over time. To do this, we use the `groupby` method on the rate and date columns. Note that we assign the result to a new variable so that the original data frame stays intact for the other analyses.

```python
# Count the total user of each day grouped by its rate and date
df_user_rating_day = df.groupby(['Rate','DateReview'])[['User']].count()
df_user_rating_day.reset_index(inplace=True)
df_user_rating_day.head()
```
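The grouping above can be tried on a handful of rows to see the shape of the result. This is a minimal sketch on made-up data. The `wide` table is an optional extra step, not part of the original notebook, that makes days without a given rating visible as zeros.

```python
import pandas as pd

# Made-up rows for illustration only; not taken from the scraped data
toy = pd.DataFrame({
    "User": ["a", "b", "c", "d", "e"],
    "DateReview": pd.to_datetime(
        ["2020-09-22", "2020-09-22", "2020-09-22", "2020-09-23", "2020-09-23"]
    ),
    "Rate": [5, 5, 1, 5, 4],
})

# Long format: one row per (Rate, DateReview) pair that actually occurs
counts = toy.groupby(["Rate", "DateReview"])[["User"]].count().reset_index()
print(counts)

# Wide format: one row per day, one column per rating, 0 where no review exists
wide = (
    counts.pivot(index="DateReview", columns="Rate", values="User")
    .fillna(0)
    .astype(int)
)
print(wide)
```

The long format is what `px.line` expects when a column is passed as `color`, while the wide format is handier for reading the numbers directly.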

We plot the Total User per Rate per Day data as a line chart, since line charts suit time-series data well.

```python
# Plot the user_rating_per_day data using plotly
fig = px.line(df_user_rating_day, x='DateReview', y='User', color='Rate',
              title='Among Us Google Play Rating 22 September to 20 October 2020')
fig.show()
```

We also look at the average rating per week to see how users' opinion of the game changes from week to week.

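```python
# Compute the mean rating of each week
# Rate is selected explicitly because newer pandas versions raise an error
# when asked to average text columns such as User or Comment
df_user_week = df.groupby(pd.Grouper(key='DateReview', freq='1W'))[['Rate']].mean()
df_user_week.index = df_user_week.index.strftime('%Y-%m-%d')
df_user_week.reset_index(inplace=True)
df_user_week
```

```python
fig = px.line(df_user_week, x='DateReview', y='Rate', title='Among Us Rating per Week')
fig.show()
```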
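The weekly grouping can also report how many reviews fall into each week next to the mean rating. The sketch below is an illustration on made-up rows, not part of the original notebook. With `freq='W'`, pandas closes each weekly bin on a Sunday and labels it with that Sunday's date.

```python
import pandas as pd

# Made-up rows for illustration only; not taken from the scraped data
toy = pd.DataFrame({
    "DateReview": pd.to_datetime(
        ["2020-09-22", "2020-09-24", "2020-09-27", "2020-09-28", "2020-10-01"]
    ),
    "Rate": [5, 3, 4, 2, 4],
})

# One row per week: number of reviews and mean rating
weekly = (
    toy.groupby(pd.Grouper(key="DateReview", freq="W"))["Rate"]
    .agg(reviews="count", mean_rate="mean")
)
print(weekly)
```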

Last but not least, we want to see the distribution of users per rating in the dataset, shown as a pie chart.

```python
# Count the total user of each rate
df_user_rate = df.groupby(['Rate'])[['User']].count()
df_user_rate.reset_index(inplace=True)
df_user_rate
```

```python
# Plot the user_rating data using plotly
fig = px.pie(df_user_rate, values='User', names='Rate', title='User Rating Distribution')
fig.show()
```
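The pie chart shows shares rather than raw counts. If the percentages are needed as numbers, one possible way to get them is `value_counts(normalize=True)`. This is a sketch on made-up ratings, not the method used in the notebook.

```python
import pandas as pd

# Made-up ratings for illustration only
rates = pd.Series([5, 5, 5, 4, 1], name="Rate")

# Fraction of reviews per rating, ordered by rating
share = rates.value_counts(normalize=True).sort_index()

# Same values as percentages, rounded to one decimal
percent = (share * 100).round(1)
print(percent)
```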

# Conclusion

We showed a basic EDA in Python of the Google Play Store user ratings and comments for Among Us, which we scraped for the period from September 22nd to October 20th. The results show that the number of reviews peaked on September 24th, and that the reviews of this game are mostly positive (4 stars and above). There is still more to analyze in this data, especially text analysis of the user comments, and we hope to be able to do it soon.
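Both observations can also be checked numerically instead of being read off the charts. The following is a minimal sketch of one way to do that. It runs on made-up rows, so the values it prints are not the results of the scraped dataset.

```python
import pandas as pd

# Made-up rows for illustration only; not taken from the scraped data
toy = pd.DataFrame({
    "User": ["a", "b", "c", "d", "e"],
    "DateReview": pd.to_datetime(
        ["2020-09-22", "2020-09-24", "2020-09-24", "2020-09-24", "2020-09-25"]
    ),
    "Rate": [5, 5, 4, 1, 3],
})

# Day with the most reviews
reviews_per_day = toy.groupby("DateReview")["User"].count()
peak_day = reviews_per_day.idxmax()
print(peak_day.date(), reviews_per_day.max())

# Share of reviews with 4 stars or more
positive_share = (toy["Rate"] >= 4).mean()
print(positive_share)
```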

# Remarks

Please find the Jupyter Notebook used for this analysis here. We have also done the analysis using Tableau, which you can find in this post. Feel free to discuss anything with us :D

## Our Projects

1. Among Us Google Play Store User Rating Visualization Using Tableau (source)
2. Among Us Google Play Store Rating Exploratory Data Analysis using Python (source)
3. Among Us EDA Web Application using Streamlit (source)

# Our team

Above, from left to right: Imam Bhaskara, Ricky Nauvaldy Ruliputra, Rahman Firmansyah
Below, from left to right: Cahya Putera, Winata Syahputera, Muhamad Teo Khibran, Arnold P

## The Startup

Get smarter at building your thing. Join The Startup’s +792K followers.

Written by

## Ricky Nauvaldy
