UDEMY COURSES DATA ANALYSIS

Gokcencngz
Mar 24, 2023

In this article, we will analyze Udemy course data. The project relies on the Udemy Courses Data 2023 provided by Kaggle. It answers five simple questions to show my proficiency in SQL for data analysis.

We will find answers to questions such as:

  • How many instructors and courses are there in total?
  • What are the 10 most popular courses?
  • What are the 10 most popular instructors?
  • What are the most reviewed instructor’s courses?
  • What are the 5 most popular data analysis courses?

Jupyter notebooks can be powerful tools for working with a remote database. They let you streamline, reproduce, and document your analysis. Python's Database API (DB-API) has implementations for many databases, including relational database systems. In this stage, we will briefly see how to connect to PostgreSQL, a popular open-source relational database, and how to run queries against it from a Jupyter notebook using Python.

The Python DB API implementation for PostgreSQL is “psycopg2”. Depending on your environment, you might need to install a related package such as psycopg2-binary. If your code fails with a “ModuleNotFoundError”, pip install the missing module from your console. Let’s start by following the steps below:
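The connection steps might look roughly like the sketch below. The environment variable names follow the usual PostgreSQL conventions, but the default database name `udemy` and the helper names `connection_settings`, `connect`, and `run_query` are assumptions made for illustration, not the project's actual code.

```python
import os


def connection_settings():
    """Read connection details from the environment (hypothetical defaults)."""
    return {
        "host": os.environ.get("PGHOST", "localhost"),
        "port": int(os.environ.get("PGPORT", "5432")),
        "dbname": os.environ.get("PGDATABASE", "udemy"),  # assumed database name
        "user": os.environ.get("PGUSER", "postgres"),
        "password": os.environ.get("PGPASSWORD", ""),
    }


def connect():
    """Open a PostgreSQL connection with psycopg2."""
    # Imported here so a missing module only raises when we actually connect.
    import psycopg2  # pip install psycopg2-binary

    return psycopg2.connect(**connection_settings())


def run_query(conn, sql):
    """Execute a SQL statement and return all rows as a list of tuples."""
    with conn.cursor() as cur:
        cur.execute(sql)
        return cur.fetchall()
```

In a notebook cell, `conn = connect()` followed by `run_query(conn, "SELECT version();")` is a quick way to check that the connection works.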

Now let’s query the answers to our questions.

How many instructors and courses are there in total?

The first query counts the distinct instructors and the distinct courses in the dataset.
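A minimal sketch of such a count is shown here. It assumes the data was loaded into a table called `udemy_courses` with `title` and `instructor` columns; these names are hypothetical and the Kaggle file may label them differently.

```python
from contextlib import closing

# Assumed schema: one row per course in a table named udemy_courses.
COUNT_SQL = """
SELECT COUNT(DISTINCT instructor) AS instructors,
       COUNT(DISTINCT title)      AS courses
FROM udemy_courses;
"""


def count_instructors_and_courses(conn):
    """Return (instructor_count, course_count) for any DB-API connection."""
    with closing(conn.cursor()) as cur:
        cur.execute(COUNT_SQL)
        return cur.fetchone()
```

`COUNT(DISTINCT ...)` matters here because an instructor with many courses appears on many rows but should be counted once.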

What are the 10 most popular courses?

When looking for the most popular courses, we’ll consider those with a high number of reviews and above-average ratings.
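One way to express this, as a sketch: keep only courses rated above the overall average, then rank them by review count. The table `udemy_courses` and its columns `title`, `instructor`, `reviews`, and `rating` are assumed names, and reading "popular" as "most reviewed" is an interpretation of the criterion above.

```python
from contextlib import closing

# Assumed schema: udemy_courses(title, instructor, reviews, rating).
TOP_COURSES_SQL = """
SELECT title, instructor, reviews, rating
FROM udemy_courses
WHERE rating > (SELECT AVG(rating) FROM udemy_courses)
ORDER BY reviews DESC
LIMIT 10;
"""


def top_courses(conn):
    """Return up to 10 above-average-rated courses, most reviewed first."""
    with closing(conn.cursor()) as cur:
        cur.execute(TOP_COURSES_SQL)
        return cur.fetchall()
```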

What are the 10 most popular instructors?

We’ll consider the number of reviews when querying the most popular instructors. Judging by the results, Jose Portilla stands out clearly from the rest.
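A sketch of this ranking sums the reviews across each instructor's courses. As before, the table name `udemy_courses` and the `instructor` and `reviews` columns are assumptions about how the data was loaded.

```python
from contextlib import closing

# Assumed schema: udemy_courses(title, instructor, reviews, rating).
TOP_INSTRUCTORS_SQL = """
SELECT instructor, SUM(reviews) AS total_reviews
FROM udemy_courses
GROUP BY instructor
ORDER BY total_reviews DESC
LIMIT 10;
"""


def top_instructors(conn):
    """Return up to 10 (instructor, total_reviews) rows, highest total first."""
    with closing(conn.cursor()) as cur:
        cur.execute(TOP_INSTRUCTORS_SQL)
        return cur.fetchall()
```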

What are the most reviewed instructor’s courses?

This query lists all the courses of the most reviewed instructor, sorted by rating. The ratings are pretty good too.
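Rather than hard-coding the instructor's name, a sketch like the following could look it up with a subquery and then list that instructor's courses. The table and column names (`udemy_courses`, `title`, `instructor`, `reviews`, `rating`) are hypothetical.

```python
from contextlib import closing

# Assumed schema: udemy_courses(title, instructor, reviews, rating).
TOP_INSTRUCTOR_COURSES_SQL = """
SELECT title, rating, reviews
FROM udemy_courses
WHERE instructor = (
    SELECT instructor
    FROM udemy_courses
    GROUP BY instructor
    ORDER BY SUM(reviews) DESC
    LIMIT 1
)
ORDER BY rating DESC;
"""


def top_instructor_courses(conn):
    """Return the most reviewed instructor's courses, best rated first."""
    with closing(conn.cursor()) as cur:
        cur.execute(TOP_INSTRUCTOR_COURSES_SQL)
        return cur.fetchall()
```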

What are the 5 most popular data analysis courses?

Here, when we query the most popular data analysis courses, we display the most reviewed courses with a rating greater than 4.50.
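A possible version of this filter is sketched below. Matching on the phrase "data analysis" in the course title is an assumption about how the topic is identified, and the table and column names (`udemy_courses`, `title`, `instructor`, `reviews`, `rating`) are hypothetical.

```python
from contextlib import closing

# Assumed schema: udemy_courses(title, instructor, reviews, rating).
# The statement is executed without parameters, so the % wildcards are
# passed to the database unchanged.
DATA_ANALYSIS_SQL = """
SELECT title, instructor, reviews, rating
FROM udemy_courses
WHERE LOWER(title) LIKE '%data analysis%'
  AND rating > 4.5
ORDER BY reviews DESC
LIMIT 5;
"""


def top_data_analysis_courses(conn):
    """Return up to 5 data analysis courses rated above 4.5, most reviewed first."""
    with closing(conn.cursor()) as cur:
        cur.execute(DATA_ANALYSIS_SQL)
        return cur.fetchall()
```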

The project link: my GitHub.
