This is udacity’s capstone project, using spark to analyze user behavior data from music app Sparkify.

Project Definition

Sparkify is a music app, this dataset contains two months of sparkify user behavior log. The log contains some basic information about the user as well as information about a single action. A user can contain many entries. In the data, a part of the user is churned, through the cancellation of the account behavior can be distinguished.

My analysis detail is here. And the project got a lot of help in this blog.

Problem Statement

The job of the project is to find the characteristics…


image source: https://datasciencedojo.com/locations/seattle/

Business background

“A‌i‌r‌b‌n‌b‌,‌ ‌I‌n‌c‌.‌ is a privately held global company headquartered in San Francisco that operates an online marketplace and hospitality service which is accessible via its websites and mobile apps. Members can use the service to arrange or offer lodging, primarily homestays, or tourism experiences. The company does not own any of the real estate listings, nor does it host events; as a broker, it receives commissions from every booking.”

Understanding CRISP-DM methodology

CRISP-DM stands for Cross-Industry Standard Process for Data Mining and the whole includes 6 steps;

Business understanding, Data understanding, Data preparation, Modelling, Evaluation, and Deployment.

The first step as part of…

Emre Ismet Karakurum

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store