Machine Learning Process Tutorial

An extensive tutorial illustrating the machine learning process using the cruise ship dataset, with Python code included.

Introduction

Figure 1. Illustrating the Machine Learning Process. Image by Benjamin O. Tayo.

In this article, we present a practical tutorial on the machine learning process using the cruise ship dataset cruise_ship_info.csv. The dataset and the Jupyter notebook for this tutorial can be downloaded here: https://github.com/bot13956/Machine_Learning_Process_Tutorial.

1. Problem Framing

Objective: The goal of this project is to build a regression model that predicts the “crew” size for potential cruise ship buyers, using the cruise ship dataset cruise_ship_info.csv.

2. Data Analysis

2.1 Import necessary libraries

import numpy as np 
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

2.2 Read dataset and display columns

df = pd.read_csv("cruise_ship_info.csv")
df.head()
Table 1. First five rows of the dataset.

2.3 Calculate the covariance matrix

cols = ['Age', 'Tonnage', 'passengers', 'length',
        'cabins', 'passenger_density', 'crew']
from sklearn.preprocessing import StandardScaler
stdsc = StandardScaler()
X_std = stdsc.fit_transform(df[cols].values)
cov_mat = np.cov(X_std.T)
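Note that for standardized features (zero mean, unit variance), the covariance matrix coincides with the Pearson correlation matrix, which is why the heatmap in the next step can be read directly as correlation coefficients. A quick sanity check on synthetic data (not the cruise ship dataset) illustrates this:

```python
import numpy as np

# Synthetic data with some correlation between columns 0 and 1.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
X[:, 1] += 0.5 * X[:, 0]

# Standardize manually (population std, ddof=0).
X_std = (X - X.mean(axis=0)) / X.std(axis=0)

# Covariance of standardized data (bias=True divides by N, matching
# the ddof=0 standardization) equals the correlation matrix.
cov_of_std = np.cov(X_std.T, bias=True)
corr = np.corrcoef(X.T)

print(np.allclose(cov_of_std, corr))  # True
```

This is why the heatmap title below refers to "correlation coefficients" even though `np.cov` is used.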

2.4 Generate a heatmap for visualizing the covariance matrix

plt.figure(figsize=(10,10))
sns.set(font_scale=1.5)
hm = sns.heatmap(cov_mat,
                 cbar=True,
                 annot=True,
                 square=True,
                 fmt='.2f',
                 annot_kws={'size': 12},
                 yticklabels=cols,
                 xticklabels=cols)
plt.title('Covariance matrix showing correlation coefficients')
plt.tight_layout()
plt.show()
Figure 2. Covariance matrix plot.

2.5 Feature selection using covariance matrix plot

cols_selected = ['Tonnage', 'passengers', 'length', 'cabins', 'crew']
df[cols_selected].head()
Table 2. First five rows of the selected features and the target variable.
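The selection above is done by reading the heatmap visually. A programmatic alternative (not from the tutorial, and the 0.6 cutoff is an illustrative assumption) is to keep only features whose absolute correlation with the target exceeds a threshold:

```python
import numpy as np
import pandas as pd

# Hypothetical helper: keep features whose |correlation| with the
# target column exceeds a chosen threshold.
def select_features(df: pd.DataFrame, target: str, threshold: float = 0.6):
    corr = df.corr(numeric_only=True)[target].drop(target)
    return corr[corr.abs() > threshold].index.tolist()

# Tiny synthetic demo (column names mirror the cruise ship dataset):
rng = np.random.default_rng(1)
n = 300
cabins = rng.normal(size=n)
demo = pd.DataFrame({
    'Age': rng.normal(size=n),                      # unrelated to crew
    'cabins': cabins,                               # strongly related
    'crew': 0.9 * cabins + 0.1 * rng.normal(size=n),
})
print(select_features(demo, 'crew'))  # ['cabins']
```

Thresholds like this are a heuristic; on the real dataset the visual inspection and the programmatic cut should agree on the strongly correlated features.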

2.6 Define your features matrix and target variable

X = df[cols_selected].iloc[:, 0:4].values   # features matrix
y = df[cols_selected]['crew'].values        # target variable

The features matrix and the target variable obtained above can then be used for model building.

3. Model Building

Since this is a regression problem, we will implement and compare three regression algorithms: Linear Regression (LR), KNeighbors Regression (KNR), and Support Vector Regression (SVR).

The dataset is divided into training and test sets, with cross-validation on the training set playing the role of a separate validation set. Hyperparameter tuning is used to fine-tune the model and guard against overfitting, while cross-validation checks that the model generalizes beyond the folds it was fit on. After fine-tuning, the model is applied to the held-out test set; its performance there approximates what would be expected when the model makes predictions on unseen data.
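In the code below, each model is wrapped in a scikit-learn `Pipeline` so that the `StandardScaler` is refit on each cross-validation training fold, avoiding data leakage from the validation fold. A minimal sketch of the pattern on synthetic data (not the cruise ship dataset):

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

# Nearly linear synthetic data, so R2 should be high.
rng = np.random.default_rng(42)
X = rng.normal(size=(100, 4))
y = X @ np.array([1.0, 2.0, 0.5, -1.0]) + 0.1 * rng.normal(size=100)

# Scaler + model in one pipeline: the scaler is fit only on each
# training fold inside cross_val_score, never on the held-out fold.
pipe = Pipeline([('scl', StandardScaler()), ('lr', LinearRegression())])
scores = cross_val_score(pipe, X, y, scoring='r2', cv=5)
print(scores.mean() > 0.9)  # True
```

The same pipeline object can later be fit on the full training set and applied to the test set, exactly as done in section 3.4.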

3.1 Model building and evaluation

from sklearn.metrics import r2_score
from sklearn.model_selection import cross_val_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression
from sklearn.neighbors import KNeighborsRegressor
from sklearn.svm import SVR
from sklearn.pipeline import Pipeline

pipe_lr = Pipeline([('scl', StandardScaler()),
                    ('lr', LinearRegression())])
pipe_knr = Pipeline([('scl', StandardScaler()),
                     ('knr', KNeighborsRegressor(n_neighbors=3))])
pipe_svr = Pipeline([('scl', StandardScaler()),
                     ('svr', SVR(kernel='linear', C=1.0))])

sc_y = StandardScaler()
train_score_lr = []
train_score_knr = []
train_score_svr = []
n = 15
for i in range(n):
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, random_state=i)
    # standardize the target using the training set only
    y_train_std = sc_y.fit_transform(y_train[:, np.newaxis]).flatten()
    train_score_lr = np.append(
        train_score_lr,
        np.mean(cross_val_score(pipe_lr, X_train, y_train_std,
                                scoring='r2', cv=10)))
    train_score_knr = np.append(
        train_score_knr,
        np.mean(cross_val_score(pipe_knr, X_train, y_train_std,
                                scoring='r2', cv=10)))
    train_score_svr = np.append(
        train_score_svr,
        np.mean(cross_val_score(pipe_svr, X_train, y_train_std,
                                scoring='r2', cv=10)))

train_mean_lr = np.mean(train_score_lr)
train_std_lr = np.std(train_score_lr)
train_mean_knr = np.mean(train_score_knr)
train_std_knr = np.std(train_score_knr)
train_mean_svr = np.mean(train_score_svr)
train_std_svr = np.std(train_score_svr)

3.2 Output from machine learning model

print('R2 train for lr: %.3f +/- %.3f' % (train_mean_lr, train_std_lr))
print('R2 train for knr: %.3f +/- %.3f' % (train_mean_knr, train_std_knr))
print('R2 train for svr: %.3f +/- %.3f' % (train_mean_svr, train_std_svr))

3.3 Generate visualization of cross-validation score

plt.figure(figsize=(15,11))
plt.plot(range(n), train_score_lr, color='blue', linestyle='dashed',
         marker='o', markerfacecolor='blue', markersize=10,
         label='Linear Regression')
plt.plot(range(n), train_score_knr, color='green', linestyle='dashed',
         marker='s', markerfacecolor='green', markersize=10,
         label='KNeighbors Regression')
plt.plot(range(n), train_score_svr, color='red', linestyle='dashed',
         marker='^', markerfacecolor='red', markersize=10,
         label='Support Vector Regression')
plt.grid()
plt.ylim(0.7,1)
plt.title ('Mean cross-validation R2 score vs. random state parameter', size = 14)
plt.xlabel('Random state parameter', size = 14)
plt.ylabel('Mean cross-validation R2 score', size = 14)
plt.legend()
plt.show()
Figure 3. Mean cross-validation R2 scores for the different regression models.

3.4 Model’s performance on test set

pipe_lr.fit(X_train, y_train_std)
pipe_knr.fit(X_train, y_train_std)
pipe_svr.fit(X_train, y_train_std)

# predictions are in standardized units; map them back to the
# original crew scale before scoring against y_test
r2_score_lr = r2_score(
    y_test,
    sc_y.inverse_transform(pipe_lr.predict(X_test).reshape(-1, 1)))
r2_score_knr = r2_score(
    y_test,
    sc_y.inverse_transform(pipe_knr.predict(X_test).reshape(-1, 1)))
r2_score_svr = r2_score(
    y_test,
    sc_y.inverse_transform(pipe_svr.predict(X_test).reshape(-1, 1)))

print('R2 test for lr: %.3f' % r2_score_lr)
print('R2 test for knr: %.3f' % r2_score_knr)
print('R2 test for svr: %.3f' % r2_score_svr)
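Because the models were trained on a standardized target, their predictions must be mapped back with `inverse_transform` before being compared to `y_test`. A small round-trip sketch of that pattern (synthetic crew-like values, not the tutorial data):

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

y = np.array([3.55, 6.7, 19.1, 10.0, 9.2])   # crew-like values

# Standardize the target, as done before training.
sc_y = StandardScaler()
y_std = sc_y.fit_transform(y[:, np.newaxis]).flatten()

# A model trained on y_std predicts in standardized units;
# inverse_transform restores the original crew scale.
y_back = sc_y.inverse_transform(y_std[:, np.newaxis]).flatten()
print(np.allclose(y_back, y))  # True
```

Skipping the inverse transform would make the R2 scores meaningless, since the predictions and `y_test` would live on different scales.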

4. Application

In this stage, the final machine learning model is selected and put into production. The model is evaluated in a production setting in order to assess its performance. Any gap between the model’s experimental performance and its performance in production has to be analyzed; this feedback can then be used to fine-tune the original model.
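Putting the selected model into production typically means persisting the fitted pipeline and reloading it in the serving environment. A minimal sketch using `joblib` (the file name and toy data are illustrative, not from the tutorial):

```python
import joblib
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression

# Fit a pipeline on toy data standing in for the cruise-ship features.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 4))
y = X @ np.array([1.0, 0.5, -0.3, 2.0])

pipe = Pipeline([('scl', StandardScaler()), ('lr', LinearRegression())])
pipe.fit(X, y)

# Persist, reload, and verify that predictions match.
joblib.dump(pipe, 'crew_model.joblib')     # illustrative file name
loaded = joblib.load('crew_model.joblib')
print(np.allclose(pipe.predict(X), loaded.predict(X)))  # True
```

Persisting the whole pipeline (scaler plus model) ensures that production inputs receive exactly the same preprocessing as the training data.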

Based on the results from section 3, we observe that Linear Regression and Support Vector Regression perform at almost the same level, and both perform better than KNeighbors Regression. The final model selected could therefore be either Linear Regression or Support Vector Regression.

In summary, we have discussed the main stages of a machine learning process and illustrated the practical steps involved. We have illustrated our calculations using a regression problem, but the same process applies equally to classification projects.

Additional Data Science/Machine Learning Resources

Data Science Curriculum

Essential Maths Skills for Machine Learning

3 Best Data Science MOOC Specializations

5 Best Degrees for Getting into Data Science

5 reasons why you should begin your data science journey in 2020

Theoretical Foundations of Data Science — Should I Care or Simply Focus on Hands-on Skills?

Machine Learning Project Planning

How to Organize Your Data Science Project

Productivity Tools for Large-scale Data Science Projects

A Data Science Portfolio is More Valuable than a Resume

Data Science 101 — A Short Course on Medium Platform with R and Python Code Included

For questions and inquiries, please email me: benjaminobi@gmail.com

The Startup

Medium's largest active publication, followed by +752K people. Follow to join our community.

Benjamin Obi Tayo Ph.D.

Written by

Physicist, Data Science Educator, Writer. Interests: Data Science, Machine Learning, AI, Python & R, Predictive Analytics, Materials Sciences, Biophysics
