Exploring Titanic Dataset For the first time as a Beginner
Exploring, Modeling, and Predicting Survival.
Hello there,
I m Abhay, a student, and a machine learning enthusiast.
Recently I started working on some Kaggle datasets. One of the most famous datasets on Kaggle is Titanic Dataset. In this blog, I will show you my first-time interaction with the Kaggle dataset.
One thing to notice is that it is still an ongoing competition on Kaggle till Oct 2020.
You can find the dataset at https://www.kaggle.com/c/titanic
Let’s start,
I am going to use Kaggle inbuild notebook for all computation if you want you can also use Jupyter notebook.
Just do pip install jupyter-notebook
and then jupyter notebook
to run it on to the local server.
For a Kaggle notebook just go to New notebook can create a new one.
Importing necessary libraries
First thing is to import all the necessary libraries. There are 3–4 basic libraries like NumPy, pandas, matplotlib
, seaborn, etc. which can be used for every machine learning project.
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt…