Exploring Titanic Dataset For the first time as a Beginner

Exploring, Modeling, and Predicting Survival.

Abhay Parashar
The Pythoneers

--

Photo by Lorenzo from Pexels

Hello there,

I m Abhay, a student, and a machine learning enthusiast.

Recently I started working on some Kaggle datasets. One of the most famous datasets on Kaggle is Titanic Dataset. In this blog, I will show you my first-time interaction with the Kaggle dataset.

One thing to notice is that it is still an ongoing competition on Kaggle till Oct 2020.

You can find the dataset at https://www.kaggle.com/c/titanic

Let’s start,

I am going to use Kaggle inbuild notebook for all computation if you want you can also use Jupyter notebook.

Just do pip install jupyter-notebook and then jupyter notebook to run it on to the local server.

For a Kaggle notebook just go to New notebook can create a new one.

Importing necessary libraries

First thing is to import all the necessary libraries. There are 3–4 basic libraries like NumPy, pandas, matplotlib, seaborn, etc. which can be used for every machine learning project.

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt…

--

--

Abhay Parashar
The Pythoneers

Cyber Guy 🧑‍💻| Top Writer | 5M+ Views | Engineer | Learning and Sharing Knowledge Everyday | Python is ❤️| Editor of The Pythoneers