Preparing Data for Machine Learning
Clean up continuous data
As a reminder this article belongs to the course about Applied Machine Learning: Ensemble Learning. If you want to know more about Machine learning and its basics you can check out the following articles:
- What is Machine Learning?
- What does machine learning look like in real life?
- What does an end-to-end machine learning pipeline look like?
- Bias-Variance trade-off?
In this article we will mainly focus on cleaning up the continuous features and in the next article, we will clean up the categorical features. Cleaning up the data means that we will do a quick pass over the data to check that it can fit the model.
Remember that we are using the Titanic Dataset, Read the following article if you want to know more about Titanic and how to read the Dataset from a CSV file.
So the first this we will do is import a few packages that will help us in cleaning and visualizing data.