Day 16 of 30 days of Data Engineering Series with Projects
13 min read · Dec 24, 2022
Welcome back, peeps, to Day 16 of the Data Engineering Series with Projects!
In this post, we will cover:
Data Pre-processing
Handling missing values
Data Cleaning
Mean/mode/median Imputation
Hot Deck Imputation
Rescale Data
Binarize Data
Regression Imputation
Stochastic regression imputation
Feature Scaling
The prerequisite for Day 16 is to complete Days 1–15 (links below):
Day 2 : Complete Python for Data Engineering — Part 1
Day 3 : Complete Advanced Python for Data Engineering — Part 2