Data Preprocessing Terms
For Data Mining & Machine Learning
Data Preprocessing is the process of making raw data suitable for Data Mining. Common approaches:
Aggregation — Combining multiple attributes/objects into one.
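As a rough illustration of aggregation, the sketch below collapses several per-day objects into one summary object per store. The records, the field layout, and the helper name aggregate_by_store are hypothetical and chosen only for this example.

```python
from collections import defaultdict

# Hypothetical transaction records: (store, day, amount).
transactions = [
    ("A", "2024-01-01", 10.0),
    ("A", "2024-01-02", 15.0),
    ("B", "2024-01-01", 7.5),
    ("B", "2024-01-02", 2.5),
    ("B", "2024-01-03", 5.0),
]


def aggregate_by_store(rows):
    """Combine all daily rows of a store into a single summary object."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for store, _day, amount in rows:
        totals[store] += amount
        counts[store] += 1
    # One aggregated object per store replaces many daily objects.
    return {store: {"total": totals[store], "n": counts[store]} for store in totals}


store_summary = aggregate_by_store(transactions)
```

Five daily objects become two per-store objects, which reduces the data volume and smooths out day-to-day variation at the cost of losing the daily detail.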
Sampling — Selection of a representative data subset to be analyzed. Types:
- Simple Random
- Sampling with or without replacement
- Stratified
- Progressive
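The sampling types listed above can be sketched with Python's standard library as follows. This is a minimal illustration, not a reference implementation: the toy population, the helper names stratified_sample and progressive_sample, and the stopping rule used for progressive sampling (stop doubling once a chosen statistic changes by less than a tolerance) are all assumptions made for this example.

```python
import random
from collections import defaultdict

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical population: (id, class label); one object in ten is "rare".
population = [(i, "rare" if i % 10 == 0 else "common") for i in range(1000)]

# Simple random sampling without replacement: no object is picked twice.
without_repl = rng.sample(population, k=50)

# Simple random sampling with replacement: the same object may recur.
with_repl = rng.choices(population, k=50)


def stratified_sample(rows, key, fraction, rng):
    """Sample the same fraction from every stratum (group) of the data."""
    strata = defaultdict(list)
    for row in rows:
        strata[key(row)].append(row)
    sample = []
    for members in strata.values():
        k = max(1, round(len(members) * fraction))  # keep at least one per stratum
        sample.extend(rng.sample(members, k))
    return sample


def progressive_sample(rows, start, tolerance, statistic, rng):
    """Double the sample size until the statistic stops changing much."""
    shuffled = rows[:]
    rng.shuffle(shuffled)
    size = start
    previous = statistic(shuffled[:size])
    while size * 2 <= len(shuffled):
        size *= 2
        current = statistic(shuffled[:size])
        if abs(current - previous) <= tolerance:
            break  # the larger sample no longer changes the estimate much
        previous = current
    return shuffled[:size]


def mean_id(rows):
    return sum(r[0] for r in rows) / len(rows)


stratified = stratified_sample(population, key=lambda r: r[1], fraction=0.05, rng=rng)
progressive = progressive_sample(population, start=20, tolerance=25.0,
                                 statistic=mean_id, rng=rng)
```

Stratified sampling guarantees that the rare class is represented in proportion to its size, which a small simple random sample may fail to do. In practice the stopping criterion for progressive sampling is often the accuracy of a model trained on the sample rather than a simple statistic.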