Principal Component Analysis (PCA) 101, using R

Peter Nistrup
Towards Data Science
8 min read · Jan 29, 2019

Improving predictability and classification one dimension at a time! “Visualize” 30 dimensions using a 2D-plot!

Basic 2D PCA-plot showing clustering of “Benign” and “Malignant” tumors across 30 features.

Setup

For this article, we’ll be using the Breast Cancer Wisconsin data set from the UCI Machine Learning Repository. Go ahead and…
