Building machine learning models for big data has gained lot of pace in the recent years, I like to demonstrate univariate linear regression in Pyspark along with my other posts in R and Python.
The scope of this article is to demonstrate various approaches we can build univariate linear regression in R. We will see the formulas used for each approach and…
Simple Linear Regression has five key assumptions in predicting the dependent/target variable. They are
1. Linear relationship
2. Multivariate Normality
Introduction
This demonstration is about clustering using Kmeans and also determining the optimal number of clusters (k) using Silhouette Method.