Boda YeIntroduction of Neural NetworkNeural Network is a common nonlinear algorithm in data scientists’ tool kit. Today, let’s go through what is neural network and how to…Mar 3, 2019Mar 3, 2019
Boda YeIntroduction of Recommendation System and how to build it on SparkRecommendation system seeks to predict the “rating” or “preference” a user would give to an item, and figure out items that users may…Mar 3, 2019Mar 3, 2019
Boda YeSpark Machine LearningThe first important terminology we are going to talk today is pipeline.Mar 3, 2019Mar 3, 2019
Boda YeIntroduction to SparkBefore touching Spark, there are several important terminologies need to be explained.Mar 3, 2019Mar 3, 2019
Boda YeHypothesis TestingHypothesis testing a common tool in Data scientists’ tool kit. Today, I’m going to go through the detail of famous stats theorem.Mar 3, 2019Mar 3, 2019
Boda YeLogistic RegressionLogistic Regression is a common model used on categorical response. But why not use linear regression instead?Mar 3, 2019Mar 3, 2019
Boda YeNonlinear Model II and Feature SelectionToday, we are going to talk about SVM and how to do feature selection.Mar 3, 2019Mar 3, 2019
Boda YeNonlinear Models I-Decision Tree, Random Forest and KNNLet’s start with decision tree.Mar 3, 2019Mar 3, 2019
Boda YeModel EvaluationIn today’s blog, we are going to go through how to solve over fitting and evaluate your accuracy of model.Mar 3, 2019Mar 3, 2019