John Clements in Towards Data Science | Bootstrapping Basics | Non-Parametric Resampling, Explained | Mar 2, 2022
John Clements in Towards Data Science | API Interaction with R | Vignette Demonstrating Interaction with the NHL API | Oct 22, 2021
John Clements in Towards Data Science | K-Nearest Neighbors (K-NN) Explained | The intuition behind a simple, but powerful algorithm | Feb 8, 2021
John Clements in Towards Data Science | Principal Components Analysis Explained | What it is, Why it’s useful, and How to use it | Sep 17, 2020
John Clements in Towards Data Science | Origins of AutoML: Best Subset Selection | And the Perils of Post-Selection Inference | Aug 13, 2020
John Clements in Towards Data Science | Data-backed articles on American policing and race | What data says about an injustice | Jul 9, 2020
John Clements in Towards Data Science | Intro to Markov Chain Monte Carlo | MCMC Explained and Applied to Logistic Regression | May 12, 2020
John Clements in Towards Data Science | Bayesian Stats 101 for Data Scientists | An alternative perspective on statistics and probability | Apr 14, 2020
John Clements in Towards Data Science | Blind Data Mining is Bad. | The math behind the multiple comparisons problem and what to do about it | Mar 16, 2020
John Clements in Towards Data Science | Gradient Boosting from Almost Scratch | Over the past month, I’ve been slowly working my way through Joel Grus’ Data Science from Scratch 2nd Edition and I have thoroughly… | Jan 27, 2020