Published in Towards Data Science
Think Correlation Isn’t Causation? Meet Partial Correlation
Despite being so powerful, partial correlation is perhaps the most underrated tool in data science
Jan 8

Published in Towards Data Science
Why “Statistical Significance” Is Pointless
Here’s a better framework for data-driven decision-making
Dec 1, 2024

Published in Towards Data Science
Non-Linearity: Can Linear Regression Compete With Gradient Boosting?
Linear models can handle non-linear relationships thanks to data pre-processing. But how close can they get to more sophisticated models?
Oct 11, 2024

Published in Towards Data Science
Causality in ML Models: Introducing Monotonic Constraints
Monotonic constraints are key to making machine learning models actionable, yet they are still quite unused
Sep 6, 2024

Published in Towards Data Science
What’s Wrong With R-Squared (And How to Fix It)
Even if you think you are using R-Squared out-of-sample, you are not. Here is why
Aug 7, 2024

Published in Towards Data Science
Forget Statistical Tests: A/B Testing Is All About Simulations
How simulations outperform traditional stats in that they are easier to understand, more flexible, and economically meaningful
Jul 4, 2024

Published in Towards Data Science
Hypothesis Testing Explained (How I Wish It Was Explained to Me)
Most resources focus on things like Confidence and Power. But they don’t really matter: here is what you should care about
May 13, 2024

Published in Towards Data Science
Why You Should Never Use Cross-Validation
In real-world applications, using randomized cross-validation is always a bad choice. Here is why.
Mar 27, 2024

Published in Towards Data Science
Are Outliers Harder To Predict?
An empirical analysis about whether ML models make more mistakes when making predictions on outliers
Feb 4, 2024

Published in Towards Data Science
“Approximate-Predictions” Make Feature Selection Radically Faster
Feature selection is so slow because it requires the creation of many models. Find out how to make it blazingly faster thanks to…
Nov 17, 2023