Samuele MazzantiinTowards Data ScienceHypothesis Testing Explained (How I Wish It Was Explained to Me)Most resources focus on things like Confidence and Power. But they don’t really matter: here is what you should care aboutMay 134May 134
Samuele MazzantiinTowards Data ScienceWhy You Should Never Use Cross-ValidationIn real-world applications, using randomized cross-validation is always a bad choice. Here is why.Mar 2733Mar 2733
Samuele MazzantiinTowards Data ScienceAre Outliers Harder To Predict?An empirical analysis about whether ML models make more mistakes when making predictions on outliersFeb 411Feb 411
Samuele MazzantiinTowards Data Science“Approximate-Predictions” Make Feature Selection Radically FasterFeature selection is so slow because it requires the creation of many models. Find out how to make it blazingly faster thanks to…Nov 17, 202320Nov 17, 202320
Samuele MazzantiinTowards Data ScienceYour Dataset Has Missing Values? Do Nothing!Models can handle missing values out-of-the-box more effectively than imputation methods. An empirical proofOct 9, 20237Oct 9, 20237
Samuele MazzantiinTowards Data ScienceWhich Features Are Harmful For Your Classification Model?How to calculate the Error Contribution of the features of a classifier, with the goal of understanding and improving the modelSep 12, 20237Sep 12, 20237
Samuele MazzantiinTowards Data ScienceYour Features Are Important? It Doesn’t Mean They Are Good“Feature Importance” is not enough. You also need to look at “Error Contribution” if you want to know which features are beneficial for…Aug 21, 202314Aug 21, 202314
Samuele MazzantiinTowards Data ScienceWhen You Should Prefer “Thompson Sampling” Over A/B TestsAn in-depth explanation of “Thompson Sampling”, a more efficient alternative to A/B testing for online learningJun 13, 20236Jun 13, 20236
Samuele MazzantiinTowards Data ScienceIs F1-Score Really Better than Accuracy?What’s the cost of being wrong (and the gain of being right) according to different metricsApr 18, 20238Apr 18, 20238
Samuele MazzantiinTowards Data Science12 Ways to Test Your Forecasts like A ProHow to find the best performance estimation approach for time-series forecasts among 12 strategies proposed in the literature. With Python…Mar 7, 20236Mar 7, 20236