Samuele MazzantiinTowards Data ScienceHypothesis Testing Explained (How I Wish It Was Explained to Me)Most resources focus on things like Confidence and Power. But they don’t really matter: here is what you should care about·8 min read·1 day ago--1--1
Samuele MazzantiinTowards Data ScienceWhy You Should Never Use Cross-ValidationIn real-world applications, using randomized cross-validation is always a bad choice. Here is why.·12 min read·Mar 27, 2024--32--32
Samuele MazzantiinTowards Data ScienceAre Outliers Harder To Predict?An empirical analysis about whether ML models make more mistakes when making predictions on outliers·8 min read·Feb 4, 2024--11--11
Samuele MazzantiinTowards Data Science“Approximate-Predictions” Make Feature Selection Radically FasterFeature selection is so slow because it requires the creation of many models. Find out how to make it blazingly faster thanks to…·10 min read·Nov 17, 2023--20--20
Samuele MazzantiinTowards Data ScienceYour Dataset Has Missing Values? Do Nothing!Models can handle missing values out-of-the-box more effectively than imputation methods. An empirical proof·10 min read·Oct 9, 2023--7--7
Samuele MazzantiinTowards Data ScienceWhich Features Are Harmful For Your Classification Model?How to calculate the Error Contribution of the features of a classifier, with the goal of understanding and improving the model·14 min read·Sep 12, 2023--6--6
Samuele MazzantiinTowards Data ScienceYour Features Are Important? It Doesn’t Mean They Are Good“Feature Importance” is not enough. You also need to look at “Error Contribution” if you want to know which features are beneficial for…·10 min read·Aug 21, 2023--13--13
Samuele MazzantiinTowards Data ScienceWhen You Should Prefer “Thompson Sampling” Over A/B TestsAn in-depth explanation of “Thompson Sampling”, a more efficient alternative to A/B testing for online learning·8 min read·Jun 13, 2023--6--6
Samuele MazzantiinTowards Data ScienceIs F1-Score Really Better than Accuracy?What’s the cost of being wrong (and the gain of being right) according to different metrics·10 min read·Apr 18, 2023--7--7
Samuele MazzantiinTowards Data Science12 Ways to Test Your Forecasts like A ProHow to find the best performance estimation approach for time-series forecasts among 12 strategies proposed in the literature. With Python…·11 min read·Mar 7, 2023--6--6