Remy CanarioinDataDrivenInvestorFirth’s Logistic Regression: Classification with Datasets that are Small, Imbalanced or SeparatedGet better classification results with problematic datasets using Firth’s logistic regressionFeb 7, 20201Feb 7, 20201
Remy CanarioUPDATE: Converting Python DataFrames to R with RPY2How to create R dataframes in Python — the correct way.Dec 25, 20192Dec 25, 20192
Remy CanarioThe Chow Test — Dealing with Heterogeneity in PythonIs a dummy variable good enough to deal with the heterogeneity in your data? Use a Chow test to find out.Dec 19, 20191Dec 19, 20191
Remy CanarioThere Is Never a Reason to Use KNN for ClassificationAccording to the no free lunch theorem, there can be no machine learning algorithm that is better at every task than any other…Dec 12, 2019Dec 12, 2019
Remy CanarioWhat’s Wrong with Data Science?A breakdown of the very best books criticizing data published to date.Dec 2, 20192Dec 2, 20192
Remy CanarioinAnalytics VidhyaMultiple Imputation: a Better Way to Fill NAsFill NAs without distorting variables’ distributions with with this two-part process.Nov 24, 20191Nov 24, 20191
Remy CanarioThe Top 5 Books ABOUT Data Science for Total NovicesA short reading list about what data science is, does and will be (no technical guides allowed).Oct 7, 2019Oct 7, 2019
Remy CanarioNo, Mercury in Retrograde Didn’t Cause the NYC BlackoutA statistical take on astrology’s most notorious bad boy.Jul 18, 2019Jul 18, 2019
Remy CanarioTests for Heteroskedasticity in PythonBreusch-Pagan & White tests are hassle-free in Python and give yes/no answers re: heteroskedasticity.Jul 2, 20191Jul 2, 20191