Sam WilsoninTowards Data ScienceDealing with Leaky, Missing Data in a Production EnvironmentAs a consultant, I don’t always have control over the data I receive. Going back and forth with a client can only get you so far. At a…Oct 19, 2021Oct 19, 2021
Sam WilsoninTowards Data ScienceA Parallel Implementation of Bayesian OptimizationThe concept of ‘optimization’ is central to data science. We minimize loss by optimizing weights in a neural network. We optimize…Sep 20, 2020Sep 20, 2020
Sam WilsoninTowards Data ScienceMultiple Imputation with Random Forests in PythonMissing data is a common problem in data science — one that tends to cause a lot of headaches. Some algorithms simply can’t handle it —…Sep 14, 20206Sep 14, 20206