Amelia DobronyiCollaborative Filtering or Content Filtering? Maybe Use BothFor a recent data science project, I adapted the well-known MovieLens 100k Dataset to build a recommendation system. This data is…May 28, 2023May 28, 2023
Amelia DobronyiOne way to deal with missing data? Treat it as suchRecently I was looking at data from the National 2009 H1N1 Flu Survey. Unsurprisingly, there was a lot of missing data — you tend to see…Dec 1, 2022Dec 1, 2022
Amelia DobronyiPairs Plot Insights: An Example for Linear RegressionThe pairs plot (a matrix of scatterplots) is a useful tool for visualizing the relationships between continuous or numerical variables. For…Sep 20, 2022Sep 20, 2022
Amelia DobronyiContinuous to Categorical: Using Bucketing in Pandas to Improve AnalysisOne of the facets of exploratory data analysis (EDA) is examining relationships between variables. However, those relationships aren’t…Jul 19, 2022Jul 19, 2022