Gerard Barbalich | Using p-values for A/B tests | Problem: We have two sets of customers, those who have progressed through a marketing channel, and those who haven’t. We want to know if… | Jul 9, 2020
Gerard Barbalich | Using Levenshtein distance to cluster text data for de-duplication | Problem: We have data that contains near-duplicate entries. We want to group similar data together, e.g. the common spellings of Gerard… | Jul 2, 2020
Gerard Barbalich | Clustering and visualising customers using K-Means | I experimented with some clustering and visualisation, using the Mall Customers dataset (link). First, I imported the dataset, and took a… | Jun 22, 2020