TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Member-only story

Topic Modelling in Business Intelligence: FASTopic and BERTopic in Code

A comparison of two cutting-edge dynamic topic models solving consumer complaints classification exercise

Petr Korab
TDS Archive
Published in
10 min readJan 22, 2025

--

Source: Freepic, Image by rawpixel.com

Customer reviews about products and services provide valuable information about customer satisfaction. They provide insight into what should be improved across the whole product development. Dynamic topic models in business intelligence can identify key product qualities and other satisfaction factors, cluster them into categories, and evaluate how business decisions materialized in customer satisfaction over time. This is highly valuable information not only for product managers.

This article will compare two of the latest topic models to classify customer complaints data. BERTopic by Maarten Grootendorst (2022) and the recent FASTopic by Xiaobao Wu et al. (2024) presented at last year’s NeurIPS, are the current leading models for topic analytics of customer data. For these models, we’ll explore in Python code:

  • how to effectively preprocess data
  • how to train a Bigram topic model for customer complaint analysis
  • how to model topic activity over time.

1. Customer complaints data in companies

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Petr Korab
Petr Korab

Written by Petr Korab

Python engineer /NLP / data Viz. Text Mining Stories founder textminingstories.com

Responses (3)