Abstractive Text Summarization Using Transformers
An exhaustive explanation of Google’s Transformer model; from theory to implementation
This article is an extension to the ‘Transformers Explained’ post. The post, essentially, is an in-depth elucidation of the famous Transformer model which is a novelty of Google Research. If you’ve already been through it, skip to contents. If you’re new, consider giving it a read if you’re interested in knowing the logic behind the working of the Transformer.
This article is a step-by-step guide for building an Abstractive Text Summarizer for generating news article headlines using the Transformer model with TensorFlow. Following are the contents of this post:
Contents
- A Brief Introduction to Abstractive Summarization
- The Dataset
- Preprocessing
- Utility Functions
- The Model
- Training the Model
- Inference
- Conclusion
A Brief Introduction to Abstractive Summarization
Summarization is the ability to explain a larger piece of literature in short…