Abstractive Text Summarization Using Transformers

An exhaustive explanation of Google’s Transformer model; from theory to implementation

Published in

The Startup

7 min readJun 11, 2020

This article is an extension to the ‘Transformers Explained’ post. The post, essentially, is an in-depth elucidation of the famous Transformer model which is a novelty of Google Research. If you’ve already been through it, skip to contents. If you’re new, consider giving it a read if you’re interested in knowing the logic behind the working of the Transformer.

This article is a step-by-step guide for building an Abstractive Text Summarizer for generating news article headlines using the Transformer model with TensorFlow. Following are the contents of this post:

A Brief Introduction to Abstractive Summarization
The Dataset
Preprocessing
Utility Functions
The Model
Training the Model
Inference
Conclusion

A Brief Introduction to Abstractive Summarization

Summarization is the ability to explain a larger piece of literature in short…

Abstractive Text Summarization Using Transformers

An exhaustive explanation of Google’s Transformer model; from theory to implementation

Contents

A Brief Introduction to Abstractive Summarization

Written by Rohan Jagtap