Research Papers related with Encoder Decoder (Part 5)

Thet Su
3 min readJun 16, 2024

--

Part 1 : Input Embedding and Positional Embedding

Part 2 : Encoder Stage

Part 3 : Decoder Stage

Part 4 : Pros and Cons, Resources

Part 5 : Research Papers using Transformers

Machine Translation

  • Paper: “Attention is All You Need” (2017) by Vaswani et al.
  • Application: Translating text from one language to another.
  • Link : https://arxiv.org/abs/1706.03762

Text Summarization

  • Paper: “BERTSUM: Extensively Pretrained Encoder-Decoder Architecture for Text Summarization” (2019) by Liu and Lapata.
  • Application: Generating concise summaries of longer texts.
  • Link : https://aclanthology.org/D19-1387.pdf

Text Generation

  • Paper: “Language Models are Few-Shot Learners” (2020) by Brown et al.
  • Application: Generating human-like text based on a given prompt.
  • Link : https://arxiv.org/abs/2005.14165

Question Answering

  • Paper: “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding” (2019) by Devlin et al.
  • Application: Answering questions based on provided context.
  • Link Paper : https://arxiv.org/abs/1810.04805

Text Classification

  • Paper: “XLNet: Generalized Autoregressive Pretraining for Language Understanding” (2019) by Yang et al.
  • Application: Categorizing text into predefined classes.
  • Link : https://arxiv.org/abs/1906.08237

Image Captioning

  • Paper: “Show, Attend and Tell: Neural Image Caption Generation with Visual Attention” (2015) by Xu et al.
  • Application: Generating descriptive captions for images.
  • Paper Link : https://arxiv.org/pdf/1502.03044

Speech Recognition

  • Paper: “Transformer-based Acoustic Modeling for Hybrid Speech Recognition” (2020) by Zhou et al.
  • Application: Converting spoken language into written text.

Document Understanding

  • Paper: “Longformer: The Long-Document Transformer” (2020) by Beltagy et al.
  • Application: Extracting and summarizing key information from documents.

Conversational AI (Chatbots)

  • Paper: “DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation” (2020) by Zhang et al.
  • Application: Developing systems that can carry on a natural conversation with users.
From Paper with code

Code Generation

  • Paper: “Codex: A Large-Scale Neural Network for Code Generation” (2021) by Chen et al.
  • Application: Automatically generating code snippets or entire programs from natural language descriptions.

Happy Learning

--

--