Gaurav GhatiinTowards Data ScienceHead Pruning in Transformer models !In this article, we are going to look at pruning attention heads in transformers models like BERT.May 11, 2020May 11, 2020
Gaurav GhatiComparison between BERT, GPT-2 and ELMoThe recent progress in NLP in terms of model architecture had led us to breakthrough ideas like BERT architecture. Among those ideas, the…May 3, 2020May 3, 2020
Gaurav GhatiinTowards Data ScienceUsing Github Pages for creating global APIsApr 1, 20202Apr 1, 20202