Gaurav Ghati in Towards Data Science: Head Pruning in Transformer models! In this article, we are going to look at pruning attention heads in transformer models like BERT. (6 min read · May 11, 2020)
Gaurav Ghati: Comparison between BERT, GPT-2 and ELMo. The recent progress in NLP in terms of model architecture has led us to breakthrough ideas like BERT architecture. Among those ideas, the… (7 min read · May 3, 2020)
Gaurav Ghati in Towards Data Science: Using Github Pages for creating global APIs (3 min read · Apr 1, 2020)