Akashdeep Singh Jaswal in Towards Data Science: Imagining a world without Transformers — Single Headed Attention RNN. Distilling key ideas from one of the most entertaining NLP papers picturing a world without the BERT family of models. 7 min read · Jan 7, 2020
Akashdeep Singh Jaswal in Towards Data Science: Byte Pair Encoding — The Dark Horse of Modern NLP. Deriving meaning from rare infrequent words. 3 min read · Nov 22, 2019