Processing longer forms of text with BERT-like models require us to rethink the…
Pre-training of Deep Bidirectional Transformers for Language Understanding (link)
These were the top 10 stories published by DAIR.AI in 2020. You can also dive into monthly archives for 2020 by using the calendar at the top of this page.