TL;DR: Use a large, regularized LSTM language model with projection layers and an importance-sampling approximation of the softmax, trained on a large dataset, to beat…
We recently watched Jeff Dean’s lecture for YC AI (found here) as a topic for discussion. As the head…