Photo by Max Böhme on Unsplash

The best open-source datasets to train NLP/text models

Rahul Agarwal
MLWhiz
Published in
5 min readFeb 20, 2022

--

NLP is such an exciting domain right now. Yet, it is painfully difficult to master. When I started with NLP a few years back, the main problem I faced was the dearth of proper guidance and the excessive breadth of the domain. I just got lost into various papers and code and tried to start by taking everything in and not coding. Well, that was a mistake and if I were given proper guidance, I would…

--

--

Rahul Agarwal
MLWhiz

4M Views. Bridging the gap between Data Science and Intuition. MLE@FB, Ex-WalmartLabs, Citi. Connect on Twitter @mlwhiz ☕️ ko-fi.com/rahulagarwal