MLWhiz
Published in

MLWhiz

Photo by on

The best open-source datasets to train NLP/text models

NLP is such an exciting domain right now. Yet, it is painfully difficult to master. When I started with NLP a few years back, the main problem I faced was the dearth of proper guidance and the excessive breadth of the domain. I just got lost into various papers and code and tried to start by taking everything in and not coding. Well, that was a mistake and if I were given proper guidance, I would…

--

--

--

ML, NLP, AI

Recommended from Medium

Hands-on TensorFlow 2.0: Multi-Label Classifications with MLP

Boosting performance by combining trees with GLM: A benchmarking analysis

Independent and Dependent Variables in Machine Learning

Realtime Image Moderation At Scale Using AWS Rekognition

Train a Neural Network to classify images and OpenVINO CPU inferencing in 10mins!

Heartbeat Newsletter: Volume 7

Not Enough Data To Do Machine Learning? Think Again

Machine Learning & AI Applications in Oncology

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Rahul Agarwal

Rahul Agarwal

4M Views. Bridging the gap between Data Science and Intuition. MLE@FB, Ex-WalmartLabs, Citi. Connect on Twitter @mlwhiz

More from Medium

Continuous Machine Learning on Huggingface Transformer with DVC including Weights & Biases…

Huggingface Transformers Interpretability with Captum

Huggingface Transformers/Bert Pytorch: Play and Serve

How to Paraphrase Documents using Transformers