Where to Find Awesome Machine Learning Datasets

Over 50 open datasets for you to try out

Serokell
The Startup

--

Photo by Chris Liverani on Unsplash.

Good machine learning research starts with an exceptional dataset. There is no need to spend your evening crafting your own set of data in MySQL or, god forbid, Excel. Basically, anything from COVID-19 stats to Harry Potter spells (made it myself!) exists in a form of a database. You just need to find it.

Let me help you — in this post, you will learn where to find datasets for machine learning research.

Top general ML dataset aggregators

Dataset aggregators collect thousands of databases for various purposes.

1. Kaggle

Kaggle, being updated by enthusiasts every day, has one of the largest dataset libraries online.

Kaggle is a community-driven machine learning platform. It contains plenty of tutorials that cover hundreds of different real-life ML problems. It is true that quality may vary. However, all the data is completely free. You can also upload your own dataset there.

2. Google Dataset Search

--

--

Serokell
The Startup

Serokell is a software development company focused on building innovative solutions for complex problems. Come visit us at serokell.io!