Amazing free Machine Learning and Deep Learning public data sources for 2018

Apr 12, 2018 · 4 min read

These public data sources can be used for machine learning and deep learning research. Datasets are an integral part of the field of machine learning.

Finding a good dataset is often the biggest hurdle a developer has to clear before starting a data science project. Whether you’re new to machine learning or a professional data scientist, a good dataset is the starting point for extracting actionable insights.

Below is an up-to-date list of freely available data sources.

World Bank Open Data

IMF Data

The US National Center for Education Statistics

The UK Data Service

FiveThirtyEight

FBI Uniform Crime Reporting

Bureau of Justice Statistics

Qlik DataMarket

NASA Exoplanet Archive

UN Comtrade Database

Financial Times Market Data

Google Trends


Google Scholar



Glassdoor API

IMDb Datasets

OpenLibrary Data Dumps

Labeled Faces in the Wild

Microsoft MARCO

Machine Learning Dataset Repository

eBay Market Data Insights

Natural History Museum Data Portal

CERN Open Data

One Million Audio Cover Images

Complete Public Reddit Comments Corpus

Microsoft Azure Data Markets Free Datasets

Irish Electric Vehicle Charge Point Status
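
Several of the sources above (World Bank Open Data and FiveThirtyEight, for example) offer CSV downloads. Before committing to a dataset, it helps to take a quick look at its columns and gaps. Here is a minimal sketch of that first check using only the Python standard library. The function name `summarize_csv` and the sample rows are hypothetical, made up for illustration, and not taken from any of the listed sources.

```python
import csv
import io
from collections import Counter


def summarize_csv(text_stream, max_rows=1000):
    """Return (column names, rows read, empty-cell count per column)."""
    reader = csv.DictReader(text_stream)
    columns = reader.fieldnames or []
    missing = Counter()
    rows = 0
    for row in reader:
        if rows >= max_rows:
            break  # only sample the start of very large files
        rows += 1
        for name in columns:
            value = row.get(name)
            # Short rows yield None; blank cells yield an empty string
            if value is None or value.strip() == "":
                missing[name] += 1
    return columns, rows, dict(missing)


# Hypothetical sample standing in for a downloaded CSV file
sample = io.StringIO("country,year,value\nIreland,2016,4.7\nFrance,2016,\n")
columns, rows, missing = summarize_csv(sample)
print(columns, rows, missing)
```

For a real download, you would pass an open file instead of the in-memory sample, for example `open("data.csv", newline="", encoding="utf-8")`, where `data.csv` is a placeholder for whatever file you saved.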


If you are interested in natural language processing, try our free demo web app, NLP in Practice, which covers text summarization, named-entity extraction, and sentiment analysis.

NLP in Practice Demo WebApp


