Dataset Resources for Data Science Projects: Part 1

Fahad Masood Reda
Feb 6, 2022

As data science practitioners, most of us struggle to find the right dataset for our projects. Lately, I’ve been getting a lot of messages from my followers asking for guidance on finding the best dataset for their projects, so I gathered a few websites that offer a good number of datasets.

1- KDDCUP Archive

Link: https://kdd.ics.uci.edu/

KDD Cup is an annual competition for data science practitioners. I highly recommend picking your dataset from this site, because the data comes in a raw format and is usually large in volume, which makes it a good fit if you are doing a Master's or PhD.

You can also browse the datasets by type (text data for NLP projects, images for computer vision, etc.).

Text and Time Series Datasets
Image and Multivariate datasets

2- Google Dataset Search

Link: https://datasetsearch.research.google.com/

A free dataset search engine from Google that indexes over 25 million datasets.

Retail-related datasets from Google
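If you want to share or bookmark a search, here is a minimal Python sketch that builds a search link for a set of keywords. The `dataset_search_url` helper is hypothetical, and the `query` parameter name is an assumption based on the URLs the site shows in the browser, not a documented API.

```python
from urllib.parse import urlencode

# Search page of Google Dataset Search. The "query" parameter name is an
# assumption taken from the URLs visible in the browser, not an official API.
BASE_URL = "https://datasetsearch.research.google.com/search"


def dataset_search_url(keywords: str) -> str:
    """Return a Google Dataset Search URL for the given keywords."""
    # urlencode escapes spaces and special characters in the keywords.
    return f"{BASE_URL}?{urlencode({'query': keywords})}"


if __name__ == "__main__":
    print(dataset_search_url("retail sales"))
```

Opening the printed link in a browser should show the same results as typing the keywords into the search box.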

3- UCI Machine Learning Repository

Link: https://archive.ics.uci.edu/ml/index.php

All the datasets were uploaded by users, and you can filter them by attribute type, data type, and subject area.

https://archive.ics.uci.edu/ml/datasets.php
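Many datasets in this repository are plain comma-separated files without a header row, so they are easy to read with the standard library. The sketch below is only an illustration: the `IRIS_URL` path is my assumption about where the classic Iris data file lives, and `parse_rows` and `download_rows` are hypothetical helper names, so check the dataset page for the actual file location first.

```python
import csv
import io
from urllib.request import urlopen

# Assumed location of the classic Iris data file; verify it on the dataset
# page before relying on it.
IRIS_URL = "https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data"


def parse_rows(text):
    """Parse header-less CSV text into lists of strings, skipping blank lines."""
    return [row for row in csv.reader(io.StringIO(text)) if row]


def download_rows(url=IRIS_URL):
    """Download a CSV data file and parse it (requires network access)."""
    with urlopen(url) as response:
        return parse_rows(response.read().decode("utf-8"))


# Example usage (needs an internet connection):
# rows = download_rows()
# print(len(rows), rows[0])
```

Keeping the parsing separate from the download makes it easy to try the parser on a few pasted lines before fetching a full file.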

4- OpenML

Link: https://www.openml.org/

An online machine learning platform for sharing and organizing data, with more than 21,000 datasets.

5- Wikipedia

Link: https://en.wikipedia.org/wiki/List_of_datasets_for_machine-learning_research

A very well-organized list of different types of datasets, maintained on Wikipedia.

Machine Learning Datasets from Wikipedia

Thanks for reading this article. I hope you liked it. Stay tuned for Part 2, and make sure to like it (clap 👏) and share it with your friends.

You can check my social media accounts and courses on this link

Fahad Masood Reda

Founder of Fahad Academy | Educator | Data Science & MIS Mentor 📊. I write about Data Science and MIS. Follow me on: https://dope.link/themis