Published in Fair Bytes

We Need to Change How Image Datasets are Curated

Why many gold-standard computer vision datasets, such as ImageNet, are flawed


Although it was created in 2009, ImageNet remains one of the most influential datasets in computer vision and AI today. Consisting of more than 14 million human-annotated images, it has become a de facto standard for large-scale datasets in AI. Every year…




A Medium publication sharing byte-sized stories about research, resources, and issues related to fairness & ethics of AI

Catherine Yeo


Computer Science @ Harvard | I write about AI/ML in @fairbytes @towardsdatascience | Storyteller, innovator, creator | Visit me at catherinehyeo.com
