We Need to Change How Image Datasets Are Curated

Why many gold-standard computer vision datasets, such as ImageNet, are flawed

Catherine Yeo
Published in Fair Bytes
4 min read · Jul 2, 2020

ImageNet

Although it was created in 2009, ImageNet remains one of the most influential datasets in computer vision and AI today. With more than 14 million human-annotated images, it has become the reference point against which other large-scale AI datasets are measured. Every year…

Catherine Yeo
Harvard | Book Author | AI/ML writing in @fairbytes @towardsdatascience | More at catherinehyeo.com