AI companies are desperate for data and they’ll go to any length to find it

Enrique Dans
2 min read · Apr 7, 2024

IMAGE: A hyper-realistic image of a humongous library with several stories and endless shelves full of books, capturing the grandeur and vastness of the space dedicated to knowledge and learning

It’s important to have an idea of the scale of the data that companies working on generative artificial intelligence algorithms need, and some recent articles I’ve come across can really help.

The Verge has this piece, “OpenAI transcribed over a million hours of YouTube videos to train GPT-4”, which gives some insight into the level of…

Enrique Dans

Professor of Innovation at IE Business School and blogger (in English here and in Spanish at enriquedans.com)