The Growing Revolt Against AI Data Scraping

The Public Turn Against AI Scraping

Paul DelSignore
The Generator

--

made in Midjourney

When new technologies are introduced into society, they usually undergo several stages of growth and acceptance.

Generative AI is a new kind of animal, bringing with it a new set of challenges. How it shapes our understanding of ‘creativity’ is something we haven’t seen before.

So when ChatGPT hit the cultural mainstream, it was a phenomenon that sparked excitement and uncertainty.

But today, we have entered a new phase in our relationship with generative AI. We are beginning to witness a genuine cultural backlash that centers around one particular aspect:

The Training data.

The question on the table that everyone is grappling with is as follows:

Should unapproved content be used as training data for LLMs, and does it qualify as fair use?

First Blood

Before ChatGPT and the generative text craze, the art world had already been pushing back on AI image tools related to scraping.

In January 2023, the first legal action was taken when three artists filed lawsuits against Stability AI, Midjourney, and DeviantArt. The artists claimed that the companies had used copyrighted images to…

--

--

Paul DelSignore
The Generator

Ramblings on the intersection of technology and culture • Creative Technologist :: https://medium.com/@pdelsignore/membership