Common Crawl Criticized for 'Quietly Funneling Paywalled Articles to AI Developers'

For more than a decade, the nonprofit Common Crawl “has been scraping billions of webpages to build a massive archive of the internet,” notes the Atlantic, making it freely available for research.
“In recent years, however, this archive has been put to a controversial purpose: AI companies including OpenAI, Google, Anthropic, Nvidia, Meta, and Amazon have used it to train large language models.

“In the process, my reporting has found, Common Crawl has opened a back door for AI companies to…

Source link

Common Crawl Criticized for ‘Quietly Funneling Paywalled Articles to AI Developers’

Slashdot

Slashdot

Events

Trending

35+ Mac apps – build your own bundle from $2.50

Issue Subscribed 5% On Day 1 So Far

Lehar Footwears announced H1FY26 and Q2FY26 results, Reports Strong Revenue and PAT Growth

Grab to invest $60m in Vay’s remote-driven EV service

Useful Links

Categories

Startups

Legal

Popular This Week

Editor's Pick

What Are You Looking For?

Recent

What Are You Looking For?

Recent

What Are You Looking For?

Recent

Common Crawl Criticized for ‘Quietly Funneling Paywalled Articles to AI Developers’

Here’s how Google Messages’ upcoming Insights feature will work

How to watch Sunderland vs Arsenal live streams: Premier League 2025/26

You may also like

Events

Trending

Useful Links

Categories

Startups

Legal

Popular This Week

Editor's Pick