33
Someone Made a Dataset of One Million Bluesky Posts for 'Machine Learning Research'
(www.404media.co)
"We did it, Patrick! We made a technological breakthrough!"
A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.
tbh this can happen with everything now so..
i'm not sure what would be the solution, sadly.
The same can and will happen with the Fediverse right?
Probably already happened
Probably not. An enormous amount of publicly availablr data on a single instance, like with bluesky, is an AI scraper's wet dream.
The fediverse, in contrast, has much fewer people spread around perhaps HUNDREDS of instances. That's a much less appealing effort to reward ratio for the scrapers..
I see. Probably mastodon.social gets scraped, then 🫣