this post was submitted on 27 Nov 2024
40 points (95.5% liked)

Fuck AI

1443 readers
271 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

founded 8 months ago
MODERATORS
 

Bluesky may have said it won't use user data to train generative AI, but someone else just published a dataset of million Bluesky posts for "machine learning research". Already very popular dataset, your data may be scraped

Without paywall

you are viewing a single comment's thread
view the rest of the comments
[–] ladicius@lemmy.world 1 points 4 hours ago

Is that a problem for a proper scraper? Give the machine a list of domains and some hints about the relevant protocols, and then the computer runs until the end of the list.