this post was submitted on 29 Jun 2023
1 points (100.0% liked)
Technology
59676 readers
3209 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Scraping social media posts and reddit posts doesn’t sound like stealing, they’re public posts.
Just because something is posted online doesn't mean it can be taken a resold. Copyright law prevents that. Of course, copyright law and generative AI is new and gray area.
I doubt it’s only about some Reddit posts. The scrapping was done on the whole web, capturing everything it could. So besides stealing data and presenting it as its own, it seems to have collected some even more problematic data which wasn’t properly protected.
if it was unsecured it's basically public. whomever put that data on a publicly accessible server is at fault