New study shows large language models have high toxic probabilities and leak private information (techxplore.com)

submitted 10 months ago by L4s@lemmy.world to c/technology@lemmy.world

7 comments

Generative AI may be riddled with hallucinations, misinformation, and bias, but that didn't stop over half of respondents in a recent global study from saying that they would use this nascent technology for sensitive areas ...

Buddahriffic@lemmy.world 2 points 10 months ago

Maybe the next big revolution will be to have two of them take turns, first giving their best response to your prompt and then responding to each other's responses. Then they could indicate when a response is controversial and would statistically lead to an argument if it were posted in the places they were trained on.

Though I suppose you can do this with a single one and just ask if there's a counterargument to what it just said. "If you were another user on the internet that thought your previous response was the dumbest thing you've ever seen, what would you say?"
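A rough sketch of that turn-taking idea, in case it helps make it concrete. Everything here is an assumption for illustration: `Model` is a hypothetical stand-in for whatever function sends a prompt to an LLM and returns its reply, `debate` and `CRITIC_PROMPT` are made-up names, and the prompt wording is adapted from the question above. It does not attempt the "flag controversial responses" part.

```python
from typing import Callable, List

# Hypothetical stand-in for any text-generation call: prompt in, reply out.
Model = Callable[[str], str]

# Assumed prompt wording, adapted from the question in the comment.
CRITIC_PROMPT = (
    "If you were another user on the internet that thought the following "
    "response was the dumbest thing you've ever seen, what would you say?\n\n"
)


def debate(prompt: str, model_a: Model, model_b: Model, rounds: int = 2) -> List[str]:
    """Return a transcript: model_a answers the prompt, then the two models
    alternate (model_b first), each criticizing the previous turn's text."""
    transcript = [model_a(prompt)]
    speakers = [model_b, model_a]
    for turn in range(rounds):
        critic = speakers[turn % 2]
        transcript.append(critic(CRITIC_PROMPT + transcript[-1]))
    return transcript
```

Passing the same function as both `model_a` and `model_b` gives the single-model version, where one LLM is asked to argue against its own previous answer.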

It also just occurred to me that it's because of moderators that you can even give rules like that. The LLM can see that posts in x location are subject to certain rules, but those rules would only have an effect if they are actually followed or enforced. If there were a rule that you can't say "fuck" but everyone said it anyway, then an LLM might conclude that "don't say fuck" has no effect on output at all. Though I am making some big assumptions here about how LLMs are trained to follow rules.

this post was submitted on 26 Aug 2023

116 points (86.7% liked)
