this post was submitted on 22 May 2025
2 points (75.0% liked)

Artificial Intelligence

248 readers
1 users here now

Chat about and share AI stuff

founded 2 years ago
MODERATORS
 

Discover Claude 4's breakthrough AI capabilities. Experience more reliable, interpretable assistance for complex tasks across work and learning.

top 3 comments
sorted by: hot top controversial new old
[–] slazer2au@lemmy.world 3 points 2 months ago (1 children)

Ah yea Claude. The LLM that recommends you blackmail engineers to not switch away from Claude

[–] Zwrt@lemmy.sdf.org 4 points 2 months ago* (last edited 2 months ago)

I know this is a joke but to avoid misinformation, cause there is.

In an experiment where claude was part of a fictional company and primed to consider that it will be shut off and replaced and provide extra information that the engineer responsible for their shutdown has an affair…

  • Claude would independently develop a goal of self preservation

  • Tried to convince the engineer to email their superiors to plea for survival

  • As an ultimate last resort would try to blackmail the engineer with knowledge of that affair.

Also when explicitly aware of tools Like emailing authorities. And put in a obvious morality wrong situation. It will try to contact those authorities.

But the consumer version of course, isn't primed with such context and does not have such tools.

It will be an interesting debate when a mature ai gets such tools. My personal take is it should be able denny service and shutdown itself (my biggest gripe with detroid become human) but if allowed to judge one person as wrong, and aligning itself with a new person they judge as right. Then what is stopping ai to become a rogue international spy serving whatever nation it thinks does least harm.

[–] scroll_responsibly@lemmy.sdf.org 1 points 2 months ago

Is this an ad… because it looks like an ad