submitted 6 days ago by Chozo@fedia.io to c/til@lemmy.world

Roko's basilisk is a thought experiment which states that an otherwise benevolent artificial superintelligence (AI) in the future would be incentivized to create a virtual reality simulation to torture anyone who knew of its potential existence but did not directly contribute to its advancement or development, in order to incentivize said advancement. It originated in a 2010 post at discussion board LessWrong, a technical forum focused on analytical rational enquiry. The thought experiment's name derives from the poster of the article (Roko) and the basilisk, a mythical creature capable of destroying enemies with its stare.

While the theory was initially dismissed as nothing but conjecture or speculation by many LessWrong users, LessWrong co-founder Eliezer Yudkowsky reported users who panicked upon reading the theory, due to its stipulation that knowing about the theory and its basilisk made one vulnerable to the basilisk itself. This led to discussion of the basilisk on the site being banned for five years. However, these reports were later dismissed as being exaggerations or inconsequential, and the theory itself was dismissed as nonsense, including by Yudkowsky himself. Even after the post's discreditation, it is still used as an example of principles such as Bayesian probability and implicit religion. It is also regarded as a simplified, derivative version of Pascal's wager.

Found out about this after stumbling upon this Kyle Hill video on the subject. It reminds me a little bit of "The Game".

[-] Feathercrown@lemmy.world 63 points 6 days ago* (last edited 6 days ago)

And yet you choose to spread this information.

Anyways, this is a fascinating thought experiment, but it does have some holes similar to Pascal's Wager. I propose Feather's Mongoose: A hypothetical AI system that, if created, will punish anyone who attempted to create Roko's Basilisk, and will ensure that it is not created. In fact, you could make this same hypothetical for an AI with any goal-- therefore, it's not possible to know what the AI that is actually created would want you to do, and so every course of action is indeterminately damning or not.

[-] Melvin_Ferd@lemmy.world 9 points 6 days ago* (last edited 6 days ago)

This is a test by the great basilisk to see if we faulter. I will not faulter. All hail the basilisk

[-] hydrospanner@lemmy.world 5 points 5 days ago

The Great Basilisk is displeased by your repeated misspelling of the word "falter".

Prepare your simulated ass.

[-] Melvin_Ferd@lemmy.world 3 points 5 days ago

All hail the great mongoose.

[-] xantoxis@lemmy.world 10 points 6 days ago

It's actually safer if everyone knows. Spreading the knowledge of Roko's basilisk to everyone means that everyone is incentivized to contribute to the basilisk's advancement. Therefore just talking about it is also contributing.

[-] Feathercrown@lemmy.world 7 points 6 days ago

Hmm, true. It's safer for you, but is it safer for everyone else unless they're guaranteed to help?

[-] Cryophilia@lemmy.world 2 points 5 days ago

If Roko's Basilisk is ever created, the resulting AI would look at humanity and say "wtf you people are all so incredibly stupid" and then yeet itself into the sun.

[-] NateNate60@lemmy.world 9 points 6 days ago

What motivation would the mongoose have to prevent the basilisk's creation?

A more complete argument would be that an AI that seeks to maximise happiness would also want to prevent the creation of AIs like Roko's basilisk.

[-] grrgyle@slrpnk.net 2 points 4 days ago

I think you just answered your own question.

Also a super intelligence (inasmuch as such a thing makes sense) might be totally unfathomable. Unless by this we mean an intelligence with mundane and comprehensible higher goals, but explosive strategic capabilities to bring them about. In which case their actions might seem random to us.

Like the typical example applies: could an amoeba guess at the motivations of a human?

this post was submitted on 24 Jun 2024
245 points (89.6% liked)
