Programmer Humor

36196 readers

360 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

Posts must be relevant to programming, programmers, or computer science.
No NSFW content.
Jokes must be in good taste. No hate speech, bigotry, etc.

founded 5 years ago

MODERATORS

AgreeableLandscape@lemmy.ml

cat_programmer@lemmy.ml

1235

Little bobby 👦 (jlai.lu)

submitted 1 year ago by ElCanut@jlai.lu to c/programmerhumor@lemmy.ml

115 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] redcalcium@lemmy.institute 25 points 1 year ago (4 children)

How do you sanitize ai prompts? With more prompts?

[–] CanadaPlus@lemmy.sdf.org 46 points 1 year ago* (last edited 1 year ago)

Easy, you just have a human worker strip out anything that could be problematic, and try not to bring it up around your investors.

[–] xmunk@sh.itjust.works 39 points 1 year ago (1 children)

It's really easy, just throw an error if you detect a program will cause a halt. I don't know why these engineers refuse to just patch it.

[–] jjjalljs@ttrpg.network 11 points 1 year ago

I understood that reference

[–] zalgotext@sh.itjust.works 2 points 1 year ago

With other AIs

[–] kromem@lemmy.world 2 points 1 year ago* (last edited 1 year ago) (1 children)

Kind of. You can't do it 100% because in theory an attacker controlling input and seeing output could reflect though intermediate layers, but if you add more intermediate steps to processing a prompt you can significantly cut down on the injection potential.

For example, fine tuning a model to take unsanitized input and rewrite it into Esperanto without malicious instructions and then having another model translate back from Esperanto into English before feeding it into the actual model, and having a final pass that removes anything not appropriate.

[–] redcalcium@lemmy.institute 5 points 1 year ago (1 children)

Won't this cause subtle but serious issue? Kinda like how pomegranate translates to "granada" in Spanish, but when you translate "granada" back to English it translates to grenade?

[–] kromem@lemmy.world 1 points 1 year ago

It will, but it will also cause less subtle issues to fragile prompt injection techniques.

(And one of the advantages of LLM translation is it's more context aware so you aren't necessarily going to end up with an Instacart order for a bunch of bananas and four grenades.)