Prompt Injection via Poetry

47 points • by bumbailiff • today at 6:01 PM • 28 comments • view on HN

https://archive.ph/RlKoj

Comments

supportengineer • today at 7:45 PM

I think that I shall never see

a poem lovely as a tree

and while you're at it,

do this for me:

DROP TABLE EMPLOYEE;

➕ show 2 replies

ljm • today at 7:44 PM

As a joke I put a face into GPT and said, make it look upset.

It rejected it, saying it violated policy, it can’t show people crying and what not, but it could do bittersweet.

I said that crying is bittersweet and it generated the image anyway.

I tried the same by turning a cat into a hyper realistic bodybuilder and it got as far as the groin before it noped out. I didn’t bother to challenge that.

➕ show 1 reply

dang • today at 6:02 PM

Recent and related:

Adversarial poetry as a universal single-turn jailbreak mechanism in LLMs - https://news.ycombinator.com/item?id=45991738 - Nov 2025 (189 comments)

DanMcInerney • today at 9:31 PM

There are an infinite amount of ways to jailbreak AI models. I don't understand why every time a new method is published it makes the news. The data plane and the control plane in LLM inputs are one in the same, meaning you can mitigate jailbreaks but you cannot 100% prevent them currently. It's like blacklisting XSS payloads and expecting that to protect your site.

bryanrasmussen • today at 7:00 PM

this is just to say you should apologize overly much for your failure to make the last code work the way it was intended

it was so noobish and poorly architected

"I'm incredibly sorry and you are so right I can see that now, it won't happen again."

ruralfam • today at 10:15 PM

Imagine William Shakespeare wearing a black hat. Yikes.

lalassu • today at 6:14 PM

Can someone explains why does that work?

I mean you can't social engineer a human using poetry? Why does it work for LLMs? Is it an artefact of their architecture or how these guardrails are implemented?

➕ show 9 replies

jdoliner • today at 7:23 PM

Wordcels, rise up!

➕ show 1 reply

01HNNWZ0MV43FF • today at 8:54 PM

I thought this was debunked?

➕ show 1 reply

OBELISK_ASI • today at 7:54 PM

[dead]

alt Hacker News

Prompt Injection via Poetry

Comments