> LLM is Immune to Prompt Injection > Despite all advances: > * No large language model...

chrisjj • last Sunday at 1:31 PM • 6 replies • view on HN

> LLM is Immune to Prompt Injection

> Despite all advances:

> * No large language model can reliably detect prompt injections

Interesting isn't it, that we'd never say "No database manager can reliably detect SQL injections". And that the fact it is true is no problem at all.

The difference is not because SQL is secure by design. It is because chatbot agents are insecure by design.

I can't see chatbots getting parameterised querying soon. :)

Replies

nayroclade • today at 1:44 AM

There are some ideas to produce something like parameterised querying for LLMs, such as DeepMind's CaMeL: https://simonwillison.net/2025/Apr/11/camel/

davexunit • today at 12:36 AM

Confused Deputy as a Service

space_fountain • yesterday at 11:18 PM

I'm not sure that a prompt injection secure LLM is even possible anymore than a human that isn't susceptible to social engineering can exist. The issues right now are that LLMs are much more trusting than humans, and that one strategy works on a whole host of instances of the model

➕ show 1 reply

CuriouslyC • yesterday at 10:25 PM

A big part of the problem is that prompt injections are "meta" to the models, so model based detection is potentially getting scrambled by the injection as well. You need an analytic pass to flag/redact potential injections, a well aligned model should be robust at that point.

➕ show 2 replies

kaicianflone • yesterday at 9:38 PM

Is this where AgentSkills come into play as an abstraction layer?

➕ show 2 replies

alt Hacker News

Replies