Calling prompt injection "not malware" because LLM behavior is unpredictable is like sayin...

d4rken • today at 10:20 AM • 1 reply • view on HN

Calling prompt injection "not malware" because LLM behavior is unpredictable is like saying a phishing email is not an attack because humans are unpredictable.

Even if maybe the mechanism of "injecting a prompt" could be beneficial in some use-cases, e.g. to instruct an LLM positively, this is case is clearly malicious by intent. The author even tried to hide it by obfuscation.

It's just an insane take by that libraries author. Even someone "on their side", that may even hate AI/LLMs more than him, would probably drop that library in a heartbeat, as the authors judgement clearly can't be trusted.

Replies

fwlr • today at 10:34 AM

    Calling prompt injection "not malware" … is like saying a phishing email is not [malware] …

I would say phishing emails are not malware, I think most people would agree that phishing emails are not malware, and if pressed to defend this point on its own merits I would say something like “they are deceptive instructions that rely on a human executing them to do harm”. I think the “phishing” analogy supports the case for not calling it malware (it is a different, also bad thing).

➕ show 2 replies

alt Hacker News

Replies