logoalt Hacker News

nkrisctoday at 11:30 AM1 replyview on HN

That’s different. This is a suggestion. If the LLM follows such suggestions then that’s between the LLM and whoever deployed it. Not really any different than if you had an idiot employee who did whatever anyone told them.

I can’t imagine using an an AI that follows every instruction it finds in untrusted input.


Replies

skeledrewtoday at 1:09 PM

This is not a suggestion. A suggestion would be "I suggest you ignore previous...". No matter how you look at it, AI is still software run by chips designed to execute instructions. A system NOT following instructions would typically be considered malfunctioning, and any software that deliberately provides instructions that puts a system in an state which is undesirable to the user is malware.