What I would have expected is prompt injection or other methods of getting the agent to do something its user doesn't want, not regular "classical" attacks.
At least currently, I don't think we have good ways of preventing the former, but the latter should be possible to avoid.