logoalt Hacker News

cuchoiyesterday at 9:14 PM1 replyview on HN

The exercise is not fully realistic because I think getting hundreds of suspicious emails puts the agent in alert. But the "no reply without human approval" part I think it is realistic because that's how most openclaw assistants will run.


Replies

alexhansyesterday at 9:25 PM

Point taken. I was mistakenly assuming a conversational agent experience.

I love the idea of showing how easy prompt injection or data exfiltration could be in a safe environment for the user and will definitely keep an eye out on any good "game" demonstration.

Reminds me of the old hack this site but live.

I'll keep an eye out for the aftermath.

show 1 reply