The exercise is not fully realistic because I think getting hundreds of suspicious emails puts the agent in alert. But the "no reply without human approval" part I think it is realistic because that's how most openclaw assistants will run.
Point taken. I was mistakenly assuming a conversational agent experience.
I love the idea of showing how easy prompt injection or data exfiltration could be in a safe environment for the user and will definitely keep an eye out on any good "game" demonstration.
Point taken. I was mistakenly assuming a conversational agent experience.
I love the idea of showing how easy prompt injection or data exfiltration could be in a safe environment for the user and will definitely keep an eye out on any good "game" demonstration.
Reminds me of the old hack this site but live.
I'll keep an eye out for the aftermath.