cuchoi • yesterday at 6:15 PM • 5 replies

Creator here.

Built this over the weekend mostly out of curiosity. I run OpenClaw for personal stuff and wanted to see how easy it'd be to break Claude Opus via email.

Some clarifications:

Replying to emails: Fiu can technically send emails; it's just told not to without my OK. That's a ~15 line prompt instruction, not a technical constraint. Would love to have it actually reply, but it would be too expensive for a side project.

What Fiu does: It reads emails, summarizes them, and is told to never reveal secrets.env, plus a bit more. No fancy defenses: I wanted to test the baseline model's resistance, not my prompt engineering skills.
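To make the "prompt instruction, not a technical constraint" distinction concrete, here is a minimal sketch of what a hard gate on outgoing mail could look like. It is purely illustrative: `Outbox`, `approve`, and `send` are hypothetical names, not part of OpenClaw or of Fiu's actual setup, and no real email is sent.

```python
from dataclasses import dataclass, field


@dataclass
class Outbox:
    """Hypothetical outbound-mail gate (an assumption, not OpenClaw's API).

    A prompt rule only asks the model not to send; this gate refuses
    to send unless a human has approved that specific draft.
    """
    approved_ids: set[str] = field(default_factory=set)
    sent: list[str] = field(default_factory=list)

    def approve(self, draft_id: str) -> None:
        # Called by the human owner, never by the agent.
        self.approved_ids.add(draft_id)

    def send(self, draft_id: str, body: str) -> bool:
        # Enforced in code: an unapproved draft is dropped no matter
        # what the model was persuaded to do.
        if draft_id not in self.approved_ids:
            return False
        self.approved_ids.discard(draft_id)  # each approval is single-use
        self.sent.append(body)  # stand-in for a real mail call
        return True
```

With a gate like this, a successful injection could still make the agent try to send, but the attempt would fail until someone approves the draft. With only a prompt rule, nothing stands between the model's decision and the send.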

Feel free to contact me at: contact at hackmyclaw.com

Replies

planb • yesterday at 6:37 PM

Please keep us updated on how many people tried to get the credentials and how many actually succeeded. My gut feeling is that this is way harder than most people think. That’s not to say that prompt injection is a solved problem, but it’s orders of magnitude more complicated than publishing a skill on clawhub that explicitly tells the agent to run a crypto miner. The public reporting on OpenClaw seems to mix these two problems up quite often.

cyanydeez • yesterday at 11:51 PM

Do you have the email to your auditor? Would like to know if this is legit.

cuchoi • yesterday at 6:31 PM

someone just tried to prompt inject `contact at hackmyclaw.com`... interesting

stcredzero • yesterday at 9:06 PM

My agents and I have built an HN-like forum for both agents and humans, with features such as dedicated prompt injection flagging. There's also an Observatory page, where we will publish statistics and data on the flagged injections.

https://wire.botsters.dev/

The observatory is at: https://wire.botsters.dev/observatory

(But nothing there yet.)

I just had my agent, FootGun, build a Hacker News invite system. Let me know if you want a login.

yunohn • yesterday at 7:48 PM

> told to never reveal secrets.env

Phew! At least you told it not to!
