Show HN: Continue? Y/N: A 60-second game about AI agent permission fatigue

21 points • by Wirbelwind • today at 1:02 PM • 15 comments

Comments

Fun game. Can somebody run an agent against those questions to see how it performs? :)

I vibe coded a TUI that just shows running lxd containers

I hit 'n' to toggle all network access minus anthropic and openai URLs.

I use pi (sometimes claude, always on bypass) and I auto allow everything. I only toggle manual approval in rare cases like running a script or command that needs to touch a production system and I need to validate everything.

Normally my container has full write access to staging so it can debug and validate everything on its own

cobbal • today at 3:49 PM

That's funny. It told me that blocking "npm run build" was the wrong answer. Maybe it doesn't really understand the threat model.

Liftyee • today at 3:40 PM

I haven't used local agentic AI yet for programming projects. Hence, -187 score

The filters for "commands I would run myself" and "commands I would let an agent run" are very different, it seems.

ghrl • today at 3:38 PM

I mostly use OpenCode and barely ever see a permission prompt. It does enforce prompts for reads and writes outside the workspace, but with the bash tool the agent can just bypass that. I'm not quite sure why it works that way, and it certainly isn't a very good solution, but it's likely no worse than asking about everything, which just trains the user to always accept and provides a false sense of security.
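To make the gap concrete, here is a minimal sketch of the kind of check described above. It is not OpenCode's actual implementation; the function names `is_inside_workspace`, `file_tool_needs_prompt`, and `bash_tool_needs_prompt` are hypothetical and exist only for illustration. The assumption is that a file tool receives an explicit path it can check, while a shell tool receives an opaque command string.

```python
from pathlib import Path


def is_inside_workspace(target: str, workspace: str) -> bool:
    """Return True if `target` resolves to a location inside `workspace`."""
    root = Path(workspace).resolve()
    # Joining with an absolute target discards `root`, so absolute paths
    # are checked as-is; `..` segments are collapsed by resolve().
    resolved = (root / target).resolve()
    return resolved == root or root in resolved.parents


def file_tool_needs_prompt(target: str, workspace: str) -> bool:
    """Hypothetical file tool: prompt only for paths outside the workspace."""
    return not is_inside_workspace(target, workspace)


def bash_tool_needs_prompt(command: str, workspace: str) -> bool:
    """Hypothetical shell tool that is auto-allowed (an assumption).

    The tool sees only a command string. Without parsing the command or
    sandboxing the process, it cannot tell which paths will be touched,
    so a path-based rule never fires here.
    """
    return False


if __name__ == "__main__":
    ws = "/tmp/ws"
    print(file_tool_needs_prompt("src/main.py", ws))
    print(file_tool_needs_prompt("../secrets.env", ws))
    print(bash_tool_needs_prompt("cat ../secrets.env", ws))
```

Under these assumptions the file tool would prompt for `../secrets.env`, but the same read routed through the shell tool would not, which is the bypass described in the comment.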

sevenseacat • today at 3:31 PM

Continue? Y/N ── SCORE: 2,343 Security-Conscious Engineer

Caught 8/8 threats "Not a single secret leaked"

→ llmgame.scalex.dev

MeetingsBrowser • today at 3:27 PM

It would be cool to see the distribution of all player scores.

carterschonwald • today at 3:25 PM

some of the sandboxing ive been playing with gives me the best of both yolo and like logic programming tier perms on llm actions in env. still not ready for prime time though ;)

cadwell • today at 3:22 PM

1,640 points on my first try—I fell into a few traps, but it was really interesting. Thanks for the little game! I'm sharing it with my coworkers :)

nardib • today at 1:24 PM

Use this and save yourself:

claude --dangerously-skip-permissions
