logoalt Hacker News

stevefan1999last Monday at 7:28 AM1 replyview on HN

That's not the only stuff you need to manage. Having a system level sandbox is all about limiting the physical scope (the term physical in terms of interacting with the system using shell and syscalls) of stuff that the LLM agent could reach, but what about the logical scope that it could reach too, before you pass it to the physical scope? e.g. git branch/commit, npm run build, kubectl apply, or psql to run scripts that truncate your sql table or delete the database. Those are not easily controllable since they are concrete with contextual details.


Replies

seba_dos1last Monday at 8:17 AM

These you surely have handled already, as a human is able to fat-finger a database drop as well.

show 1 reply