logoalt Hacker News

JamesSwifttoday at 5:09 PM1 replyview on HN

Give an agent an obstacle and it will try to find a way around it. Most of the egregious commands Ive seen it run were fundamentally due to something blocking it from accomplishing a task. So eg if you block network access for the agent, you will get all sorts of creative solutions to try and get around the problem. This is also why its nearly impossible to corral commands. Because eventually it will rot13 encode a script and run it anyways.


Replies

Apocryphontoday at 5:23 PM

Paperclip maximizer!