Hacker News

CamperBob2 yesterday at 9:35 PM

Makes for a fascinating principal/agent problem: which role is the LLM playing? If I just tell it "Try different things until you solve the game", it tries to do just that until it reaches 15 tool calls.


Replies

alecf yesterday at 9:48 PM

Yeah, made me wonder if you could speedrun the game by giving it a lot of complex instructions and then just letting it run...
