Makes for a fascinating principal/agent problem: which role is the LLM playing? If I just tell it "Try different things until you solve the game", it tries to do just that until it reaches 15 tool calls.
Yeah made me wonder if you could speedrun the game by giving it a lot of complex instructions and then just let it run...
Yeah made me wonder if you could speedrun the game by giving it a lot of complex instructions and then just let it run...