Here’s what I don’t get: while this makes for a fun blog post, you can just program an efficient killing machine that probably wins all the time and has $0 in token costs. LLMs should work to build such a machine, not be the machine themselves.
The things LLMs are good at, you do not actually need for an agent like this. You can use classical AI methods. But that would be a boring article.