Notably it has 0 wins.
Friendo, this is an anti-benchmark to figure out which AI is more likely to kill you.
If you point both at some GitHub issues, you can gauge their relative ability to solve problems.
"if you judge a fish by its ability to climb a tree" yada yada
Not much less than GPT 5.4 with 2 wins or gemini-3.1-pro with 3 wins in 30 rounds.
Such is life in royal rumble games.