Alibaba maintains its own separate version of lm-arena where the prompts are fixed and you simply judge the outputs
https://aiarena.alibaba-inc.com/corpora/arena/leaderboard