An ensemble of models can catch more bugs, and suggest more fixes, than any single model. I run claude, codex and gemini in parallel for reviews.
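A minimal sketch of the fan-out idea, not my actual setup: the same diff goes to several reviewers at once and the findings are merged, so you can see which issues only one reviewer caught. The reviewer functions here are hypothetical stand-ins that return canned findings; in practice each would wrap a call to the corresponding model, and the names `run_ensemble`, `reviewer_a`, etc. are made up for illustration.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Set

# A reviewer takes a diff and returns a set of short finding strings.
Reviewer = Callable[[str], Set[str]]


def run_ensemble(diff: str, reviewers: Dict[str, Reviewer]) -> Dict[str, List[str]]:
    """Run all reviewers on the same diff in parallel.

    Returns a mapping from each finding to the names of the reviewers
    that reported it, in the order the reviewers were given.
    """
    with ThreadPoolExecutor(max_workers=max(1, len(reviewers))) as pool:
        futures = {name: pool.submit(fn, diff) for name, fn in reviewers.items()}
        results = {name: fut.result() for name, fut in futures.items()}

    merged: Dict[str, List[str]] = {}
    for name, findings in results.items():
        for finding in findings:
            merged.setdefault(finding, []).append(name)
    return merged


# Hypothetical stand-ins: real reviewers would call a model, not return constants.
def reviewer_a(diff: str) -> Set[str]:
    return {"off-by-one in loop", "missing null check"}


def reviewer_b(diff: str) -> Set[str]:
    return {"off-by-one in loop"}


def reviewer_c(diff: str) -> Set[str]:
    return {"unclosed file handle"}


REVIEWERS: Dict[str, Reviewer] = {
    "reviewer_a": reviewer_a,
    "reviewer_b": reviewer_b,
    "reviewer_c": reviewer_c,
}

if __name__ == "__main__":
    for finding, names in sorted(run_ensemble("example diff", REVIEWERS).items()):
        print(f"{finding}: {', '.join(names)}")
```

Threads are enough here on the assumption that each reviewer mostly waits on a subprocess or network call. Findings reported by only one reviewer are the ones a single-model review would have missed.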