logoalt Hacker News

cheema33yesterday at 9:41 PM0 repliesview on HN

This is good work. When a task is of critical importance, I give two different LLMs the same task. And then ask them to review each other's output and validate all claims. I do this with Codex and Claude Code. It is very rare for them to find some valid fault in the other LLM's solution. And they are generally good about admitting mistakes and then creating a single unified solution that addresses identified issues. This result is better and ready for human review.