The value of Claude Code as a harness isn't that great. There are a lot of other good harnesses out there.
I thought so, and then I tried Opencode and Codex and started to appreciate Claude Code a lot more. They've actually done great work with the small details.
And it gets dragged down by Anthropic actively injecting unhelpful things into prompts without telling users about them (https://github.com/anthropics/claude-code/issues/58262).
What’s your favourite harness? Are there any benchmarks for harnesses, like LLMs have with SWE-bench Verified?