Claude Code gets smoked on benchmarks by an agent that has a single tool: tmux. So I think they might actually like that quite a bit.
What benchmarks are you referring to?
What benchmarks are you referring to?