I've always used the models interchangeably.
I don't look at benchmarks.
It's a non-deterministic tool. A lot of the shit going on with LLMs just doesn't make sense to me. All the tooling around them, like MCPs, is just putting stuff into context. So to me the tools aren't really robust, and they make little difference.
Lots of AI psychosis going on these days. And I say that as somebody who hasn't written a line of code since Sept 2025.