Opus 4.6 genuinely seems worse than 4.5 was in Q4 2025 for me. I know everyone always says this and anecdote != data but this is the first time I've really felt it with a new model to the point where I still reach for the old one.
I'll give GPT 5.3 codex a real try I think
I asked Codex 5.3 and Opus 4.6 to write me a macos application with a certain set of requirements.
Opus 4.6 wrote me a working macos application.
Codex wrote me a html + css mockup of a macos application that didn't even look like a macos application at all.
Opus 4.5 was fine, but I feel that 4.6 is more often on the money on its implementations than 4.5 was. It is just slower.
I agree with you. Codex 5.3 is good it's just a bit slower.
Huh… I’ve seen this comment a lot in this thread but I’ve really been impressed with both Anthropic’s latest models and latest tooling (plugins like /frontend-design mean it actually designs real front ends instead of the vibe coded purple gradient look). And I see it doing more planning and making fewer mistakes than before. I have to do far less oversight and debugging broken code these days.
But if people really like Codex better, maybe I’ll try it. I’ve been trying not to pay for 2 subscriptions at once but it might be worth a test.