Hacker News

pnathan, yesterday at 12:57 PM

Data is missing on this chart.

It's my experience that Opus 4, and then particularly 4.5, in Claude Code are head and shoulders above the competition.

I wrote an agentic coder years ago and it yielded trash. (I tried to make it do then what Kiro does today.)

The models are better. Now, a caveat: I don't use anything but Opus for coding, because Sonnet doesn't do the trick. My experience with Codex and Gemini is that their top models are as good as Sonnet for coding...


Replies

BloondAndDoom, yesterday at 5:00 PM

I was trying to do something yesterday and Claude kept messing it up. After about an hour I realized the model had somehow switched to Sonnet. Opus 4.6 is crazy good; the difference is very obvious in practice.

Although I feel like Codex is even better for chasing bugs and for big systems.