Data is missing on this chart. It's my experience that opus 4, and then, particularly, 4.5, i...

pnathan • yesterday at 12:57 PM • 1 reply • view on HN

Data is missing on this chart.

It's my experience that opus 4, and then, particularly, 4.5, in Claude code, are head and shoulders above the competition.

I wrote an agentic coder years ago and it yielded trash. (Tried to make it do then what kiro does today).

The models are better. Now, caveat - I don't use anything but opus for coding - Sonnet doesn't do the trick. My experience with Codex and Gemini is that their top models are as good as Sonnet for coding...

Replies

BloondAndDoom • yesterday at 5:00 PM

I was trying to do something yestesrday and Claude was keep messing it up, after like an hour i realized the model somehow switched to sonet, opus 4.6 is crazy good. It’s very obvious in practice.

Although I feel like for chasing bugs and big systems codex is even better

alt Hacker News

Replies