logoalt Hacker News

nextaccountictoday at 9:09 AM2 repliesview on HN

This puts Sonnet 4.6 above Opus 4.6 in the coding index.. kinda hard to trust those numbers.

(Also it puts Opus 4.7 universally above Opus 4.6, and I may be wrong but this doesn't seem to match the experience of most/many/some people. I think it's widely recognized that Anthropic is severely lacking compute and Opus 4.7 is a costs saving measure)


Replies

manmaltoday at 9:52 AM

Anthropic themselves have (had?) this thing where Opus is used for planning and Sonnet for coding.

show 1 reply
conceptiontoday at 12:28 PM

What I’ve usually seen is 4.7 -> 4.5 -> 4.6 in terms of quality. Though 4.7 seems to hallucinate more than before.

show 1 reply