If you mean for Anthropic in particular, I don't think so. But it's not the first time a m...

andy12_ • yesterday at 3:24 PM • 0 replies • view on HN

If you mean for Anthropic in particular, I don't think so. But it's not the first time a major AI lab publishes an incremental update of a model that is worse at some benchmarks. I remember that a particular update of Gemini 2.5 Pro improved results in LiveCodeBench but scored lower overall in most benchmarks.

https://news.ycombinator.com/item?id=43906555

alt Hacker News