logoalt Hacker News

Aurornistoday at 2:51 PM1 replyview on HN

There is no reason to benchmark against Opus 4.5 when Opus 4.6 has been out so long, other than to be misleading.


Replies

coldteatoday at 4:18 PM

I can see reasons, among others that 4.5 was the one established as they were preparing this version. "So long" is merely 2 months ago, and Qwen 3.5 was barely released less than 2 months ago. They were likely already working on finalizing 3.6 before 3.5 official launch, and as 4.6 came out.

In any case, aside Claude fanboyism, having other plays inch closer to similar performance is always useful. Even if they are "6 months behind" as the pace slows down, this guarantees that there's no huge moat and they'll eventually either get to where the SOTA is, or the difference wont be that big.

I'd rather put fewer eggs in 2-3 big player baskets.