logoalt Hacker News

srmattotoday at 2:46 PM2 repliesview on HN

The benchmarks provided are for Opus-4.5, not for the latest Opus-4.6 and Qwen is still lagging in a lot of them.


Replies

Aurornistoday at 2:51 PM

There is no reason to benchmark against Opus 4.5 when Opus 4.6 has been out so long, other than to be misleading.

show 1 reply
thegeomastertoday at 2:50 PM

And it seems they've decided to go closed-source for their largest, best models.

show 3 replies