(commented on the wrong thread, HN doesn't let me delete it :( )
They're comparing to Opus 4.6, not 4.5. It was Anthropic's best public model up until last week.