logoalt Hacker News

madagangtoday at 4:16 AM3 repliesview on HN

Their Chinese announcement says that, based on internal employee testing, it is not as good as Opus 4.6 Thinking, but is slightly better than Opus 4.6 without Thinking enabled.


Replies

mchusmatoday at 4:22 AM

I appreciate this, makes me trust it more than benchmarks.

ibictoday at 5:37 AM

In case people wonder where the announcement is (you can easily translate it via browser if you don't read Chinese): https://mp.weixin.qq.com/s/8bxXqS2R8Fx5-1TLDBiEDg

It's still a "preview" version atm.

deauxtoday at 4:52 AM

That's super interesting, isn't Deepseek in China banned from using Anthropic models? Yet here they're comparing it in terms of internal employee testing.

show 2 replies