Their Chinese announcement says that, based on internal employee testing, it is not as good as Opus ...

madagang • today at 4:16 AM • 3 replies • view on HN

Their Chinese announcement says that, based on internal employee testing, it is not as good as Opus 4.6 Thinking, but is slightly better than Opus 4.6 without Thinking enabled.

Replies

mchusma • today at 4:22 AM

I appreciate this, makes me trust it more than benchmarks.

ibic • today at 5:37 AM

In case people wonder where the announcement is (you can easily translate it via browser if you don't read Chinese): https://mp.weixin.qq.com/s/8bxXqS2R8Fx5-1TLDBiEDg

It's still a "preview" version atm.

deaux • today at 4:52 AM

That's super interesting, isn't Deepseek in China banned from using Anthropic models? Yet here they're comparing it in terms of internal employee testing.

➕ show 2 replies

alt Hacker News

Replies