zozbot234 • today at 8:40 AM

DeepSeek and GLM (plus Kimi) are at or above Sonnet level wrt. favorable workloads like coding. They're not close to Opus or the latest GPT yet, and Fable is even higher than that. Other workloads relying more on real-world knowledge have them even further behind, and this can't be mitigated without making the model itself bigger and harder to host locally.

Replies

thepasch • today at 9:22 AM

> They're not close to Opus or the latest GPT yet

Disagreed. For all the coding purposes I could throw at it, GLM-5.1 is easily as good as Opus 4.5, which is the model that kicked this entire hype cycle into overdrive in the first place.

Cider9986 • today at 8:47 AM

I've found GLM to be comparable to or better than Opus at writing, and at a fraction of the cost.
