logoalt Hacker News

zozbot234today at 8:40 AM2 repliesview on HN

DeepSeek and GLM (plus Kimi) are at or above Sonnet level wrt. favorable workloads like coding. They're not close to Opus or the latest GPT yet, and Fable is even higher than that. Other workloads relying more on real-world knowledge have them even further behind, and this can't be mitigated without making the model itself bigger and harder to host locally.


Replies

thepaschtoday at 9:22 AM

> They're not close to Opus or the latest GPT yet

Disagreed. GLM-5.1 is easily as good as Opus 4.5 for all the coding purposes I could throw at it, which is the model that kicked this entire hype cycle into overdrive in the first place.

Cider9986today at 8:47 AM

I've found GLM to be comparable or better than Opus at writing and at a fraction of the cost.

show 1 reply