Do you have benchmarks or at least anecdotes to back that up? I'm not arguing with you; I would...

smith7018 • today at 1:03 PM • 2 replies • view on HN

Do you have benchmarks or at least anecdotes to back that up? I'm not arguing with you; I would just love to see some proof that open models are getting as good as Anthropic's models.

Replies

redox99 • today at 1:30 PM

I've been running some test prompts comparing frontier models for webdev, particularly pretty visualizations, physics / orbital simulations, etc.

Do note that GLM is not multi modal, which can be a deal breaker. And these open models are not good outside coding.

unrvl22 • today at 1:29 PM

look at benchmarks, use the model yourself. Im usually first to call BS on every chinese model that says they are as good as Opus. this is finally the first one that actually is. It is a massive jump from every other previous chinese model.

➕ show 1 reply

alt Hacker News

Replies