Hacker News

edg5000 today at 3:54 AM

So you're going to use DeepSeek, Qwen, GLM, Kimi and Mistral now? I tried them, and they really fall short of GPT and Claude.

Without access to US models, I'd be limited to asking simple questions in chat interfaces and maybe some grunt work in coding CLIs, though the weaker models will mess up even that.

Nothing has reached Opus and GPT5 levels in my personal experience, which also aligns with what the labs themselves admit ("near-frontier").


Replies

data-ottawa today at 4:22 AM

Well, I am definitely not using the models that I'm not able to access.

So now the question is whether the capabilities of other models are worth their far cheaper token prices.

Plus, are we at all confident Opus or GPT 5.5 aren't about to get shut off?

bean469 today at 7:16 AM

Not everyone needs SOTA. Many also consider speed, token/plan cost, and other factors when choosing a model.

ignoramous today at 7:59 AM

> Nothing has reached Opus and GPT5 levels in my personal experience

You mean GPT 5.5 xhigh and Claude Opus 4.8 max? At least the benchmarks / public evals / rankings show that some of the new coding models (e.g. Qwen 3.7 Max and MiMo v2.5 Pro) are at Opus 4.7 and GPT 5.4 level, but 3x to 5x cheaper: https://artificialanalysis.ai/leaderboards/models / https://gertlabs.com/rankings

Personally speaking, over the past month or so, I haven't missed GPT 5.4 / Opus 4.7 after moving to Qwen 3.7 / MiMo 2.5 / Kimi 2.6 et al.
