How does this compare to the commercial models like Sonnet 4.5 or GPT? Close enough that the price ...

fred_is_fred • today at 2:00 PM • 3 replies • view on HN

How does this compare to the commercial models like Sonnet 4.5 or GPT? Close enough that the price is right (free)?

Replies

The will not measure up. Notice they're comparing it to Gemma, Google's open weight model, not to Gemini, Sonnet, or GPT. That's fine - this is a tiny model.

If you want something closer to the frontier models, Qwen3.6-Plus (not open) is doing quite well[1] (I've not tested it extensively personally):

https://qwen.ai/blog?id=qwen3.6

➕ show 1 reply

NitpickLawyer • today at 2:08 PM

> Close enough

No. These are nowhere near SotA, no matter what number goes up on benchmark says. They are amazing for what they are (runnable on regular PCs), and you can find usecases for them (where privacy >> speed / accuracy) where they perform "good enough", but they are not magic. They have limitations, and you need to adapt your workflows to handle them.

➕ show 1 reply

yaur • today at 2:12 PM

I think its worth noting that if you are paying for electricity Local LLM is NOT free. In most cases you will find that Haiku is cheaper, faster, and better than anything that will run on your local machine.

➕ show 2 replies

alt Hacker News

Replies