Hacker News

fred_is_fred today at 2:00 PM

How does this compare to the commercial models like Sonnet 4.5 or GPT? Close enough that the price is right (free)?


Replies

vidarh today at 2:07 PM

They will not measure up. Notice they're comparing it to Gemma, Google's open-weight model, not to Gemini, Sonnet, or GPT. That's fine - this is a tiny model.

If you want something closer to the frontier models, Qwen3.6-Plus (not open) is doing quite well[1] (I've not tested it extensively personally):

[1] https://qwen.ai/blog?id=qwen3.6

NitpickLawyer today at 2:08 PM

> Close enough

No. These are nowhere near SotA, no matter what the benchmark numbers say. They are amazing for what they are (runnable on regular PCs), and you can find use cases for them (where privacy >> speed / accuracy) where they perform "good enough", but they are not magic. They have limitations, and you need to adapt your workflows to handle them.

yaur today at 2:12 PM

I think it's worth noting that if you are paying for electricity, a local LLM is NOT free. In most cases you will find that Haiku is cheaper, faster, and better than anything that will run on your local machine.
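
For anyone who wants to check the electricity point against their own setup, here is a rough back-of-envelope sketch. The function names are made up for illustration, and every input (power draw, generation speed, electricity rate, API price) is something you would have to measure or look up yourself; none of the numbers come from this thread.

```python
def local_cost_per_million_tokens(watts: float,
                                  tokens_per_second: float,
                                  usd_per_kwh: float) -> float:
    """Electricity cost in USD to generate one million tokens locally.

    All three inputs are assumptions the reader supplies: average power
    draw under load, sustained generation speed, and electricity rate.
    """
    seconds = 1_000_000 / tokens_per_second
    kwh = watts * seconds / 3_600_000  # watt-seconds -> kilowatt-hours
    return kwh * usd_per_kwh


def local_is_cheaper(local_cost: float, api_usd_per_million: float) -> bool:
    """Compare local electricity cost with a hosted API's per-million price.

    Hardware cost and output quality are deliberately ignored here.
    """
    return local_cost < api_usd_per_million
```

With hypothetical inputs of 300 W, 20 tokens/s, and $0.30 per kWh, this works out to $1.25 of electricity per million generated tokens, before counting the hardware itself. Plug in the current price of whichever hosted model you are comparing against.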
