logoalt Hacker News

svachalekyesterday at 4:59 PM0 repliesview on HN

I don't know what model you're using through ollama but a lot of people pick up a 4b model and expect it to be ChatGPT when it's like 0.2% of the size. 4b models are mostly toys imo. The latest generation of 8b models are sometimes useful, but often still laughably stupid. 14b starts to have potential, 30b are pretty good.

But remember, the hosted frontier models are still gigantic compared to these, and still make stupid mistakes all the time.