Yes, I thought that too! But qwen3:0.6b (and to some extent gemma 1b) has made me reevaluate. They...

nl • last Wednesday at 12:40 PM • 2 replies • view on HN

Yes, I thought that too! But qwen3:0.6b (and to some extent gemma 1b) has made me reevaluate.

They still aren't useful like large LLMs, but for things like summarization, and other tasks where you can give them structure but want the sheen of natural language they are much better than things like the Phi series were.

Replies

redman25 • last Wednesday at 2:40 PM

That's interesting. For what projects would you want the "sheen of natural language" though?

➕ show 1 reply

nunodonato • last Wednesday at 6:07 PM

qwen3 family, mostly 4B and 8B are absolutely amazing. the VL versions even more

alt Hacker News

Replies