Hacker News

lanthissa yesterday at 8:26 PM

Isn't Gemini 3 Flash already an example of model shrinkage that does well at coding?


Replies

skippyboxedhero today at 2:29 AM

Xiaomi, Nvidia Nemotron, Minimax, lots of other smaller ones too. There are massive economic incentives to shrink models because they can be provided faster and at lower cost.

I think even with the money going in, there has to be some revenue supporting that development somewhere. And users are now looking at the cost. I have been using Anthropic Max for most of this year; after checking out some of these other models, it is clearly overpriced (I would also say their Claude Code moat has been breached). And Anthropic's API pricing is completely crazy when you use some of the paradigms they suggest (agents, commands, etc.), i.e. token usage is going up, so efficient models are driving growth.

hedgehog yesterday at 8:29 PM

Smaller open-weights models (like Qwen3 Coder 30B) are also improving noticeably; the improvements are happening at all sizes.

Imustaskforhelp yesterday at 9:28 PM

How many billion parameters does Gemini 3 Flash have? I can't seem to find info about it online.