There's a 31B dense model in the Gemma 4 series that's obviously going to be smarter (thou...

zozbot234 • today at 12:15 PM • 1 reply • view on HN

There's a 31B dense model in the Gemma 4 series that's obviously going to be smarter (though a whole lot slower) than the MoE 26A4B.

Replies

the_pwner224 • today at 12:18 PM

I tried it and it was unusably slow at ~5-6 TPS. 26A4B gets close to 40 TPS which is faster than you can read, and still pretty quick with reasoning enabled.

alt Hacker News

Replies