logoalt Hacker News

alex43578today at 1:05 PM1 replyview on HN

Quants will push it below 256GB without completely lobotomizing it.


Replies

lostmsutoday at 3:59 PM

> without completely lobotomizing it

The question in case of quants is: will they lobotomize it beyond the point where it would be better to switch to a smaller model like GPT-OSS 120B that comes prequantized to ~60GB.

show 1 reply