Quants will push it below 256GB without completely lobotomizing it.

alex43578 • today at 1:05 PM

Replies

> without completely lobotomizing it

The question with quants is whether they lobotomize it past the point where you'd be better off switching to a smaller model like GPT-OSS 120B, which comes prequantized to ~60GB.
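
For a rough sense of the numbers, here is a back-of-the-envelope sketch of the weight-size arithmetic. It assumes the footprint is just parameter count times bits per weight, ignoring KV cache, activations, and per-block scale overhead. The function names are made up for illustration, and the 120e9 parameter count is simply the nominal figure from the model's name, with 4 bits per weight as an assumed average.

```python
def weight_footprint_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in decimal GB (1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


def max_bits_per_weight(n_params: float, budget_gb: float) -> float:
    """Largest average bits per weight that still fits the memory budget."""
    return budget_gb * 1e9 * 8 / n_params


# Nominal 120B parameters at an assumed 4 bits per weight lands at the
# ~60GB figure mentioned above.
small_model_gb = weight_footprint_gb(120e9, 4)

# For the larger model, plug in its actual parameter count (not stated in
# this thread) to see how aggressive a quant the 256GB target implies:
# max_bits_per_weight(n_params=..., budget_gb=256)
```

The lower the bits-per-weight figure the second function returns, the more likely the quality loss is to eat up the advantage over the smaller model.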
