logoalt Hacker News

zozbot234yesterday at 10:04 PM1 replyview on HN

4-bit quantization is native for Kimi 2.x series.


Replies

CamperBob2yesterday at 10:15 PM

You're right, I was thinking of Qwen. K2.6 will run at UD-Q2_K_XL precision on 4x RTX6000 boards, but I have no idea if it's worthwhile.