logoalt Hacker News

zozbot234today at 6:37 AM0 repliesview on HN

Kimi uses INT4 as its native format, there's no such thing as "better than 4-bit precision" for that model. This is in contrast with GLM for which 16-bit precision is native and 8-bit is in common use.