logoalt Hacker News

geerlingguylast Sunday at 2:17 PM1 replyview on HN

Kiki K2 was made to be optimized at 4-bit, though.


Replies

natryslast Sunday at 3:06 PM

That's the Kimi K2 Thinking, this post seems to be talking about original Kimi K2 Instruct though, I don't think INT4 QAT (quantization aware training) version was released for this.