Hacker News

theanonymousone, today at 3:29 PM

On OpenRouter, there is an "int4" tag for the Moonshot provider of Kimi K2.7 Code. Isn't that too low, particularly coming from the very developer of the model? Is that a mistake? How is it in their direct API offering?


Replies

kouteiheika, today at 3:33 PM

The model is natively quantized (i.e. it was trained that way in the first place, so this is not post-training quantization, which would degrade performance).
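To make the distinction concrete, here is a minimal sketch of what rounding weights to int4 does. This is a generic symmetric round-to-nearest scheme with made-up weights and hypothetical function names, not Moonshot's actual method. With post-training quantization this rounding is applied to a model that never saw it; with native (quantization-aware) training the model sees the rounding during training and can adapt to it.

```python
def quantize_int4(weights):
    """Symmetric per-tensor round-to-nearest quantization to the int4 range [-8, 7].

    Illustrative only: real schemes typically use per-group scales and other tricks.
    """
    max_abs = max(abs(w) for w in weights)
    # Map the largest magnitude onto code 7; guard against an all-zero tensor.
    scale = max_abs / 7 if max_abs > 0 else 1.0
    codes = [max(-8, min(7, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Map int4 codes back to approximate floating-point weights."""
    return [c * scale for c in codes]


# Made-up example weights (an assumption for illustration).
weights = [0.7, -0.33, 0.12, 0.02]
codes, scale = quantize_int4(weights)
restored = dequantize(codes, scale)

# The rounding error per weight is at most half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

The small weights lose the most relative precision here, which is the kind of error a model quantized only after training has no chance to compensate for.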
