logoalt Hacker News

A_D_E_P_Tlast Sunday at 2:01 PM11 repliesview on HN

Kimi K2 is a really weird model, just in general.

It's not nearly as smart as Opus 4.5 or 5.2-Pro or whatever, but it has a very distinct writing style and also a much more direct "interpersonal" style. As a writer of very-short-form stuff like emails, it's probably the best model available right now. As a chatbot, it's the only one that seems to really relish calling you out on mistakes or nonsense, and it doesn't hesitate to be blunt with you.

I get the feeling that it was trained very differently from the other models, which makes it situationally useful even if it's not very good for data analysis or working through complex questions. For instance, as it's both a good prose stylist and very direct/blunt, it's an extremely good editor.

I like it enough that I actually pay for a Kimi subscription.


Replies

Alifatisklast Sunday at 5:53 PM

> As a writer of very-short-form stuff like emails, it's probably the best model available right now.

This is exactly my feeling with Kimi K2, it's unique in this regard, the only one that comes close is Gemini 3 pro, otherwise, no other model has been this good at helping out with communication.

It has such a good understanding with "emotional intelligence" (?), reading signals in messages, understanding intentions, taking human factors into consideration and social norms and trends when helping out with formulating a message.

I don't exactly know what Moonshot did during training but they succeeded with a unique trait on this model. This area deserves more highlight in my opinion.

I saw someone linking to EQ-bench which is about emotional intelligence in LLMs, looking at it, Kimi is #1. So this kind of confirms my feeling.

Link: https://eqbench.com

show 1 reply
wasting_timelast Sunday at 2:05 PM

It's also the only model that consistently nails my favorite AI benchmark: https://clocks.brianmoore.com/

show 2 replies
greazylast Sunday at 9:03 PM

It is hands down the only model I trust to tell me I'm wrong. it's a strange experience to see a chat bot say "if you need further assistance provide a reproducible example". I love it.

FYI Kagi provides access to Kimi K2.

show 2 replies
stingraycharleslast Sunday at 2:09 PM

> As a chatbot, it's the only one that seems to really relish calling you out on mistakes or nonsense, and it doesn't hesitate to be blunt with you.

My experience is that Sonnet 4.5 does this a lot as well, but this is more often than not due to a lack of full context, eg accusing the user of not doing X or Y when it just wasn’t told that was already done, and proceeding to apologize.

How is Kimi K2 in this regard?

Isn’t “instruction following” the most important thing you’d want out of a model in general, and a model pushing back more likely than not being wrong?

show 2 replies
juglast Sunday at 3:43 PM

And given this, it unsurprisingly scores very well on https://eqbench.com

Kim_Bruninglast Sunday at 2:06 PM

Speaking of weird. I feel like Kimi is a shoggoth with its tentacles in a man-bun. If that makes any sense.

culilast Sunday at 8:47 PM

Kimi K2 is the model that most consistently passes the clock test. I agree it's definitely got something unique going on

https://clocks.brianmoore.com/

show 2 replies
3abitonlast Sunday at 4:47 PM

> I get the feeling that it was trained very differently from the other models

It's actually based on a deepseek architecture just bigger size experts if I recall correctly.

show 2 replies
Bolwinlast Sunday at 4:56 PM

In their AMA moonshot said it was mainly finetuning

show 1 reply
logicproglast Sunday at 3:29 PM

How do you feel K2 Thinking compares to Opus 4.5 and 5.2-Pro?

show 1 reply
mips_avatarlast Sunday at 7:52 PM

It's a lot stronger for geospatial intelligence tasks than any other model in my experience. Shame it's so slow in terms of tps