Kimi K2 is a really weird model, just in general. It's not nearly as smart as Opus 4.5 or 5.2...

A_D_E_P_T • last Sunday at 2:01 PM • 11 replies

Kimi K2 is a really weird model, just in general.

It's not nearly as smart as Opus 4.5 or 5.2-Pro or whatever, but it has a very distinct writing style and also a much more direct "interpersonal" style. As a writer of very-short-form stuff like emails, it's probably the best model available right now. As a chatbot, it's the only one that seems to really relish calling you out on mistakes or nonsense, and it doesn't hesitate to be blunt with you.

I get the feeling that it was trained very differently from the other models, which makes it situationally useful even if it's not very good for data analysis or working through complex questions. For instance, as it's both a good prose stylist and very direct/blunt, it's an extremely good editor.

I like it enough that I actually pay for a Kimi subscription.

Replies

Alifatisk • last Sunday at 5:53 PM

> As a writer of very-short-form stuff like emails, it's probably the best model available right now.

This is exactly my feeling with Kimi K2. It's unique in this regard; the only one that comes close is Gemini 3 Pro. Otherwise, no other model has been this good at helping out with communication.

It has such a good grasp of "emotional intelligence" (?): reading signals in messages, understanding intentions, and taking human factors, social norms, and trends into consideration when helping to formulate a message.

I don't know exactly what Moonshot did during training, but they succeeded in giving this model a unique trait. This area deserves more attention, in my opinion.

I saw someone link to EQ-bench, which measures emotional intelligence in LLMs. Looking at it, Kimi is #1, so this kind of confirms my feeling.

Link: https://eqbench.com

wasting_time • last Sunday at 2:05 PM

It's also the only model that consistently nails my favorite AI benchmark: https://clocks.brianmoore.com/

greazy • last Sunday at 9:03 PM

It is hands down the only model I trust to tell me I'm wrong. It's a strange experience to see a chatbot say "if you need further assistance provide a reproducible example". I love it.

FYI Kagi provides access to Kimi K2.

stingraycharles • last Sunday at 2:09 PM

> As a chatbot, it's the only one that seems to really relish calling you out on mistakes or nonsense, and it doesn't hesitate to be blunt with you.

My experience is that Sonnet 4.5 does this a lot as well, but more often than not it's due to a lack of full context, e.g. accusing the user of not doing X or Y when it simply wasn’t told that was already done, and then proceeding to apologize.

How is Kimi K2 in this regard?

Isn’t “instruction following” the most important thing you’d want out of a model in general, and isn’t a model that pushes back more likely than not to be wrong?

jug • last Sunday at 3:43 PM

And given this, it unsurprisingly scores very well on https://eqbench.com

Kim_Bruning • last Sunday at 2:06 PM

Speaking of weird. I feel like Kimi is a shoggoth with its tentacles in a man-bun. If that makes any sense.

culi • last Sunday at 8:47 PM

Kimi K2 is the model that most consistently passes the clock test. I agree it's definitely got something unique going on.

https://clocks.brianmoore.com/

3abiton • last Sunday at 4:47 PM

> I get the feeling that it was trained very differently from the other models

It's actually based on a DeepSeek architecture, just with bigger experts, if I recall correctly.

Bolwin • last Sunday at 4:56 PM

In their AMA, Moonshot said it was mainly finetuning.

logicprog • last Sunday at 3:29 PM

How do you feel K2 Thinking compares to Opus 4.5 and 5.2-Pro?

mips_avatar • last Sunday at 7:52 PM

It's a lot stronger at geospatial intelligence tasks than any other model, in my experience. Shame it's so slow in terms of tokens per second.
