Hacker News

torginus today at 4:42 PM

Considering how AI companies incestuously RL on each other's models, I would not be surprised if any number of behavioral patterns (and claims to be ChatGPT/Claude/DeepSeek or whatever) just popped up on new models constantly.

I would also not rule out that, since K2 is a 1T model, this is a distill, as I don't think they're serving expensive models just like that. That would not be a licensing violation, right?


Replies

simplyluke today at 4:49 PM

There's a now-deleted tweet from a Kimi dev claiming that they verified the tokenizer was the same, which would imply it goes beyond RL at least. Could still be a distill, I think.