The model is (like Composer 2) based on Kimi K2.5 and they claim SOTA performance for 1/10th of...

asar • yesterday at 5:40 PM • 9 replies

The model is (like Composer 2) based on Kimi K2.5 and they claim SOTA performance for 1/10th of the cost. The tweet also mentions that they've started a new model from scratch on Colossus 2 (xAI/SpaceX Cluster). Really impressive how they've made this jump from being called the vscode fork with no moat just a couple of months ago.

Replies

antirez • today at 6:19 AM

How much the RL they are doing really improves Kimi K2.5 remains to be seen. So, right now, the ground truth is that they combined what they had with a strong open-weights model. The RL improvement may be marginal (since many folks report strong results with vanilla K2.6), and it may mostly bias the model towards coding tasks: when a model like this is trained to be a generalist, there is a tension between being good at one thing and being good at another, in terms of SFT and RL. You can see this in the DeepSeek v4 Flash training report, for instance, but it is a known fact. So if you have the GPUs and a decent RL pipeline that does not ruin the model, you can indeed specialize it a bit more for a given task at the expense of tasks people will not do inside Cursor. But, so far, the measurable reality is that Cursor uses an open-weights model, as most could do, and the RL story could be partially a marketing move to justify calling it Composer 2.5 rather than a real strong gain, given that there is no way to verify it and K2.5 was already strong. And we also know that they had to partner to do the training, which is also not good news.

the_duke • today at 7:14 AM

In my opinion, Cursor actually has one of the best harnesses again at the moment.

onlyrealcuzzo • yesterday at 6:47 PM

> Really impressive how they've made this jump from being called the vscode fork with no moat just a couple of months ago.

Impressive, yes. But they still don't have a moat...


wg0 • today at 5:36 AM

This was the only way forward.

liuliu • yesterday at 6:42 PM

Since the frontier is only 8 months ahead of DeepSeek, it is hard to see how model training can be a moat, as all the tricks are available from open labs in China. You really just need <100m to bootstrap at this point.

Lionga • yesterday at 6:10 PM

They are still a vscode fork with no moat? Like, they lost about 70% of users in half a year, which goes to show there is not even the tiniest of moats.


whywhywhywhy • yesterday at 6:37 PM

It's still a VS Code fork, just now with a Kimi fine-tune and still no moat...

I won't debate that it turns out none of this mattered when it came to being a successful company, though, and it kinda makes anyone who tried to roll their own instead of forking look a little silly.


aurareturn • yesterday at 6:41 PM

I doubt it's a brand new model. It's likely just Kimi K2.5 further trained on coding.

