logoalt Hacker News

throwaw12today at 10:29 AM4 repliesview on HN

Congratulations, great work Kimi team.

Why is that Claude still at the top in coding, are they heavily focused on training for coding or is it their general training is so good that it performs well in coding?

Someone please beat the Opus 4.5 in coding, I want to replace it.


Replies

symisc_develtoday at 4:42 PM

Gemini 3 pro is way better than Opus especially for large codebases.

show 1 reply
pokot0today at 1:53 PM

I don't think that kind of difference in benchmarks has any meaning at all. Your agentic coding tool and the task you are working on introduce a lot more "noise" than that small delta.

Also consider they are all overfitting on the benchmark itself so there might be that as well (which can go in either directions)

I consider the top models practically identical for coding applications (just personal experience with heavy use of both GPT5.2 and Opus 4.5).

Excited to see how this model compares in real applications. It's 1/5th of the price of top models!!

Balinarestoday at 12:55 PM

I replaced Opus with Gemini Pro and it's just plain a better coder IMO. It'll restructure code to enable support for new requirements where Opus seems to just pile on more indirection layers by default, when it doesn't outright hardcode special cases inside existing functions, or drop the cases it's failing to support from the requirements while smugly informing you you don't need that anyway.

show 1 reply
MattRixtoday at 12:36 PM

Opus 4.5 only came out two months ago, and yes Anthropic spends a lot of effort making it particularly good at coding.