jahsome • yesterday at 6:30 PM • 6 replies

Another day, another HN thread of "this model changes everything" followed immediately by a reply stating "actually I have the literal opposite experience and find competitor's model is the best" repeated until it's time to start the next day's thread.

Replies

StephenHerlihyy • yesterday at 7:50 PM

What amazes me the most is the speed at which things are advancing. Go back a year, or even two, and you can see how all these incremental improvements have compounded. Things that used to require real effort to solve consistently, whether with RAG or context/prompt engineering, have become… trivial. I totally agree with your point that each step along the way doesn’t necessarily change that much. But in the aggregate it’s sort of insane how fast everything is moving.

SatvikBeri • yesterday at 7:59 PM

I use Claude Code every day, and I'm not certain I could tell the difference between Opus 4.5 and Opus 4.0 if you gave me a blind test.

clhodapp • yesterday at 7:31 PM

And of course the benchmarks are from the school of "It's better to have a bad metric than no metric", so there really isn't any way to falsify anyone's opinions...

malshe • yesterday at 6:37 PM

This pretty accurately summarizes all the long discussions about AI models on HN.

cactusplant7374 • yesterday at 7:32 PM

Hourly occurrence on /r/codex. Model astrology is about the vibes.

wasmainiac • yesterday at 6:38 PM

[flagged]
