logoalt Hacker News

jahsomeyesterday at 6:30 PM6 repliesview on HN

Another day, another hn thread of "this model changes everything" followed immediately by a reply stating "actually I have the literal opposite experience and find competitor's model is the best" repeated until it's time to start the next day's thread.


Replies

StephenHerlihyyyesterday at 7:50 PM

What amazes me the most is the speed at which things are advancing. Go back a year or even a year before that and all these incremental improvements have compounded. Things that used to require real effort to consistently solve, either with RAGs, context/prompt engineering, have become… trivial. I totally agree with your point that each step along the way doesn’t necessarily change that much. But in the aggregate it’s sort of insane how fast everything is moving.

show 1 reply
SatvikBeriyesterday at 7:59 PM

I use Claude Code every day, and I'm not certain I could tell the difference between Opus 4.5 and Opus 4.0 if you gave me a blind test

clhodappyesterday at 7:31 PM

And of course the benchmarks are from the school of "It's better to have a bad metric than no metric", so there really isn't any way to falsify anyone's opinions...

malsheyesterday at 6:37 PM

This pretty accurately summarizes all the long discussions about AI models on HN.

cactusplant7374yesterday at 7:32 PM

Hourly occurrence on /r/codex. Model astrology is about the vibes.

wasmainiacyesterday at 6:38 PM

[flagged]

show 3 replies