Hacker News

theappsecguy, last Wednesday at 3:59 AM

It’s incredibly tiring to see this narrative peddled every damn day. I use Opus 4.5 every day. It’s not much different from any previous model; it still does dumb things all the time.


Replies

gpm, last Wednesday at 4:01 AM

Same experience - I've had it fail at the same reasonably simple tasks that Opus 4, Sonnet 4.5, and Sonnet 4 failed at when they weren't carefully guided and their work checked and fixed...