logoalt Hacker News

simonw04/23/20251 replyview on HN

How long ago was this? I'd be surprised to see Claude 3.7 Sonnet make a mistake of this nature.

Either way, when a model starts making dumb mistakes like that these days I start a fresh conversation (to blow away all of the bad tokens in the current one), either with that model or another one.

I often switch from Claude 3.7 Sonnet to o3 or o4-mini these days. I paste in the most recent "good" version of the thing we're working on and prompt from there.


Replies

th0ma504/24/2025

Lol, "it didn't do it... and if it did it didn't mean it... and if it meant it it surely can't mean it now." This is unserious.

show 2 replies