logoalt Hacker News

torawayyesterday at 6:28 PM1 replyview on HN

Whenever an "LLM fail" goes viral like the car wash question, you can observe the exact same wording of the question get "fixed" within a week or so. With slight variations in phrasing still able to replicate the problem.

Followed by lots of "works perfectly for me, why are people even talking about this?"

I can't say what exactly they're doing behind the scenes but it's a consistent pattern among the big SOTA model providers. With obvious incentive to "fix" the problem so users will then organically "debunk" the meme as they try it themselves and share their experiences.


Replies

simianwordsyesterday at 6:49 PM

You are misremembering. There’s no patch. All these examples used the instant model.