Do you have evals for this claim? I don't really experience this

whycombinetor • today at 1:04 PM • 2 replies • view on HN

spixy • today at 6:02 PM

quick search:

noosphr • today at 1:08 PM

If given A and not B llms often just output B after the context window gets large enough.

It's enough of a problem that it's in my private benchmarks for all new models.

➕ show 1 reply

alt Hacker News