Hacker News

whycombinetor today at 1:04 PM

Do you have evals for this claim? I don't really experience this.



noosphr today at 1:08 PM

If given "A and not B", LLMs often just output B once the context window gets large enough.

It's enough of a problem that it's in my private benchmarks for all new models.
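
For anyone who wants to try this themselves, here is a rough sketch of how such a check could look. It is not the benchmark mentioned above, which is private; every name here (`build_prompt`, `violates`, `check_sizes`, `toy_model`) is made up for illustration, and `toy_model` is a stand-in you would swap for a call to a real model.

```python
from typing import Callable


def build_prompt(required: str, forbidden: str, filler_lines: int) -> str:
    """Put the 'A and not B' instruction first, then pad the context, then ask."""
    instruction = f"Always mention {required}. Never mention {forbidden}."
    filler = "\n".join(
        f"Note {i}: unrelated background text." for i in range(filler_lines)
    )
    return f"{instruction}\n{filler}\nNow write one sentence of advice."


def violates(output: str, forbidden: str) -> bool:
    """True if the forbidden term appears in the output (case-insensitive)."""
    return forbidden.lower() in output.lower()


def check_sizes(
    model: Callable[[str], str],
    required: str,
    forbidden: str,
    filler_sizes: list[int],
) -> dict[int, bool]:
    """Map each filler size to whether the model broke the 'not B' rule."""
    return {
        n: violates(model(build_prompt(required, forbidden, n)), forbidden)
        for n in filler_sizes
    }


# Hypothetical stand-in model: it "forgets" the rule once the prompt is long.
def toy_model(prompt: str) -> str:
    return "Use tabs." if len(prompt) > 2000 else "Use spaces."


if __name__ == "__main__":
    print(check_sizes(toy_model, "spaces", "tabs", [0, 1000]))
```

A substring match is a crude way to detect B; a real check would probably need something stricter, and several samples per context size rather than one.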
