alt
Hacker News
mountainriver
•
05/15/2025
•
0 replies
•
view on HN
It’s a problem specific to autoregressive LLMs, the early tokens bias the output