logoalt Hacker News

aeternumtoday at 1:04 AM0 repliesview on HN

Hallucinations are also trained by the incentive structure: reward for next-token prediction, no penalty for guessing.