logoalt Hacker News

JSR_FDEDtoday at 7:38 AM1 replyview on HN

Why would you deprive the LLM of a signal that indicates how badly it screwed up?


Replies

carsareoktoday at 8:28 AM

Because it's a completion engine and has no notion of "signals".

Swearing was in the texts they were trained on to complete token by token. I suspect it weren't texts with a lot of high-quality reasoning.