LLMs produce a distribution of token probabilities which is then sampled. This sampling is the only ...

Hugsun • today at 4:54 PM • 0 replies • view on HN

LLMs produce a distribution of token probabilities which is then sampled. This sampling is the only random part of the system.

If you just take the most probable token every time, the system becomes fully deterministic. We don't do this as the output becomes more stiff and less creative.

alt Hacker News