
jgammell · today at 1:35 AM

When sampling from an LLM, people normally truncate the token probability distribution (e.g. with top-k or top-p/nucleus sampling) so that low-probability tokens are never sampled. So the model shouldn't produce really weird outputs even if those tokens technically have nonzero probability under the model after pre/post-training.
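
To make the idea concrete, here's a rough numpy sketch of top-p (nucleus) sampling, one common truncation scheme; the function name `sample_truncated` and the example logits are made up for illustration:

    import numpy as np

    def sample_truncated(logits, top_p=0.9, rng=None):
        """Nucleus (top-p) sampling: keep only the smallest set of tokens
        whose cumulative probability reaches top_p, renormalize, sample."""
        rng = rng or np.random.default_rng()
        # Softmax (shifted by the max for numerical stability).
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()
        # Sort tokens by descending probability and find the smallest
        # prefix whose cumulative mass covers top_p.
        order = np.argsort(probs)[::-1]
        cum = np.cumsum(probs[order])
        cutoff = np.searchsorted(cum, top_p) + 1
        kept = order[:cutoff]
        # Renormalize over the kept tokens; everything else has
        # exactly zero chance of being drawn.
        kept_probs = probs[kept] / probs[kept].sum()
        return rng.choice(kept, p=kept_probs)

    # The last token has nonzero probability under the model, but it
    # falls outside the nucleus and can never be sampled.
    logits = np.array([5.0, 4.5, 4.0, -3.0])
    print(sample_truncated(logits, top_p=0.9))  # only ever 0, 1, or 2

The point is that truncation turns "astronomically unlikely" into "impossible": once a token is cut from the nucleus, its sampling probability is exactly zero, not merely small.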