The LLMs are deterministic but they only return a probability distribution over following tokens. Th...

geysersam • last Tuesday at 8:29 PM • 1 reply • view on HN

The LLMs are deterministic but they only return a probability distribution over following tokens. The tokens the user sees in the response are selected by some typically stochastic sampling procedure.

Replies

danielmarkbruce • last Tuesday at 10:28 PM

Assuming decent data, it won't be stochastic sampling for many math operations/input combinations. When people suggest LLMs with tokenization could learn math, they aren't suggesting a small undertrained model trained on crappy data.

➕ show 1 reply

alt Hacker News

Replies