logoalt Hacker News

tikhonjyesterday at 10:41 PM1 replyview on HN

The point is that it's the same process with—much—better priors.

This seems like a reasonable view to me. It's surprising just how much better priors matter and how we can develop those priors by training on a bunch of text. But it also explains, or at least hints at an explanation, for why LLM capabilities are so jagged, and in such inhuman ways.


Replies

dparkyesterday at 11:27 PM

> The point is that it's the same process

Except it’s not at all the same process. The fact that LLM are non deterministic is not the same as churning out random garbage.

show 1 reply