The point is that it's the same process with—much—better priors.
This seems like a reasonable view to me. It's surprising just how much better priors matter and how we can develop those priors by training on a bunch of text. But it also explains, or at least hints at an explanation, for why LLM capabilities are so jagged, and in such inhuman ways.
> The point is that it's the same process
Except it’s not at all the same process. The fact that LLM are non deterministic is not the same as churning out random garbage.