Isn’t the limit exactly what you’re describing? There’s always uncertainty, and your asymptote can a...

FromTheFirstIn • last Saturday at 12:13 PM • 1 reply • view on HN

Isn’t the limit exactly what you’re describing? There’s always uncertainty, and your asymptote can approach its limit but it does have a limit. That’s the limit to the intelligence. And this is just for cross entropy loss- even if you could get loss to 0, I’m still not convinced at all that an enormous semantic map and its convoluted geometries amounts to intelligence.

Replies

aspenmartin • last Saturday at 3:58 PM

If you get to E you have generated a Bayes-optimal model of the conditional distribution (as in, next token conditional on context). This is something I thought too, but even if you're a fraction of a nat above the floor, you could have enormous headroom in performance left because there are still rare tokens amongst the irreducible noise that require so much capability to predict. It's not to suggest there truly is no cap on capability, but just that this constant isn't really saying what that is.

➕ show 1 reply

alt Hacker News

Replies