Hamiltonian paths and previous work by Donald Knuth is more than likely in the training data.

hmmmmmmmmmmmmmm • yesterday at 5:33 PM • 1 reply • view on HN

Replies

The specific sequence of tokens that comprise the Knuth's problem with an answer to it is not in the training data. A naive probability distribution based on counting token sequences that are present in the training data would assign 0 probability to it. The trained network represents extremely non-naive approach to estimating the ground-truth distribution (the distribution that corresponds to what a human brain might have produced).

➕ show 1 reply

alt Hacker News

Replies