> There are grammar rules And they're made out of weights.

throw310822 • today at 2:38 AM • 1 reply • view on HN

> There are grammar rules

And they're made out of weights.

As opposed to integers in normal programming.

The 'magic' in weights is that the rules are spread through the whole model and you can't point to one place which encodes them.

The grokking paper shows that this stops being the case with enough training data and enough compute.

➕ show 1 reply

alt Hacker News