logoalt Hacker News

throw310822today at 2:38 AM1 replyview on HN

> There are grammar rules

And they're made out of weights.


Replies

noosphrtoday at 4:53 AM

As opposed to integers in normal programming.

The 'magic' in weights is that the rules are spread through the whole model and you can't point to one place which encodes them.

The grokking paper shows that this stops being the case with enough training data and enough compute.

show 1 reply