imtringued, last Thursday at 7:49 AM

The Q, K, and V matrices form neural networks at runtime; that's the entire point.
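One way to read that claim, as a rough sketch rather than anything definitive: in single-head self-attention the mixing matrix softmax(QKᵀ/√d) is recomputed from the input on every forward pass, so it behaves like a linear layer whose weights are generated at runtime and then applied to V. The helper names (`matmul`, `attention_weights`, `attention`) and the toy identity projections below are my own assumptions for illustration, not taken from any library.

```python
import math

def matmul(a, b):
    # Plain-Python matrix product of two lists of rows.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    # Numerically stable softmax over one row.
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(x, w_q, w_k):
    # The "runtime-built" layer: a token-mixing matrix computed from the input x.
    q = matmul(x, w_q)
    k = matmul(x, w_k)
    d = len(q[0])
    k_t = [list(col) for col in zip(*k)]  # transpose of k
    scores = matmul(q, k_t)
    return [softmax([s / math.sqrt(d) for s in row]) for row in scores]

def attention(x, w_q, w_k, w_v):
    # Apply the input-dependent mixing matrix to the value vectors.
    a = attention_weights(x, w_q, w_k)
    v = matmul(x, w_v)
    return matmul(a, v)

# Hypothetical toy setup: identity projections, two different inputs.
identity = [[1.0, 0.0], [0.0, 1.0]]
x_distinct = [[1.0, 0.0], [0.0, 1.0]]   # two different tokens
x_repeated = [[1.0, 0.0], [1.0, 0.0]]   # the same token twice

a_distinct = attention_weights(x_distinct, identity, identity)
a_repeated = attention_weights(x_repeated, identity, identity)
out_repeated = attention(x_repeated, identity, identity, identity)
```

The learned projections are identical in both calls, yet `a_distinct` and `a_repeated` differ, which is the sense in which the effective weights are only fixed once the input arrives.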