logoalt Hacker News

xmcqdpt2today at 1:35 AM0 repliesview on HN

You can understand how transformers work from just reading the Attention is All You Need paper, which is 15 pages of pretty accessible DL. That's not the part that is impressive about LLMs.