logoalt Hacker News

Koshkinyesterday at 8:51 PM2 repliesview on HN

(Going on a tangent.) The number of transformer explanations/tutorials is becoming overwhelming. Reminds me of monads (or maybe calculus). Someone feels a spark of enlightenment at some point (while, often, in fact, remaining deeply confused), and an urge to share their newly acquired (mis)understanding with a wide audience.


Replies

nospiceyesterday at 9:12 PM

So?

There's no rule that the internet is limited to a single explanation. Find the one that clicks for you, ignore the rest. Whenever I'm trying to learn about concepts in mathematics, computer science, physics, or electronics, I often find that the first or the "canonical" explanation is hard for me to parse. I'm thankful for having options 2 through 10.

kadushkayesterday at 9:15 PM

Maybe so, but this particular blog post was the first and is still the best explanation of how transformers work.