logoalt Hacker News

Show HN: Microgpt is a GPT you can visualize in the browser

55 pointsby b44today at 6:40 PM4 commentsview on HN

very much inspired by karpathy's microgpt of the same name. it's (by default) a 4000 param GPT/LLM/NN that learns to generate names. this is sorta an educational tool in that you can visualize the activations as they pass through the network, and click on things to get an explanation of them.


Comments

kfsonetoday at 9:18 PM

Minor nit: In familiarity, you gloss over the fact that it's character rather than token based which might be worth a shout out:

"Microgpt's larger cousins using building blocks called tokens representing one or more letters. That's hard to reason about, but essential for building sentences and conversations.

"So we'll just deal with spelling names using the English alphabet. That gives us 26 tokens, one for each letter."

show 1 reply
mslatoday at 10:02 PM

About how many training steps are required to get good output?

show 1 reply