logoalt Hacker News

serflast Tuesday at 5:51 PM1 replyview on HN

It's a modeling of language, it's not structurally anything like an LLM.


Replies

mattkrauselast Tuesday at 6:10 PM

It’s literally a trigram (character) language model. Check any NLP book from before 2015 or so.

LLMs have more stuff bolted onto them (embeddings, RLHF) but the autoregressive core is a direct descendent of that sort of language model.

show 1 reply