Trivia: Claude Shannon proposed the idea of predicting the next token (letter) using statistics/...

tzumaoli • today at 6:15 PM • 2 replies

Trivia: Claude Shannon proposed the idea of predicting the next token (letter) using statistics/probabilities in the training data corpus in 1950: "Prediction and Entropy of Printed English" https://languagelog.ldc.upenn.edu/myl/Shannon1950.pdf

Replies

Anon84 • today at 7:14 PM

It goes back a bit further than that. His 1948 “A Mathematical Theory of Communication” [1] already has (what we would now call) a Markov chain language model, page 7 onwards. AFAIK, this was based on his classified WWII work, so it was probably a few years older than that.

[1] https://people.math.harvard.edu/~ctm/home/text/others/shanno...
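
For illustration, here is a minimal sketch of a letter-level Markov chain (bigram) model of the kind described above: it counts which letter follows which in a corpus and predicts the most frequent successor. The function names and the toy corpus are hypothetical and are not taken from Shannon's paper.

```python
from collections import Counter, defaultdict


def train_bigram_model(text):
    """Count how often each character follows each other character."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(text, text[1:]):
        counts[prev][nxt] += 1
    return counts


def next_letter_probs(counts, prev):
    """Estimate P(next | prev) from the counts; empty dict if prev is unseen."""
    followers = counts.get(prev, Counter())
    total = sum(followers.values())
    if total == 0:
        return {}
    return {ch: n / total for ch, n in followers.items()}


def predict_next(counts, prev):
    """Return the most frequent successor of prev, or None if prev is unseen."""
    followers = counts.get(prev)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Toy corpus (an assumption for the example, far too small for real statistics).
corpus = "the theory of the thing"
model = train_bigram_model(corpus)

print(predict_next(model, "t"))       # most likely letter after "t"
print(next_letter_probs(model, "h"))  # distribution over letters after "h"
```

Conditioning on the previous two or three letters instead of one gives the higher-order approximations; the counting step stays the same, only the key changes from a single character to a short string.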

Trinicode • today at 9:14 PM

A letter is not a token, is it? Redundancy could hit 75% in long sentences, but Shannon was not predicting tokens or words, he was predicting letters (characters).
