I love this because it gets to the heart of information theory. Shannon's foundational insight was that information is surprise: a truly random sequence is incompressible by definition, but what counts as surprise depends on context, and for text we know a large amount of it is predictable slop. I suspect there's a lot of room to push this style of compression further. For example, maybe you could store an upfront summary that makes prediction more accurate, or encode larger sequences, or use some kind of hierarchical encoding. But this is great.
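To make the "information is surprise" point concrete: Shannon's source coding theorem says the ideal code length of a symbol is its surprisal, -log2 p(symbol), so a better predictor directly means a smaller compressed size. Here's a minimal sketch of that idea — the `toy_context` model is a made-up illustration, not any real compressor:

```python
import math

def bits_needed(text, model):
    """Ideal compressed size in bits for `text` under a predictive model.
    `model(prefix)` returns a dict of next-character probabilities; the
    cost of each character is its surprisal, -log2 p."""
    total = 0.0
    for i, ch in enumerate(text):
        p = model(text[:i]).get(ch, 1e-9)  # tiny floor for unseen chars
        total += -math.log2(p)
    return total

def uniform(prefix):
    # No prediction: every byte equally likely, so 8 bits per character.
    return {chr(c): 1 / 256 for c in range(256)}

def toy_context(prefix):
    # Hypothetical context model: after 'q', bet heavily on 'u';
    # otherwise fall back to the uniform model.
    if prefix.endswith('q'):
        return {**{chr(c): 0.1 / 256 for c in range(256)}, 'u': 0.9}
    return uniform(prefix)

text = "qu" * 10
print(bits_needed(text, uniform))      # 160.0 bits: no prediction
print(bits_needed(text, toy_context))  # ≈ 81.5 bits: 'u' after 'q' is cheap
```

The predictable half of the text becomes nearly free under the better model, which is exactly why an upfront summary (or any side information that sharpens the predictions) could buy additional compression.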
Yes! Information is surprise, and that's why one measure of intelligence is the ability to predict.