
rryan • today at 5:38 AM

Don't make me tap the sign: There is no such thing as "bytes". There are only encodings. UTF-8 is the encoding most people mean when they talk about modeling the "raw bytes" of text. UTF-8 is just a shitty (biased) human-designed tokenizer of Unicode code points.
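A rough sketch of the bias in question, assuming Python 3.9+ and sample strings picked purely for illustration (the helper name `utf8_bytes_per_code_point` is made up for this example). It counts how many UTF-8 bytes each code point costs, which is effectively the "token count" a byte-level model sees:

```python
def utf8_bytes_per_code_point(text: str) -> list[int]:
    """Return the number of UTF-8 bytes used by each code point in text."""
    return [len(ch.encode("utf-8")) for ch in text]


# Illustrative samples only; none of these come from the comment above.
samples = {
    "ASCII": "hello",
    "Cyrillic": "привет",
    "CJK": "你好",
    "Emoji": "🙂",
}

for name, text in samples.items():
    costs = utf8_bytes_per_code_point(text)
    # Same unit of meaning (one code point), different number of "tokens".
    print(f"{name}: {len(text)} code points -> {sum(costs)} bytes {costs}")
```

ASCII gets one byte per code point, Cyrillic two, common CJK three, and most emoji four, so a "raw byte" model spends several times as many steps per character on non-Latin text.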
