logoalt Hacker News

chuckadamstoday at 4:05 PM0 repliesview on HN

> surrogates, regardless of whether they’re paired, are invalid in UTF-8

Java did not get the memo. Since the char type is fixed at 16 bits, it uses surrogates to encode everything outside the BMP, regardless of the encoding.