logoalt Hacker News

nwellnhoflast Tuesday at 2:54 PM1 replyview on HN

Normalization forms NFKC and NFKD that also handle compatibility equivalence do.


Replies

mananaysiemprelast Tuesday at 3:20 PM

A few deprecated characters, including the Kelvin and Ångström symbols, are in fact canonically equivalent to their replacements and not just compatibility equivalent, so plain NFC/NFD is enough. (It’s generally better to avoid NFKC/NFKD normalizations unless you fully understand the implications, as they do lose meaning and at the same time do not account for all possible confusables.)