Hacker News

froh, today at 7:25 AM

> _lazy_ search function developers

doing non-ascii first needs awareness and then quickly becomes tricky (encodings yay).

getting combining characters and/or homoglyphs right is hard.
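
to make the combining-character point concrete, here is a minimal python sketch. the `fold` helper is a made-up name for illustration, not from any library; it only relies on the stdlib `unicodedata` module, and it assumes that stripping accents is acceptable for the search in question (it often isn't).

```python
import unicodedata


def fold(text: str) -> str:
    """Hypothetical search key: decompose, drop combining marks, casefold."""
    # NFKD splits precomposed characters into base character + combining marks
    decomposed = unicodedata.normalize("NFKD", text)
    # combining() returns a non-zero class for combining marks; drop those
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # casefold() is the caseless-matching variant of lower()
    return stripped.casefold()


composed = "na\u00efve"      # ï as a single code point (U+00EF)
decomposed = "nai\u0308ve"   # i followed by U+0308 COMBINING DIAERESIS

print(composed == decomposed)              # False: different code point sequences
print(fold(composed) == fold(decomposed))  # True: both fold to "naive"
print(fold("\u0430") == fold("a"))         # False: Cyrillic а is still not Latin a
```

the last line is the homoglyph part: normalization does nothing for lookalikes from other scripts, so that needs a separate mapping on top.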

and if you're still bored: have fun with Unicode confusables.txt ...

with this in mind I dare to give them lazy bums the benefit of the doubt and rather call them something between naïve and scared.


Replies

mmooss, today at 4:29 PM

ok, fine. :)

Isn't there a library out there for this common set of problems? I know Unicode provides normalization tables, though I don't know how good they are and I don't know if Unicode also provides a library.