Interesting. I wonder how memory is handled due to the # of potential characters in UTF-8, or maybe only a subset of characters are allowed. Or is there a TSR handling that from a database on disk.
I cannot get to the twitter site and xcancel just loops so I could n0t see the post.
Modern UTF-8 encoding and present day tools makes it relatively easy to make many codepoints work relatively better even on DOS, thanks to readily available bitmap fonts.