
keybored, last Wednesday at 7:55 AM (3 replies)

Could just be Linux. Filenames are just bytes, so two equivalent Unicode filenames that have been normalized differently could be confusing. I guess?

I guess since I’m too afraid to use non-ASCII in filenames much.
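To make the "just bytes" point concrete, here is a minimal Python sketch. The filename `café.txt` is only an illustrative assumption, not something from this thread. It shows that the NFC and NFD forms of the same visible name encode to different UTF-8 byte sequences, which a filesystem that compares raw bytes would treat as two distinct names.

```python
import unicodedata

# Hypothetical filename, written with an escape so the source itself is unambiguous.
name = "caf\u00e9.txt"

nfc = unicodedata.normalize("NFC", name)  # precomposed: U+00E9
nfd = unicodedata.normalize("NFD", name)  # decomposed: "e" followed by U+0301

nfc_bytes = nfc.encode("utf-8")
nfd_bytes = nfd.encode("utf-8")

print(nfc_bytes)               # b'caf\xc3\xa9.txt'
print(nfd_bytes)               # b'cafe\xcc\x81.txt'
print(nfc_bytes == nfd_bytes)  # False: different bytes, same rendered name

# Normalizing both sides to one form before comparing makes them equal again.
print(unicodedata.normalize("NFC", nfd) == nfc)  # True
```

A tool that wants to treat such names as equal would have to normalize both sides to the same form before comparing them.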


Replies

bawolff, last Wednesday at 8:02 AM

I guess that is fair. If I remember right, Mac uses NFD where literally everyone else in the world uses NFC (Linux might not normalize, but in practice it usually ends up being NFC).

That said, I feel like this is something most tooling could just handle, and not really an issue.

Certainly it's not a problem diffX is going to solve, since it appears to only store charset and not filename normalization rules.

dotancohen, last Wednesday at 8:54 AM

I had this condition a few years ago. A folder shared with Dropbox was then renormalized either by Dropbox or by another system, then when it was synced back to the original machine I had two folders with identical names, normalized differently.

I still have some ls and hd output that I stored in my notes files, if anybody is interested.
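For anyone wanting to check for this kind of duplicate, a rough sketch in Python follows. The helper name `normalization_collisions` and the sample names are hypothetical, and it assumes NFC is an acceptable canonical form for comparison. It groups names by their NFC form and reports groups containing more than one distinct spelling.

```python
import unicodedata
from collections import defaultdict


def normalization_collisions(names):
    """Return groups of distinct names that become identical after NFC normalization."""
    groups = defaultdict(list)
    for name in names:
        groups[unicodedata.normalize("NFC", name)].append(name)
    # Keep only groups with more than one distinct underlying string.
    return [group for group in groups.values() if len(set(group)) > 1]


# Illustrative input: the same folder name in NFD and NFC, plus an unrelated one.
sample = ["Re\u0301sume\u0301", "R\u00e9sum\u00e9", "notes"]
collisions = normalization_collisions(sample)
print(len(collisions))     # 1
print(len(collisions[0]))  # 2
```

In practice the list could come from `os.listdir(path)` on the affected directory, with the caveat that names which are not valid in the filesystem encoding would need separate handling.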

Joker_vD, last Wednesday at 5:03 PM

> I’m too afraid to use non-ASCII in filenames much.

I suggest installing a fresh Linux distribution with e.g. bg_BG.UTF-8 locale and playing with it, especially with XDG directories like "Плот", "Свалени" and "Документи", and apps that should use them by default. Everything should Just Work™.

Although I admit that when reporting bugs for apps that can't handle non-ASCII paths, the responses from the developers (unless they're themselves from non-English speaking countries, but sometimes even then) quite often seem to be very thinly veiled "I can't be bothered to figure out where I botch things, why can't you just speak English like all reasonable people".
