Weirdly, this is in line with Unicode in general. Widespread (and sometimes not even widespread) historic use in, say, print results in characters getting included.