I don't think that's practical, because the subset would still be quite large. I did a quick analysis of the Factorio localization files and found more than 1200 unique characters.
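For anyone who wants to reproduce that kind of count, here is a rough sketch of how it could be done. It is not the exact script I used. The `*.cfg` pattern and the UTF-8 encoding are assumptions about how the locale files are stored, and `unique_chars` and `read_locale_files` are just names made up for this example.

```python
from pathlib import Path


def unique_chars(texts):
    """Return the set of distinct non-whitespace characters across all strings."""
    chars = set()
    for text in texts:
        chars.update(ch for ch in text if not ch.isspace())
    return chars


def read_locale_files(root, pattern="*.cfg"):
    """Yield the contents of every file under root that matches pattern.

    Assumption: files are UTF-8 text; undecodable bytes are skipped.
    """
    for path in Path(root).rglob(pattern):
        yield path.read_text(encoding="utf-8", errors="ignore")
```

Calling `len(unique_chars(read_locale_files(some_dir)))` on the directory that holds the locale files gives the number of distinct characters a font subset would have to cover. Note that this counts code points, not glyphs, so the real font requirement could differ somewhat.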