Hacker News

intended, today at 5:28 PM

There are many nation states working on this. Have you looked into the availability of those datasets?

What languages are you prioritizing?


Replies

ks2048, today at 5:41 PM

Yes, there are government datasets, language "academies" (or "regulators"), which are organizations focused on preserving and teaching the language, and often smaller, local publishers that publish material in their local language.

I'm living in Guatemala, so I have been focusing on the Mayan languages here (22 languages, millions of speakers).
