logoalt Hacker News

Koaisutoday at 2:23 PM1 replyview on HN

Just tried it with an unsupported language and it still worked I set it to Chinese and inputted the audio. Still got correct results.


Replies

aldertoday at 3:53 PM

Yes, the transcriber API I use (Soniox) actually supports more than 60 languages. I just didn't have any automated testing for them. The way I tested was to find audio with a reliable reference transcription and put it through my pipeline. Then compare the results. Also some languages don't have reliable libraries to get part of speech and lemmas, something that flashcard needs.