logoalt Hacker News

leobglast Sunday at 10:23 AM0 repliesview on HN

Kokoro is small and fast because all the text -> phoneme conversion is done by “dumb code” and only the phoneme -> sound part is done using a neural net.