logoalt Hacker News

floppydlast Sunday at 7:49 AM1 replyview on HN

I tried Kokoro for voicing blog posts and articles and wasn't impressed to be honest. Right now Gemini 2.5 Flash TTS is a much more capable system with generous free limits (about 10 minutes per generation and about 90 minutes per day). Voices are not very consistent between generations, but for shorter pieces it's not a big deal (but will obviously be for books)


Replies

ekianjolast Sunday at 8:12 AM

Kokoro is fine for TTS, but it lacks emotion. But for a model of this size, that is kind of given.

show 2 replies