logoalt Hacker News

GaggiXyesterday at 10:45 PM4 repliesview on HN

I love that everyone is making their own TTS model as they are not as expensive as many other models to train. Also there are plenty of different architecture.

Another recent example: https://github.com/supertone-inc/supertonic


Replies

andaiyesterday at 11:47 PM

In-browser demo of Supertonic with WASM:

https://huggingface.co/spaces/Supertone/supertonic-2

coder543yesterday at 11:30 PM

Another one is Soprano-1.1.

It seems like it is being trained by one person, and it is surprisingly natural for such a small model.

I remember when TTS always meant the most robotic, barely comprehensible voices.

https://www.reddit.com/r/LocalLLaMA/comments/1qcusnt/soprano...

https://huggingface.co/ekwek/Soprano-1.1-80M

nowittyusernametoday at 5:41 AM

Thanks for heads up, this looks really interesting and claimed speed is nuts..

nunobritoyesterday at 10:52 PM

Thank you. Very good suggestion with code available and bindings for so many languages.