Isn't this project the one Microsoft published but then soon after pulled it for security/safety reasons? What has changed since then?
Look at the "News" section in the readme - The original TTS model is gone from this repo (you can still find it other places), but the SST/ASR, long form TTS, and streaming TTS models are newer.
It’s confusing (at least for me) because the project covers a number of things including what you are mentioning.
Look at the "News" section in the readme - The original TTS model is gone from this repo (you can still find it other places), but the SST/ASR, long form TTS, and streaming TTS models are newer.