spwa4 • last Friday at 7:15 AM • 1 reply

It means there is zero training involved in getting from voice sample to voice duplicate. There used to be models that would take a voice sample, run 5 or 10 training iterations (which of course takes 10 minutes, or a few hours if your hardware is as shitty as mine), and only then duplicate the voice.

With this one, you give the voice sample as part of the input, and it immediately tries to duplicate the voice.
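To make the contrast concrete, here is a toy sketch in Python. It is not how any real TTS model is built: a "voice" is just a list of floats, and the names `speaker_embedding`, `FineTunedCloner`, and `ZeroShotCloner` are made up for illustration. The only point is where the voice sample enters: through a training loop that changes the model, or as one more argument at inference time.

```python
# Toy illustration only: "voices" are plain lists of floats, not audio.
# All names here are hypothetical and do not come from any real library.

def speaker_embedding(sample):
    """Summarize a voice sample as a single number (its mean)."""
    return sum(sample) / len(sample)


class FineTunedCloner:
    """Older style: the voice lives in the weights, so it has to be trained in."""

    def __init__(self):
        self.voice = 0.0  # learned parameter, starts out knowing no voice

    def fit(self, sample, iterations=10, lr=0.5):
        target = speaker_embedding(sample)
        for _ in range(iterations):
            # One gradient step on the loss 0.5 * (voice - target) ** 2.
            self.voice -= lr * (self.voice - target)

    def speak(self, text):
        # Output depends only on the text and the trained parameter.
        return [self.voice + ord(c) / 1000 for c in text]


class ZeroShotCloner:
    """Zero-shot style: nothing is trained; the sample is part of the input."""

    def speak(self, text, sample):
        voice = speaker_embedding(sample)  # computed on the fly, no updates
        return [voice + ord(c) / 1000 for c in text]
```

In this sketch `FineTunedCloner` needs a `fit(sample)` call before `speak(text)` sounds like the target, while `ZeroShotCloner.speak(text, sample)` uses the sample directly and keeps no per-voice state.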

Replies

x3haloed • last Friday at 8:26 AM

Doesn’t NeuTTS work the same way?
