logoalt Hacker News

kelseyfrogyesterday at 10:07 PM1 replyview on HN

Can we not sample indefinitely from the latent space of vocal and delivery characteristics?


Replies

parpfishyesterday at 11:16 PM

the "latent space containing all voices" may give you the ability to parametrize voices and make an infinite number of unique voices. BUT... people have a limited ability to distinguish points in that space.

in perceptual psychology/psychophysics, there's the concept of the "just-noticeable difference" (JND) which is the smallest change to a stimulus you can make that is reliable detectable.

normally the JND is measured on physical properties like brightness, pitch, etc but there's no reason it couldn't be applied to a more abstract latent space. two points in a particular latent space may be mathematically unique, but if they're indistinguishable to humans we shouldn't treat them as distinct voices

show 1 reply