Hacker News

albertwang today at 3:30 PM

great news, this looks great! is it just me, or do most of the english audio samples sound like anime voices?


Replies

bityard today at 5:42 PM

Well, if you look at the prompts, they are basically told to sound like that.

And if you ask me, I think these models were trained on tween fiction podcasts. (My kids listen to a lot of these and dramatic over-acting seems to be the industry standard.)

Also, their middle-aged adult with an "American English" accent sounds like no American I've ever met. More like a bad Sean Connery impersonator.

rapind today at 3:57 PM

> do most of the english audio samples sound like anime voices?

100% I was thinking the same thing.

reactordev today at 4:52 PM

The real value I see is being able to clone a voice and change timbre and characteristics of the voice to be able to quickly generate voice overs, narrations, voice acting, etc. It's superb!

devttyeu today at 3:47 PM

Also like some popular youtubers and popular speakers.

thehamkercat today at 4:11 PM

even the Japanese audio samples sound like anime

htrp today at 4:31 PM

subbed audio is better training data (much better than cc data)