logoalt Hacker News

thereddaikon12/10/20241 replyview on HN

I've heard enough slop using the ElevenLabs voices that I can recognize them almost immediately now. But you're right. Higher end models with less familiar voices are harder to notice. One consistent failing is that they are always too perfect. No mistakes or signs of cuts to edit out where a human VA would have made a mistake. Its all very smooth and perfect. As if they nailed it in the first shot. Once the cheap/free models manage to fix that then we are in real trouble. Also, some really lazy slop creators don't bother to fix issues with pronunciation. But that's not the fault of the model really.


Replies

bumbledraven12/13/2024

"More human than human" is our motto. https://youtu.be/ZbgmYhqFO-4?t=30