Hacker News

sigmoid10, today at 9:40 AM

I have lots of super high-quality, clean audio recordings of her, ripped from an old video game she did voice work for. I've tried various TTS models on them over the years. Getting the pitch and tone right is easy, but getting the impersonal, detached, robot-y feeling is kinda tricky. But I haven't tried in the past 6 months, so maybe it's time to give it another shot.



isoprophlex, today at 9:59 AM

https://github.com/jarombouts/star-trek-voice-clone

audio files sourced from https://www.trekcore.com/audio/

the inflection and impersonal feel are definitely hard to get right. there are parameters in the elevenlabs API docs to make the voice more stable (= monotonous; see speak.sh in that repo), but the voice cloner on my $5 plan still doesn't really get it right.
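
for anyone curious what "more stable" looks like in a request, here is a rough sketch. it is not the speak.sh from the repo (not reproduced here). it assumes the public ElevenLabs text-to-speech endpoint and its `voice_settings` fields (`stability`, `similarity_boost`) as I understand them from the docs; the function names, the default values, and the voice ID and key placeholders are all made up for illustration.

```python
import json
import urllib.request

# Assumed endpoint shape, per the public ElevenLabs API docs.
API_URL = "https://api.elevenlabs.io/v1/text-to-speech/{voice_id}"


def build_tts_request(text, voice_id, api_key, stability=0.9, similarity_boost=0.75):
    """Build (but do not send) a text-to-speech request.

    A higher `stability` should give a flatter, more monotonous delivery.
    The defaults are illustrative guesses, not tuned values.
    """
    if not 0.0 <= stability <= 1.0:
        raise ValueError("stability must be between 0.0 and 1.0")
    payload = {
        "text": text,
        "voice_settings": {
            "stability": stability,
            "similarity_boost": similarity_boost,
        },
    }
    return urllib.request.Request(
        API_URL.format(voice_id=voice_id),
        data=json.dumps(payload).encode("utf-8"),
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def speak_to_file(text, voice_id, api_key, out_path, **settings):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_tts_request(text, voice_id, api_key, **settings)
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as f:
        f.write(response.read())
```

calling `speak_to_file("Joke detected.", "YOUR_VOICE_ID", "YOUR_API_KEY", "out.mp3", stability=1.0)` would be the flattest setting under these assumptions. building the request separately from sending it makes it easy to inspect the payload without spending credits.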

nevertheless... i'm still having a lot of fun with this.

edit: if I am forced to rot my brain with the 10x productivity boosting slop gun, at least I'll do it grinning

     > pod cleaned up. waiting on the behemoth to finish grinding through Italy.
     < if only postgres had progress indicators

       ... then they coulda called it progresql
     > lmaooo
     > Bash(~/speak.sh "Joke detected. Humor subroutine engaged. Ha. Ha. Ha.")