logoalt Hacker News

garblegarbleyesterday at 11:32 PM2 repliesview on HN

For my inputs, whisper distil-large-v3.5 is the best. I tried Parakeet 0.6 v3 last night but it has higher error rates than I'd like (but it is fast...)


Replies

Johnny_Bonkyesterday at 11:35 PM

Nice I'll try it, as of now for my personal stt workflow I use eleven labs api which is pretty generous but curious to play around with other options

show 1 reply
BiraIgnacioyesterday at 11:52 PM

oh I've been looking into whisper and vosk in the last few days. I'll probably go with whisper (with whisper.cpp) but has anyone compared it to vosk models?