Hacker News

Karrot_Kream · yesterday at 11:57 PM · 6 replies

According to the OpenASR Leaderboard [1], looks like Parakeet V2/V3 and Canary-Qwen (a Qwen finetune) handily beat Moonshine. All 3 models are open, but Parakeet is the smallest of the 3. I use Parakeet V3 with Handy and it works great locally for me.

[1]: https://huggingface.co/spaces/hf-audio/open_asr_leaderboard


Replies

reitzensteinm · today at 1:18 AM

Parakeet V3 has over twice the parameter count of Moonshine Medium (600M vs 245M), so it's not an apples-to-apples comparison.

I'm actually a little surprised they haven't added model size to that chart.

theologic · today at 1:49 AM

By the way, I've been using a Whisper model, specifically WhisperX, to do all my work, and for whatever reason I just simply was not familiar with the Handy app. I've now downloaded and used it, and what a great suggestion. Thank you for putting it here, along with the direct link to the leaderboard.

I can tell that this is now definitely going to be my go-to model and app on all my clients.

tuananh · today at 3:27 AM

Handy is amazing. Super quality app.

agentifysh · today at 4:16 AM

hmmm, looks like AssemblyAI is still unbeatable here in terms of cost/performance, unless I'm mistaken

edit: holy shit, Parakeet is good... Moonshine is impressive too, and it has half the parameter count

Now if only there was something just as quick as Parakeet V3 for TTS! Then I could talk to Codex all day long!

syntaxing · today at 2:07 AM

How much VRAM does Parakeet take for you? For some reason it takes 4GB+ for me using the ONNX version, even though it’s 600M parameters.
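
For a rough sanity check on that number, here is a back-of-the-envelope sketch of what the weights alone should occupy at a few common precisions. It assumes the 600M figure quoted in the thread and counts only the weights; activations, runtime workspace, and any duplicate copies held by the inference runtime are not modeled, so real usage will be higher. The helper name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(num_params: int, bytes_per_param: int) -> float:
    """Memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Parameter count as quoted in the thread (assumed, not measured).
PARAMS = 600_000_000

# Bytes per parameter for a few common storage precisions.
for name, width in [("fp32", 4), ("fp16", 2), ("int8", 1)]:
    print(f"{name}: {weight_memory_gb(PARAMS, width):.1f} GB")
```

By this estimate, fp32 weights come to about 2.4 GB, so a 4GB+ footprint would mean a sizable share is going to something other than the weights themselves.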

tomr75 · today at 2:23 AM

Why V3 over V2 (assuming English only)?