Do we know if this is better than Nvidia Parakeet V3? That has been my go-to model locally and it's hard to imagine there's something even better.
I liked Parakeet v3 a lot until it started to drop whole sentences, willy-nilly.
I've been using Parakeet V3 locally and totally ancedotaly this feels more accurate but slightly slower
Came here to ask the same question!
I've been using nemotron ASR with my own ported inference, and happy about it:
https://huggingface.co/nvidia/nemotron-speech-streaming-en-0...
https://github.com/m1el/nemotron-asr.cpp https://huggingface.co/m1el/nemotron-speech-streaming-0.6B-g...