logoalt Hacker News

sbrotheryesterday at 4:54 PM2 repliesview on HN

Do you have experience with that model for diarization? Does it feel accurate, and what's its realtime factor on a typical GPU? Diarization has been the biggest thorn in my side for a long time..


Replies

ashenkeyesterday at 7:03 PM

You can test it yourself for free on https://console.mistral.ai/build/audio/speech-to-text I tried it on an english-speaking podcast episode, and apart from identying one host as two different speakers (but only once for a few sentences at the start), the rest was flawless from what I could see

show 1 reply
coder543yesterday at 5:22 PM

> Do you have experience with that model

No, I just heard about it this morning.