GaggiX yesterday at 5:43 PM

There is also: https://github.com/linto-ai/whisper-timestamped

It doesn't use an extra model, so it supports every language that Whisper handles out of the box and uses less memory. It works by applying Dynamic Time Warping to the cross-attention weights.
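
For anyone curious what the DTW step looks like, here is a minimal sketch in plain Python. It is not whisper-timestamped's actual code: the function names (`dtw_path`, `token_start_frames`) and the toy score matrix are made up for illustration, and it assumes the cross-attention weights have already been normalized into a tokens-by-frames matrix where higher means "this token attends to this audio frame". The real tool presumably does extra work (picking attention heads, filtering, grouping tokens into words) that is skipped here.

```python
def dtw_path(cost):
    """Cheapest monotonic path through cost[token][frame], first cell to last."""
    n, m = len(cost), len(cost[0])
    inf = float("inf")
    # acc[i][j] = cheapest cumulative cost of reaching cell (i-1, j-1)
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            acc[i][j] = cost[i - 1][j - 1] + min(
                acc[i - 1][j - 1],  # advance token and frame together
                acc[i - 1][j],      # advance token only
                acc[i][j - 1],      # advance frame only
            )
    # Walk back from the last cell, always stepping to the cheapest predecessor.
    i, j = n, m
    path = []
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (acc[i - 1][j - 1], i - 1, j - 1),
            (acc[i - 1][j], i - 1, j),
            (acc[i][j - 1], i, j - 1),
        )
    path.reverse()
    return path


def token_start_frames(scores):
    """First audio frame aligned to each token; scores[token][frame], higher = stronger."""
    # DTW minimizes cost, so negate the scores to favor strongly attended cells.
    cost = [[-s for s in row] for row in scores]
    starts = {}
    for token, frame in dtw_path(cost):
        starts.setdefault(token, frame)  # keep only the first frame per token
    return [starts[t] for t in range(len(scores))]


# Toy, hand-made scores (3 tokens x 6 frames), not real Whisper output.
scores = [
    [2.0, 2.0, -1.0, -1.0, -1.0, -1.0],
    [-1.0, -1.0, 2.0, 2.0, -1.0, -1.0],
    [-1.0, -1.0, -1.0, -1.0, 2.0, 2.0],
]
print(token_start_frames(scores))
```

Multiplying each start frame by the duration of one audio frame would give a timestamp in seconds; that duration depends on the model's encoder, so it is left out of the sketch.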


Replies

oezi yesterday at 7:35 PM

Just a warning that plain WhisperX is more accurate and Whisper-timestamped has many weird quirks.