Great post last night from Simon:

CubsFan1060 • today at 12:44 PM • 2 replies • view on HN

Great post last night from Simon: https://simonwillison.net/2026/Apr/27/vibevoice/

Replies

Note that this just covers the Speech-to-Text/Speech-Recognition aspect (a-la whisper), there's also models for long-form Text-To-Speech and steaming Text-To-Speech.

JumpCrisscross • today at 1:23 PM

“VibeVoice can only handle up to an hour of audio”

Why?

alt Hacker News

Replies