542458 • today at 12:53 PM • 0 replies

Note that this covers only the Speech-to-Text/Speech-Recognition aspect (à la Whisper); there are also models for long-form Text-to-Speech and streaming Text-to-Speech.