Hacker News

e12e 10/01/2024

Are you looking into speech-to-speech (no text) models?


Replies

hassaanr 10/01/2024

Yeah, we are! The issues we're seeing with speech-to-speech models are controllability and hallucinations, which we're still trying to work through.