logoalt Hacker News

Lucasoatoyesterday at 10:26 PM1 replyview on HN

Thanks, I’ll try it, even if my experience wasn’t that great with Google models lately (503s)


Replies

dharma1yesterday at 10:32 PM

Give it a shot, 3.1 live one in AI studio/API and max out reasoning - not the one in Gemini app it’s an older model.

Another option is to use pipecat with their VAD and separate STT and TTS and any (fast) LLM of your choice - but it’s more plumbing and not a true speech to speech model

show 1 reply