Hacker News

dharma1, yesterday at 10:23 PM

Yes, the voice part of OpenAI realtime/voice mode is great, but it’s pretty dumb compared to newer models and often gets stuck repeating itself.

Google’s Gemini flash live 3.1 is better, especially when used via the API: it can do tool calling (including calls out to other, even smarter LLMs if you set that up yourself), you can set the reasoning level (even high is still close enough to realtime), and it can ground answers in Google Search. I love bidirectional voice, and right now it’s probably the best option. You can try it in AI Studio.
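For anyone wondering what "tool calling to a smarter LLM" could look like on the application side, here is a rough sketch. It does not use any real SDK: `ToolCall`, `ask_smarter_model`, `TOOLS`, and `handle_tool_call` are all hypothetical names, and the stub stands in for whatever second model you would actually call. The idea is only that the voice model emits a named tool call, your code routes it to a handler, and the result is sent back into the live session.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class ToolCall:
    """Hypothetical shape of a tool call emitted by the voice model."""
    name: str
    args: dict


def ask_smarter_model(question: str) -> str:
    # Hypothetical stand-in: a real setup would call a second,
    # slower but stronger LLM here and return its answer.
    return f"[stub answer to: {question}]"


# Registry of tools the voice model is allowed to invoke (assumed names).
TOOLS: Dict[str, Callable[..., str]] = {
    "ask_smarter_model": ask_smarter_model,
}


def handle_tool_call(call: ToolCall) -> dict:
    """Route a tool call to its handler and build a response payload."""
    handler = TOOLS.get(call.name)
    if handler is None:
        # Report unknown tools instead of raising, so the voice
        # session can keep going.
        return {"name": call.name, "error": f"unknown tool: {call.name}"}
    return {"name": call.name, "result": handler(**call.args)}
```

In a real session, the dict returned by `handle_tool_call` would be sent back to the voice model as the tool response, in whatever format the API you use expects.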


Replies

Lucasoato, yesterday at 10:26 PM

Thanks, I’ll try it, even though my experience with Google models hasn’t been great lately (503s).
