logoalt Hacker News

ericmceryesterday at 11:05 PM1 replyview on HN

yeh exactly, you cannot get a strong signal that a user is done speaking without some amount of “wait for 500ms of silence”. You could kick of processing and abandon if they continued talking, but that seems over optimized.

1-2s replies feel natural and like you pointed out pausing for 2-3s mid sentence is super normal.


Replies

charcircuittoday at 1:08 AM

The AI should be able to model a probability for when is a natural moment to start talking.