logoalt Hacker News

qwertoxtoday at 9:43 AM0 repliesview on HN

I was also impressed with Handy.

I played around with it this week, and when you enable advanced mode and add a post-transcription AI model to point to your own server which mimics a minimal ChatGPT-compatible behavior, then you can use it to modify the output, even return an empty string if you noticed that the transcript was more targeted to do other stuff ("turn the lights on"), if you then return an empty string, it won't inject keypresses.

So one gets the best for both worlds: transcription for dictation and transcription to trigger events.

If I now only could let it listen constantly and react to voice, so that no push to talk is active, that would be nice.

Maybe this project here could be used for that.

Also, this seems to support streaming transcription.