logoalt Hacker News

Handy – Free open source speech-to-text app

247 pointsby tin7in01/15/2026110 commentsview on HN

Comments

d4rkp4ttern01/15/2026

I’ve tried several, including this one, and I’ve settled on VoiceInk (local, one-time payment), and with Parakeet V3 it’s stunningly fast (near-instant) and accurate enough to talk to LLMs/code-agents, in the sense that the slight drop in accuracy relative to Whisper Turbo3 is immaterial since they can “read between the lines” anyway.

My regular cycle is to talk informally to the CLI agent and ask it to “say back to me what you understood”, and it almost always produces a nice clean and clear version. This simultaneously works as confirmation of its understanding and also as a sort of spec which likely helps keep the agent on track.

UPDATE - just tried handy with Parakeet v3, and it works really well too, so I'll use this instead of VoiceInk for a few days. I just also discovered that turning on the "debug" UI with Cmd-shift-D shows additional options like post processing and appending trailing space.

show 1 reply
blutoot01/15/2026

I have dystonia which often stiffens my arms in a way that makes it impossible for me to type on a keyboard. TTS apps like SuperWhisper have proven to be very helpful for me in such situations. I am hoping to get a similar experience out of "Handy" (very apt maming from my perspective).

I do, however, wonder if there is a way all these TTS tools can get to the next level. The generated text should not be just a verbatim copy of what I just said, but depending on the context, it should elaborate. For example, if my cursor is actively inside an editor/IDE with some code, my coding-related verbal prompts should actually generate the right/desired code in that IDE.

Perhaps this is a bit of combining TTS with computer-use.

show 5 replies
ryanshrott01/25/2026

I've been going down this rabbit hole too. I ended up building DictaFlow (https://dictaflow.vercel.app/) because I needed something that specifically works in VDI/Citrix environments where clipboard pasting is blocked (I work in finance).

It uses a 'character-typing' method instead of clipboard injection, so it's compatible with pretty much anything remote. Also kept it super lightweight (<50MB RAM) for Windows users who don't want to run a full local server stack.

Cool to see Handy using the newer models—local voice tech is finally getting good.

kuatroka01/15/2026

Love it. I had been searching for STT app for weeks. Every single app was either paid as a one off or had a monthly subscription. It felt a bit ridiculous having to pay when it’s all powered by such small models on the back end. So I decided to build my own. But then I found “Handy” and it’s been a really amazing partner for me. Super fast, super simple, doesn’t get in my way and it’s constantly updated. I just love it. Thanks a lot for making it! Thanks a lot

P.S. The post processing that you are talking about, wouldn’t it be awesome.

frankdilo01/15/2026

This looks great! What’s missing for me to switch from something like Wispr Flow is the ability to provide a dictionary for commonly mistaken words (name of your company, people, code libraries).

show 3 replies
Barbing01/15/2026

Quick thoughts re: mentioned transcribers

Superwhisper — Been using it a long time. It's paid with a lifetime subscription available. Tons of features. Language models are built right in without additional charge. Solo dev is epic; may defer upgrades to avoid occasional bugs/regressions (hey, it's complex software).

Trying each for a few minutes:

Hex — Feels the leanest (& cleanest) free options mentioned for Mac in this thread.

Fluid Voice — Offers a unique feature, a real-time view of your speech as you talk! Superwhisper has this, but only with an online model. (You can't see your entire transcript in Fluid, though. The recording window view is limited to about one sentence at a time--of course you do see everything when you complete your dictation.)

Handy — Pink and cute. I like the history window. As far as clipboard handling goes, I might note that the "don't modify clipboard" setting is more of a "restore clipboard" setting. Though it doesn't need as many permissions as Hex because it's willing to move clipboard items around a bit, if I'm not mistaken.

Note Hex seems to be upset about me installing all the others... lots of restarting in between installs all around. Each has something to offer.

---

Big shout out to Nvidia open-sourcing Parakeet--all of these apps are lightning fast.

Also I'm partial to being able to stream transcriptions to the cursor into any field, or at least view live like Fluid (or superwhisper online). I know it's complex b/c models transcribe the whole file for accuracy. (I'm OK with seeing a lower quality transcript realtime and waiting a second for the higher-quality version to paste at the end.)

mncharity01/15/2026

A cautionary user experience report. The default hotkey upon download is ctrl+space. Press to begin recording, release to transcribe and insert. Key-up on the space key constitutes hotkey release. If the ctrl key is still down when the insertion lands, the transcribed text is treated as ctrl characters. The test app was emacs. (x64 linux x11, with and without xdotool)

PhilippGille01/15/2026

Has anyone compared this with https://github.com/HeroTools/open-whispr already? From the description they seem very similar.

Handy first release was June 2025, OpenWhispr a month later. Handy has ~11k GitHub stars, OpenWhispr has ~730.

show 2 replies
aucisson_masque01/15/2026

It’s incredibly fast on my MacBook m1 air and more accurate that the native speech to text.

The ui is well thought out, just the right amount of setting for my usage.

Incredible !

Btw, do you know what « discharging the model » does ? It’s set to never by default, tried to check if it has an impact on ram or cpu but it doesn’t seem to do anything.

show 1 reply
peterldowns01/15/2026

Huge fan! Parakeet v3 works great with it. I have used Monologue, Superwhisper, and Aqua, at various times in the past. But Handy is at least as good, and it's not an expensive subscription. I love that it runs locally, too. Strongly recommend!

Jack550001/15/2026

The Parakeet V3 model is really great!

show 1 reply
Jayakumark01/15/2026

Its great, i have been using it . Two requests though 1. iOS app 2. API option to use against meeting transcription or route audio from Mic .

show 1 reply
holtwick01/15/2026

FluidVoice for macOS is pretty handy as well. Open source under Apache License. https://altic.dev/fluid https://github.com/altic-dev/FluidVoice

show 1 reply
unutranyholas01/15/2026

https://hex.kitlangton.com/ is good

llarsson01/15/2026

A question because I'm not using speech-to-text, but find it intriguing (especially since it's now possible to do locally and for free).

How have your computing habits changed as a result of having this? When do you typically use this instead of typing on the keyboard?

show 2 replies
dumbmrblah01/15/2026

I just set this up today. I had Whispering app set up on my Windows computer, but it really wasn't working well on my Ubuntu computer that I just set up. I found Handy randomly. It was the last app I needed to go Linux full-time. Thank you!

wi5eif6E01/15/2026

This looks and works great! A settings option to keep no recording history at all would be terrific.

show 1 reply
vladstudio01/15/2026

Use it daily. Looks and works great.

mrroryflint01/15/2026

On a M4 Macbook Air, there was enough lag to make it unusable for me. I hit the shortcut and start speaking but there was always a 1-2sec delay before it would actually start transcribing even if the icon was displayed.

show 3 replies
miniwark01/15/2026

Did this thing (or open-whispr) work well with other languages than english ?

show 2 replies
erelong01/15/2026

WhisperTux on linux worked ok, curious how Handy compares: https://github.com/cjams/whispertux

qprofyeh01/15/2026

As a Mac user, am I missing something? macOS has Dictation built-in, when you short press F5 it should start transcribing your spoken words into text in real time. It even does non-English languages.

show 2 replies
walthamstow01/15/2026

Nice. I spent most of Christmas vibe coding with Google Antigravity with one hand while holding a sleeping baby in the other. MacOS built in dictation is OK, but struggles with technical language.

chainmail202901/15/2026

There's a slightly awkward naming overlap with an existing product.

show 3 replies
bn-usd-mistake01/15/2026

Does anyone have a similar mobile application that works locally and is not too expensive? Mostly looking to transcribe voice messages sent over Signal which does not offer this OOTB

show 3 replies
jborichevskiy01/15/2026

Big Handy fan!

oybng01/15/2026

On Windows this depends on webview2, which the installer attempts to download. No mention of this requirement in the readme. It's a shame this software isn't portable

mnmalst01/15/2026

This is really cool. Works out of the box and I'm typing this using handy.

Is there any way to execute commands directly on Linux?

Also a feature to edit or correct already typed text would be really great.

skor01/15/2026

This is so handy, thank you very much. Good work!!

dotancohen01/15/2026

Looks interesting. Why does it need a GUI at all?

show 6 replies
swordsith01/16/2026

from the read-me, 'Handy isn't trying to be the best speech-to-text app—it's trying to be the most forkable one.' Why cant we write a readme without using generative AI, seriously, it's not that hard. :<

ekjhgkejhgk01/15/2026

Explain to me why a speech-to-text app has 50% of its code in typescript...?

show 2 replies
Dnguyen01/15/2026

Would be nice if the output can be piped directly into Claude Code.

laylower01/15/2026

Is it deployed locally or does it send data to your servers?

show 1 reply
fittingopposite01/15/2026

Is there any good android app featuring parakeet v3?

show 1 reply
blutoot01/15/2026

Crashes on Tahoe 26.3 Betq 1 :(

show 1 reply
sirjaz01/15/2026

This is great, and I love that this is not another webapp

atay12301/16/2026

[dead]

olya_pllkh01/16/2026

[dead]