logoalt Hacker News

johnfnyesterday at 5:33 PM7 repliesview on HN

That's a fun idea. How do you get the transcript into Claude Code (or whatever you use)? What transcription service do you use?


Replies

hn_throw2025yesterday at 5:43 PM

I'm not the person you're replying to, but I use Whispering connected to the whisper-large-v3-turbo model on Groq.

It's incredibly cheap and works reliably for me.

I have got it to paste my voice transcriptions into Chrome (Gemini, Claude, ChatGPT) as well as Cursor.

https://github.com/EpicenterHQ/epicenter

rgbrgbyesterday at 6:50 PM

I use Handy with Claude code. Nice to just have a key combo to transcribe into whatever has focus.

https://github.com/cjpais/Handy

show 1 reply
quinncomyesterday at 6:35 PM

I use Spokenly with local Parakeet 0.6B v3 model + Cerebras gpt-oss-120b for post-processing (cleaning up transcription errors and fixing technical mondegreens, e.g., `no JS` → `Node.js`). Almost imperceptible transcription and processing delay. Trigger transcription with right ⌥ key.

show 1 reply
hurturueyesterday at 5:56 PM

your OS might have a built in dictation thing. Google for that and try it before online services.

singhractoday at 2:57 AM

I use VoiceInk (needed some patches to get it to compile but Claude figured it out) and the Parakeet V3 model. It’s really good!

bogtogyesterday at 6:21 PM

There are a few apps nowadays for voice transcription. I've used Wispr Flow and Superwhisper, and both seem good. You can map some hotkey (e.g., ctrl + windows) to start recording, then when you press it again to stop, it'll get pasted into whatever text box you have open

Superwhisper offers some AI post-processing of the text (e.g., making nice bullets or grammar), but this doesn't seem necessary and just makes things a bit slower

elvin_dyesterday at 9:25 PM

made this tool to press double control to start and another ctrl to stop which copies to the cliboard

https://github.com/elv1n/para-speak/