logoalt Hacker News

blopkeryesterday at 9:54 PM3 repliesview on HN

Nice! I really like how many variations on this idea are coming out. MacWhisper used to be great, but is kinda of a buggy mess now.

I'm making my own, for personal use. I did a survey of many and they all (that I could find) skip the fundamentals.

The major issues that I've run into:

- Crash recovery. Most of these apps are incredibly buggy and crash all the time, taking the recorded audio with them. Macwhisper is incredibly bad at this.

- Disk space. Many of these apps save wav files to disk. After a few hours of meetings, you may end up with gigabytes eaten.

- Microphone bleed. People don't always use headphones, the system mic will pick up the speaker sounds, causing duplicate (approximately) transcriptions.

I've yet to find a solution that handles all these correctly, let alone having high quality transcriptions.

Anyway, most of these apps are built around https://github.com/FluidInference/FluidAudio, if anyone is curious. Their readme has a big list of similar apps as well.


Replies

jv22222yesterday at 10:01 PM

Nice tip on FluidAudio that's the kind of thing I've been looking for. Thanks!

highmastdonyesterday at 10:46 PM

I’m using MacParakeet these days. If your language is supported, definitely give it a try. It’s much faster and lower footprint

Folcontoday at 12:47 AM

> I've yet to find a solution that handles all these correctly, let alone having high quality transcriptions.

Wait really? I honestly would have thought this was a solved problem by now, especially high quality transcriptions bit, just out of curiosity, is the problem that the quality isn't high enough?

show 1 reply