Worked on some features at open reader, a local-first PDF TTS reader that highlights the words spoken and uses the excellent local kokoro tts engine.
Got fed up with web tech, it's so slow and clunky, so made my own version in python and qt. I changed the design to be based on a doclayout llm, so you can skip or include things like tables and references easily.
It now works so beautifully fast, it's code is readable and simple, no apis or multiple services. Just a qt app, some local llms that can run on a decent cpu and word-leven highlighting and playback selection.
https://github.com/thepycoder/projectwhy-tts
I can listen to papers now!