logoalt Hacker News

sciencesamayesterday at 5:43 PM4 repliesview on HN

Need to transcribe and skip ads in the podcast ! That would be a killer feature !


Replies

jdprgmyesterday at 6:07 PM

I actually have a WIP cross platform app that does exactly this. It is more generic around processing any audio/video with whisper and integrating with openai or local llm's for summarization and other things but I also added a podcast specific ad skipping feature (it's not as perfect as something manual like sponsorblock for youtube yet but i'd say it's about 85% accuracy at the moment dependent on the models used)

show 1 reply
xnxyesterday at 5:50 PM

I wrote something that does this (in R of all things)! I still need to polish it up a little in share it though. It goes: podcast mp3 -> whisper transcription -> Gemini Flash to identify ad blocks -> FFMPEG to replace ad blocks with skip sound.

show 1 reply
n0vellayesterday at 6:36 PM

Hi, I have a task to check SponsorBlock API to see if this can work for podcasts. In the other hand I doubt a bit because I don't know if that could harm podcasters in any way. But I think something like SponsorBlock eventually will be a feature. PD, awesome idea other people are doing using Whisper.

show 1 reply
Armazonyesterday at 5:46 PM

Something like Sponsorblock for non youtube videos would be great. Another killer feature for me are automatic chapters if the podcast doesn't provide any