It needs to transcribe podcasts and skip the ads! That would be a killer feature!

sciencesama • yesterday at 5:43 PM • 4 replies

Replies

I actually have a WIP cross-platform app that does exactly this. It is more generic, built around processing any audio/video with Whisper and integrating with OpenAI or local LLMs for summarization and other things, but I also added a podcast-specific ad-skipping feature. It's not yet as precise as something manual like SponsorBlock for YouTube, but I'd say it's about 85% accurate at the moment, depending on the models used.

xnx • yesterday at 5:50 PM

I wrote something that does this (in R, of all things)! I still need to polish it up a little and share it, though. It goes: podcast mp3 -> Whisper transcription -> Gemini Flash to identify ad blocks -> FFmpeg to replace ad blocks with a skip sound.
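For anyone curious what the middle of that pipeline could look like, here is a rough sketch in Python (not the R code mentioned above). It assumes the transcription step yields timestamped segments and the LLM step has already labeled each one as ad or not. The `Segment` type, the function names, and the `max_gap` threshold are all made up for illustration. Note that it cuts the ad blocks out rather than replacing them with a skip sound, which is a simpler variant.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds from the start of the episode
    end: float    # seconds from the start of the episode
    is_ad: bool   # label assumed to come from the LLM step


def merge_ad_blocks(segments, max_gap=2.0):
    """Merge ad-labeled segments that are at most max_gap seconds apart."""
    blocks = []
    for seg in sorted(segments, key=lambda s: s.start):
        if not seg.is_ad:
            continue
        if blocks and seg.start - blocks[-1][1] <= max_gap:
            # Close enough to the previous ad block: extend it.
            blocks[-1] = (blocks[-1][0], max(blocks[-1][1], seg.end))
        else:
            blocks.append((seg.start, seg.end))
    return blocks


def keep_intervals(ad_blocks, duration):
    """Return the (start, end) intervals that remain once ad blocks are cut."""
    keep, cursor = [], 0.0
    for start, end in ad_blocks:
        if start > cursor:
            keep.append((cursor, start))
        cursor = max(cursor, end)
    if cursor < duration:
        keep.append((cursor, duration))
    return keep


def ffmpeg_filter(keep):
    """Build a filter graph that trims each kept interval and concatenates them."""
    parts = [
        f"[0:a]atrim=start={s}:end={e},asetpts=PTS-STARTPTS[a{i}]"
        for i, (s, e) in enumerate(keep)
    ]
    labels = "".join(f"[a{i}]" for i in range(len(keep)))
    parts.append(f"{labels}concat=n={len(keep)}:v=0:a=1[out]")
    return ";".join(parts)


# Hypothetical labeled transcript for a two-minute episode.
segments = [
    Segment(0.0, 30.0, False),
    Segment(30.0, 45.0, True),
    Segment(45.5, 60.0, True),
    Segment(60.0, 120.0, False),
]
blocks = merge_ad_blocks(segments)
keep = keep_intervals(blocks, duration=120.0)
print(ffmpeg_filter(keep))
```

The resulting string would be passed to FFmpeg with `-filter_complex` and `-map "[out]"`. Merging nearby ad segments first avoids leaving half-second scraps of audio between two halves of the same ad read.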

n0vella • yesterday at 6:36 PM

Hi, I have a task to check the SponsorBlock API and see whether it can work for podcasts. On the other hand, I have some doubts, because I don't know if that could harm podcasters in any way. But I think something like SponsorBlock will eventually be a feature. PS: awesome what other people are doing with Whisper.

Armazon • yesterday at 5:46 PM

Something like SponsorBlock for non-YouTube videos would be great. Another killer feature for me would be automatic chapters when the podcast doesn't provide any.
