logoalt Hacker News

walz09/30/20244 repliesview on HN

I've listened to a bunch of the snippets and you can usually just barely hear the sound in the background. Which makes me think Shazam is very accurate. I really should read more about how Shazam's algorithm works, because it feels like magic.

The phone records 10 minute chunks of audio at a time, in airplane mode. Every 10 minutes, airplane mode is turned off and the audio is uploaded to a server. The server then splits the audio into 15 second overlapping chunks, and each is passed to Shazam's API (no official API, but someone reverse engineered it and made a great Python package). This setup is super power efficient! The phone dips down to a minimum 70% percent battery by the early morning.


Replies

jldugger09/30/2024

> I really should read more about how Shazam's algorithm works, because it feels like magic.

https://www.ee.columbia.edu/~dpwe/papers/Wang03-shazam.pdf

show 1 reply
gffrd09/30/2024

Holy cow.

Just clicked around and you're right: the Sep 29 5:19pm snippet detected "Celebration" by Kool and the Gang, and there's almost nothing there. But it's in there.

Had I not known what I was listening for, and been intentionally listening, there's zero chance I'd have picked up on it.

It does feel like magic.

callalex09/30/2024

The battery will live much longer if you run it from 80% down to 50%. There are some clever plugs you can get off the shelf if your phone doesn’t support setting this in software.

show 1 reply
rconti09/30/2024

will be interesting to see how it fares in the winter!

show 1 reply