Hacker News

davidcollantes 09/30/2024

I played, more than once, a few of the sound snippets. I think the Shazam "findings" are highly inaccurate. Fun project nonetheless!

walz, could you write more about the setup, maybe to encourage others to replicate it in other cities?


Replies

walz 09/30/2024

I've listened to a bunch of the snippets and you can usually just barely hear the sound in the background, which makes me think Shazam is very accurate. I really should read more about how Shazam's algorithm works, because it feels like magic.

The phone records audio in 10-minute chunks, in airplane mode. Every 10 minutes, airplane mode is turned off and the audio is uploaded to a server. The server then splits the audio into overlapping 15-second chunks, and each is passed to Shazam's API (there is no official API, but someone reverse engineered it and made a great Python package). This setup is super power efficient! The phone's battery dips to a minimum of 70% by the early morning.
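For anyone wanting to replicate the server-side step, here is a minimal sketch of how the overlapping windows could be computed. The function name `chunk_windows` and the 7.5-second step (a 50% overlap) are assumptions for illustration; the amount of overlap isn't stated above. It only produces time offsets. Slicing the audio and sending each slice to the recognition package are left out.

```python
def chunk_windows(total_seconds, chunk_seconds=15.0, step_seconds=7.5):
    """Return (start, end) offsets, in seconds, of overlapping windows.

    step_seconds is an assumed value: the original setup only says the
    15-second chunks overlap, not by how much.
    """
    if chunk_seconds <= 0 or step_seconds <= 0:
        raise ValueError("chunk_seconds and step_seconds must be positive")
    if step_seconds >= chunk_seconds:
        raise ValueError("step_seconds must be smaller than chunk_seconds to overlap")

    windows = []
    start = 0.0
    while start < total_seconds:
        # Clamp the final window to the end of the recording.
        end = min(start + chunk_seconds, total_seconds)
        windows.append((start, end))
        if end >= total_seconds:
            break
        start += step_seconds
    return windows


# One 10-minute upload is 600 seconds of audio.
windows = chunk_windows(600)
print(len(windows), windows[:2], windows[-1])
```

A smaller step means more lookups per upload but less chance that a song falls awkwardly across a chunk boundary.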

trainyperson 09/30/2024

Same, although I know Shazam does most of its work on very high frequencies so it’s possible we’re not able to hear the part that got matched.

The “Not Like Us” snippet (09/29 2:43pm) is easily recognizable though. And “Rockabye” can be heard at 3:05pm.
