I played, more than once, a few of the sound snippets. I think the Shazam "findings" are highly inaccurate. Fun project nonetheless!
walz, could you write more about the setup, maybe to propitiate others to replicate it in other cities?
Same, although I know Shazam does most of its work on very high frequencies so it’s possible we’re not able to hear the part that got matched.
The “Not Like Us” snippet (09/29 2:43pm) is easily recognizable though. And “Rockabye” can be heard at 3:05pm.
I've listened to a bunch of the snippets and you can usually just barely hear the sound in the background. Which makes me think Shazam is very accurate. I really should read more about how Shazam's algorithm works, because it feels like magic.
The phone records 10 minute chunks of audio at a time, in airplane mode. Every 10 minutes, airplane mode is turned off and the audio is uploaded to a server. The server then splits the audio into 15 second overlapping chunks, and each is passed to Shazam's API (no official API, but someone reverse engineered it and made a great Python package). This setup is super power efficient! The phone dips down to a minimum 70% percent battery by the early morning.