I have always been dubious of this because:
Someone would have noticed if all the phones on their network started streaming audio whenever a conversation happened.
It would be really expensive to send, transcribe and then analyze every single human on earth. Even if you were able to do it for insanely cheap ($0.02/hr) every device is gonna be sending hours of talking per day. Then you have to somehow identify "who" is talking because TV and strangers and everything else is getting sent, so you would need specific transcribers trained for each human that can identify not just that the word "coca-cola" was said, but that it was said by a specific person.
So yeah if you managed to train specific transcribers that can identify their unique users output and then you were willing to spend the ~0.10 per person to transcribe all the audio they produce for the day you could potentially listen to and then run some kind of processing over what they say. I suppose it is possible but I don't think it would be worth it.
> Someone would have noticed if all the phones on their network started streaming audio whenever a conversation happened.
You don't have to stream the audio. You can transcribe it locally. And it doesn't have to be 100% accurate. As for user identify, people have mentioned it on their phones which almost always have a one-to-one relationship between user and phone, and their smart devices, which are designed to do this sort of distinguishing.
I have a weird and unscientific test, and at the very least it is a great potential prank.
At one point I had the misfortune to be the target audience for a particular stomach churning ear wax removal add.
I felt that suffering shared is suffering halved, so decided to test this in a park with 2 friends. They pulled out their phones (an Android and a IPhone) and I proceeded to talk about ear wax removal loudly over them.
Sure enough, a day later one of them calls me up, aghast, annoyed and repelled by the add which came up.
This was years ago, and in the UK, so the add may no longer play.
However, more recently I saw an ad for a reusable ear cleaner. (I have no idea why I am plagued by these ads. My ears are fortunately fine. That said, if life gives you lemons)
who says you need to transcribe everything you hear? You just need to monitor for certain high-value keywords. 'OK, Google' isnt the only thing a phone is capable of listening for.
Google literally just settled for $68m about this very issue https://www.theguardian.com/technology/2026/jan/26/google-pr...
> Google agreed to pay $68m to settle a lawsuit claiming that its voice-activated assistant spied inappropriately on smartphone users, violating their privacy.
Apple as well https://www.theguardian.com/technology/2025/jan/03/apple-sir...