This is a little damning of the way Google does things honestly.
>When an app runs on a single machine, you can often trace an error by scrolling through a log file. But when it runs across 50 microservices, that single request gets scattered into a chaotic firehose of disconnected events.
Yep this is about Google. It's painful for humans to debug and it's also an extremely bespoke issue to deal with. No one else has quite the same level of clusterfuck and there's going to be no training for LLMs on this.
It's bespoke to debug across multiple services?
This seems like typical work in any business that isn't trivial.
isn't that what trace IDs are for?