logoalt Hacker News

k8sToGotoday at 4:10 PM1 replyview on HN

It's not about outages. It's about the why. Hardware can fail. Bugs can happen. But to continue a roll out despite warning sings and without understanding the cause and impact is on another level. Especially if it is related to the same problem as last time.


Replies

udev4096today at 4:56 PM

And yet, it's always clownflare breaking everything. Failures are inevitable, which is widely known, therefore we build resilience systems to overcome the inevitable

show 1 reply