This is a quite old technique. The idea, as I understood it, was that lots of data at Google was sto...

why_only_15 • today at 7:40 AM • 2 replies • view on HN

This is a quite old technique. The idea, as I understood it, was that lots of data at Google was stored in triplicate for reliability purposes. Instead of fetching one, you fetched all three and then took the one that arrived first. Then you sent UDP packets cancelling the other two. For something like search where you're issuing hundreds of requests that have to resolve in a few hundred milliseconds, this substantially cut down on tail latency.

Replies

yvdriess • today at 7:47 AM

Tournament parallelism is the technical term IIRC.

100ms • today at 8:36 AM

Aha that makes more sense, I thought it was specifically to do with job scheduling from the description. You can do something similar at home as a poor man's CDN by racing requests to regionally replicated S3 buckets. Also magic eyeballs (ipv4/v6 race done in browsers and I think also for Quic/HTTP selection) works pretty much the same way

➕ show 1 reply

alt Hacker News

Replies