logoalt Hacker News

cookmeplox10/10/20242 repliesview on HN

Thanks! I've been meaning to write up a post that talks about some of the specific tricks we're using. A couple big ones:

- Heavy use of Cloudflare Workers to cache ~95% of logged-out pageviews, with a particular focus on doing a lot of edge-side modifications to minimize cache fragmentation

- Using the MediaWiki jobrunners to repopulate the parser cache before pageviews are requested, so even when pageviews hit the server, there's a high chance that the core contents have already been computed somewhere

- I realized that MediaWiki latency is usually dominated by I/O wait time. For example, some pageviews require thousands of synchronous database/redis cache reads, so the difference between 0.5ms lookup and 0.1ms lookup adds up. So we colocated more of those caches on the same physical machines as the webservers that were reading them, which on average dropped latency by ~40%


Replies

starkparker10/10/2024

Is there a RSS feed on the WG blog? I couldn't track one down, and it looks like a Jekyll site, so I'm not sure if there is one. I don't want to miss that post.

show 1 reply
renewiltord10/10/2024

Would love to read that post. Thank you for these tips. I’ll subscribe to your feed and wait for it.