logoalt Hacker News

lostmsulast Monday at 2:53 PM0 repliesview on HN

In large providers KV caches are the main bottleneck, no?