logoalt Hacker News

anon373839yesterday at 8:43 AM0 repliesview on HN

That's not what consumes the most memory at scale. The KV caches are per-user.