Hacker News

kokakiwi, yesterday at 4:11 PM

Headroom looks great for client-side trimming. If you want to tackle this at the infrastructure level, we built Edgee (https://www.edgee.ai) as an AI Gateway that handles context compression, caching, and token budgeting across requests, so you're not relying on each client to do the right thing.

(I work at Edgee, so biased, but happy to answer questions.)


Replies

gilles_oponono, yesterday at 6:39 PM

100% agree