Not a single mention of prompt caching in this article, which is a massive benefit of append-only co...

theowaway213456 • today at 2:01 AM • 2 replies • view on HN

Not a single mention of prompt caching in this article, which is a massive benefit of append-only context.

Replies

If it were, I can in theory see situations where improving content cleanliness is worth blowing away the KV cache.

But I absolutely can't see how feeding the entire context into a more expensive model multiple times per task, just to propose context edits that might indirectly help, could ever be worthwhile.

jauntywundrkind • today at 3:56 AM

Cost wise yes, but in terms of getting the correct best work done? Meh, not helpful!

I think more what's missing here is the comparison of different tries, from the same head. And there prompt caching does help!

alt Hacker News

Replies