logoalt Hacker News

a_t48yesterday at 9:09 PM4 repliesview on HN

I got exactly this warning message yesterday, saying that it could use up a significant amount of my token budget if I resumed the conversation without compaction.


Replies

jhogendorntoday at 3:14 AM

Compaction wont save you, in fact calling compaction will eat about 3-5x the cold cache cost in usage ive found.

doubleunplussedtoday at 3:17 AM

I saw that too, but that's actually even worse on cache - the entire conversation is then a cache miss and needs to be loaded in in order to do the compaction. Then the resulting compacted conversation is also a cache miss.

You ideally want to compact before the conversation is evicted from cache. If you knew you were going to use the conversation again later after cache expiry, you might do this deliberately before leaving a session.

Anthropic could do this automatically before cache expiry, though it would be hard to get right - they'd be wasting a lot of compute compacting conversations that were never going to be resumed anyway.

onemoresoopyesterday at 9:31 PM

Im glad they chose to do that as opposed to hidden behavior changes that only confuse users more.

fhubyesterday at 9:24 PM

Really good to know. That should have made it into their update letter in point (2). Empowering the user to choose is the right call.