Hacker News

shepherdjerred, yesterday at 10:58 PM

> 3. Tokens cost money even if you are currently being subsidized.

Keep in mind that those 50k memory tokens would likely be cached after the first run and thus significantly cheaper.


Replies

sarchertech, today at 4:16 AM

Caching has so many caveats. The cache expiration window is short. If you change a document in the context, it clears the cache, and if you change anything in the prompt prefix, it clears the cache. And there’s no reason to think that Anthropic will keep charging dramatically less for cached tokens in the future once they start trying to make a profit.
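
To make the caveats above concrete, here is a toy model of prefix-based prompt caching. It is only a sketch under stated assumptions, not any provider's actual implementation: the `PrefixCache` class, the `ttl_seconds` window, the `cached_discount` rate, and the per-token `price` are all hypothetical, and real services add details (minimum cacheable lengths, cache-write pricing) that are not modeled here.

```python
from dataclasses import dataclass, field


def shared_prefix_len(a: list[str], b: list[str]) -> int:
    """Count the leading tokens that are identical in both sequences."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


@dataclass
class PrefixCache:
    """Hypothetical cache: remembers only the most recent prompt."""
    ttl_seconds: float  # assumed expiry window, not a real provider value
    cached: list[str] = field(default_factory=list)
    stored_at: float = float("-inf")

    def lookup(self, prompt: list[str], now: float) -> int:
        """Return how many prompt tokens hit the cache, then store the prompt."""
        expired = now - self.stored_at > self.ttl_seconds
        hits = 0 if expired else shared_prefix_len(self.cached, prompt)
        self.cached, self.stored_at = list(prompt), now
        return hits


def request_cost(total: int, hits: int, price: float, cached_discount: float) -> float:
    """Uncached tokens pay full price; cached tokens pay a fraction of it."""
    return (total - hits) * price + hits * price * cached_discount


memory = [f"m{i}" for i in range(50)]  # stand-in for the "memory" tokens
cache = PrefixCache(ttl_seconds=300)

first = cache.lookup(memory + ["q1"], now=0)                      # cold cache
second = cache.lookup(memory + ["q2"], now=60)                    # same prefix
edited = cache.lookup(["changed"] + memory[1:] + ["q3"], now=120)  # prefix edited
late = cache.lookup(["changed"] + memory[1:] + ["q3"], now=500)    # window expired
```

In this model the second request reuses the whole memory prefix, while editing the first token or waiting past the window drops the hit count back to zero, so the discount only applies when the prefix is stable and requests arrive close together.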
