One thing I don't see often mentioned - OpenAI API's auto token caching approach results i...

QuadrupleA • today at 6:23 PM • 0 replies • view on HN

One thing I don't see often mentioned - OpenAI API's auto token caching approach results in MASSIVE cost savings on agent stuff. Anthropic's deliberate caching is a pain in comparison. Wish they'd just keep the KV cache hot for 60 seconds or so, so we don't have to pay the input costs over and over again, for every growing conversation turn.

alt Hacker News