logoalt Hacker News

somesnmtoday at 12:15 PM4 repliesview on HN

Hasn't this been largely solved by auto-caching introduced recently by Anthropic, where you pass "cache_control": {"type": "ephemeral"} in your request and it puts breakpoints automatically? https://platform.claude.com/docs/en/build-with-claude/prompt...


Replies

philipp-gayrettoday at 12:27 PM

Looking at my own usage with claude code out of the box and nothing special around caching set up. For this month according to ccusage I have in tokens 0.2M input, 0.6M output, 10M cache create, 311M cache read for 322M total tokens. Seems to me that it caches out of the box quite heavily, but if I can trim my usage somehow with these kind of tools I'd love to know.

show 1 reply
stingraycharlestoday at 12:26 PM

Yes, it has, this is a non-problem, and even if it was a problem, an MCP server would most definitely be one of the worst ways to fix it.

gostsamotoday at 12:50 PM

It is answered in the FAQ.

ermistoday at 12:36 PM

[flagged]

show 1 reply