logoalt Hacker News

miki123211yesterday at 10:33 PM1 replyview on HN

Anthropic's model deployments for Claude Code are likely optimized for Claude Code. I wouldn't be surprised if they had optimizations like sharing of system prompt KV-cache across users, or a speculative execution model specifically fine-tuned for the way Claude Code does tool calls.

When setting your token limits, their economics calculations likely assume that those optimizations are going to work. If you're using a different agent, you're basically underpaying for your tokens.


Replies

echelonyesterday at 10:49 PM

- OR - it's about lock-in.

Build the single pane of glass everyone uses. Offer it under cost. Salt the earth and kill everything else that moves.

Nobody can afford to run alternative interfaces, so they die. This game is as old as time. Remember Reddit apps? Alternative Twitter clients?

In a few years, CC will be the only survivor and viable option.

It also kneecaps attempts to distill Opus.

show 3 replies