logoalt Hacker News

syntaxingtoday at 12:59 AM5 repliesview on HN

While Anthropic can choose whatever tool uses their api or subscription but I never fully understood what they gain from having the subscription explicitly only work for claude code. Is the issue that it disincentivizes the use of their API?


Replies

Aurornistoday at 1:53 AM

It’s basic market segmentation.

They gave Claude Code a discount to make it work as a product.

The API is priced for all general purpose usage.

They never sold the Claude Code endpoint as a cheaper general purpose API. The stories about “blocking OpenCode” are getting kind of out of hand because they’d block any use of the Claude Code endpoint that wasn’t coming from their Claude Code tool.

drakenottoday at 1:00 AM

Perhaps concentrated use of Claude Code increases their perceived market value.

It also perhaps tries to preserve some moat around their product/service.

show 1 reply
_boffin_today at 1:03 AM

Are they ZDR with prompts and completions and possibly rely on usage statistics from their CLI to infer how people are using it?

show 1 reply
paxystoday at 1:05 AM

Owning the client gives them full control over which model to use for which query, prompt caching, rate limiting and lots more. So they can drive massive savings for the ~same output over just giving unrestricted access to the API.

show 1 reply
ankit219today at 1:04 AM

The issue is that claude code is cheap because it uses API's unused capacity. These kind of circumventions hurt them both ways, one they dont know how to estimate api demand, and two, the nature of other harnesses is more bursty (eg: parallel calls) compared to claude code, so it screws over other legit users. Claude code very rarely makes parallel calls for context commands etc. but these ones do.

re the whole unused capacity is the nature of inference on GPUs. In any cluster, you can batch inputs (ie takes same time for say 1 query or 100 as they can be parallelized) and now continuous batching[1] exists. With API and bursty nature of requests, clusters would be at 40%-50% of peak API capacity. Makes sense to divert them to subscriptions. Reduces api costs in future, and gives anthropic a way to monetize unused capacity. But if everyone does it, then there is no unused capacity to manage and everyone loses.

[1]: https://huggingface.co/blog/continuous_batching

show 2 replies