Owning the client gives them full control over which model handles which query, plus prompt caching, rate limiting and lots more. So they can drive massive savings for the ~same output compared with just giving unrestricted access to the API.
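To make the routing point concrete, here is a rough sketch of what client-side model selection could look like. Everything in it is an assumption for illustration: the tier names, the prices, the keyword heuristic and the function names are all made up, and nothing here is taken from how any real client works.

```python
# Hypothetical model tiers with made-up prices (USD per million input tokens).
MODELS = {
    "small": 1.0,
    "large": 15.0,
}

# Hypothetical keywords that hint a query needs the larger model.
HARD_MARKERS = ("refactor", "debug", "design", "prove")


def pick_model(query: str) -> str:
    """Send long or hard-looking queries to the large tier, the rest to the small one."""
    is_hard = len(query) > 200 or any(m in query.lower() for m in HARD_MARKERS)
    return "large" if is_hard else "small"


def estimated_cost(queries, tokens_per_query=1000):
    """Return (routed_cost, unrestricted_cost) in USD for a batch of queries.

    'Unrestricted' assumes every query goes to the large tier, as it might
    if users called the API directly and always picked the biggest model.
    """
    scale = tokens_per_query / 1_000_000
    routed = sum(MODELS[pick_model(q)] for q in queries) * scale
    unrestricted = MODELS["large"] * len(queries) * scale
    return routed, unrestricted


if __name__ == "__main__":
    batch = ["rename this variable", "debug the failing test"]
    routed, unrestricted = estimated_cost(batch)
    print(f"routed: ${routed:.4f}  unrestricted: ${unrestricted:.4f}")
```

The same decision could of course sit behind the API instead; the sketch only shows that a client which sees every query is in a position to make it.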
Wouldn’t most of the savings happen on the server side anyway? I would be very surprised if Claude Code does those on the client side.