I think the subscription pricing exists because it’s a far more palatable way to bill people for day to day personal use.
I used Claude back when API per token pricing was the only option and it was bad for all the usual reasons pay-per-use sucks compared to flat billing: you’re constantly thinking about cost. Like trying to watch a Netflix video with a ticker in the corner counting up the cents you owe them.
I don’t understand your claim that they want people paying per token - the subscription is the opposite of that, and it also has upsides for them as a business since most people don’t saturate the usage limits, and the business gets to stuff a bunch of value-adds on a bundle offering which is generally a more lucrative and enticing consumer pricing model.
The cost difference is pretty staggering for the same usage. Being on the sub hacks your reward system to push you to be productive, legitimately hitting limits feels like a win, and you start looking for ways to max your utilization %. A lot of people get quite obsessive about it. The sub is 100% the innovation that makes Claude Code "work."
If the pay-per-use cost predictable enough, it’s less of an issue. That’s how electricity works and it’s fine.
The issue with Claude Code is it’s not at all obvious how any given task or query translates to cost. I was finding some days I spent very little and other days cost a fortune despite what seemed to me to be similar levels of usage.
People 100% want subscriptions in this space.
The alternative is AWS where you need to be a billing expert to keep costs locked at $20/month.
The bundle only works if it’s +EV for them. A lot of analyses (though not all - it’s complicated) say that the $200/mo bundle (and certainly the $20/mo bundle) costs more than that for most users, and the bundle is currently a loss leader. If so, then eventually prices will need to go up, and API per usage pricing will seem much more attractive.