To be clear they weren’t banned from Claude usage, they were required to use the API and API rates rather than Claude Max tokens.
Claude code uses a bunch if best practices to maximize cache hit rate. Third party harnesses are hit or miss, so often use a lot more tokens for the same task.
nah this doesn't explain it.
most of the users of those third party harnesses care just as much about hitting cache and getting more usage.