You are paying per token, but what you care about is token efficiency. If token efficiency has improved by as much as they claim it did (i.e. you need less tokens to complete a task successfully) all seems well.
If it uses half the tokens to complete a task, then doubling the cost is perfectly fine. But is that actually true?
Not for coding because it actually needs to read and write large files