We noticed this two weeks ago where we found some of our requests are unexpected took more tokens than measured by count_tokens call. At the end they were Anthropic's A/B testing routing some Opus 4.6 calls to Opus 4.7.
https://matrix.dev/blog-2026-04-16.html (We were talking to Opus 4.7 twelve days ago)