Hacker News

cbg0 yesterday at 6:32 PM

If it uses half the tokens to complete a task, then doubling the cost is perfectly fine. But is that actually true?


Replies

2001zhaozhao yesterday at 6:36 PM

This happens with every new model release though. The model makes fewer mistakes and spends less time fixing them, which reduces token usage for a task of the same difficulty. Almost any task other than straight boilerplate will benefit from this.

In the same vein, I would guess that Opus 4.7 is cheaper than 4.6 for most tasks, even though its tokenizer produces more tokens for a string of the same length.
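To make the arithmetic concrete, here is a rough sketch of the break-even calculation. Every number below is made up for illustration (the prices, the token counts, the tokenizer inflation factor, and the reduction in work are all assumptions, not real pricing or measurements), and the function names are hypothetical.

```python
def task_cost(price_per_mtok: float, tokens: float) -> float:
    """Dollar cost of one task: price per million tokens times tokens used."""
    return price_per_mtok * tokens / 1_000_000


def break_even_token_ratio(old_price: float, new_price: float) -> float:
    """Fraction of the old token count at which the new model costs the same."""
    return old_price / new_price


# Hypothetical figures, chosen only to illustrate the reasoning.
old_price = 10.0           # $ per million tokens, old model (assumed)
new_price = 20.0           # $ per million tokens, new model (assumed: doubled)
old_tokens = 200_000       # tokens the old model spends on the task (assumed)
tokenizer_inflation = 1.2  # new tokenizer emits 20% more tokens per string (assumed)
work_reduction = 0.4       # new model needs 40% of the text to finish (assumed)

# Tokens billed by the new model: less work, but more tokens per unit of text.
new_tokens = old_tokens * work_reduction * tokenizer_inflation

old_cost = task_cost(old_price, old_tokens)
new_cost = task_cost(new_price, new_tokens)

print(f"break-even token ratio: {break_even_token_ratio(old_price, new_price):.2f}")
print(f"old model: ${old_cost:.2f}, new model: ${new_cost:.2f}")
```

With a doubled price, the new model only comes out ahead if its billed token count, after tokenizer inflation, drops below half of the old one. Whether that holds for a given workload is an empirical question.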

jstummbillig yesterday at 7:20 PM

We'll find out!