Hacker News

monsieurbanana yesterday at 8:35 PM

Are you saying that some models will take 100x more tokens than others (models in the same ballpark) for the same task? Is the 100x a real measured figure or just a made-up number to illustrate a point?


Replies

simpaticoder yesterday at 9:10 PM

With thinking models, yes, 100x is not just possible but probable. You get charged for the intermediate thinking tokens even if you don't see them (which is the case for Grok, for example). And even if you do see them, they won't necessarily add value.
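
To make the arithmetic concrete, here is a rough sketch of how hidden thinking tokens can inflate a bill. Everything in it is an assumption for illustration, not a measurement: the price, the token counts, the `billed_cost` helper, and the premise that thinking tokens are billed at the same rate as visible output tokens.

```python
def billed_cost(visible_output_tokens, thinking_tokens, price_per_million):
    """Dollar cost when thinking tokens are billed like output tokens.

    Assumption: both kinds of token are charged at the same rate.
    """
    billed_tokens = visible_output_tokens + thinking_tokens
    return billed_tokens * price_per_million / 1_000_000


# Hypothetical numbers, chosen only to illustrate the 100x case.
PRICE_PER_MILLION = 10.0   # made-up $ per 1M output tokens
VISIBLE_TOKENS = 200       # the answer you actually see
THINKING_TOKENS = 19_800   # intermediate tokens you may never see

plain = billed_cost(VISIBLE_TOKENS, 0, PRICE_PER_MILLION)
thinking = billed_cost(VISIBLE_TOKENS, THINKING_TOKENS, PRICE_PER_MILLION)
ratio = thinking / plain

print(f"non-thinking: ${plain:.4f}")
print(f"thinking:     ${thinking:.4f}")
print(f"ratio:        {ratio:.0f}x")
```

The visible answer is the same length in both cases; only the billed total changes.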

datadrivenangel today at 1:10 PM

The GPT 5 models use ~10x more tokens depending on the reasoning settings.