logoalt Hacker News

swyxyesterday at 6:18 PM1 replyview on HN

why would chip affect token quantity. this is all models.


Replies

louiereedersonyesterday at 6:32 PM

Chip costs strongly impact the economics of model serving.

It is entirely plausible to me that Opus 4.7 is designed to consume more tokens in order to artificially reduce the API cost/token, thereby obscuring the true operating cost of the model.

I agree though, I chose poor phrasing originally. Better to say that GB200 vs Tranium could contribute to the efficiency differential.