Hacker News

strongpigeon · yesterday at 6:32 PM · 3 replies

It's interesting that they charge more for the >200k token window, even though the benchmark score seems to drop significantly past that point. That's judging from the Long Context benchmark score they posted, though perhaps I'm misunderstanding what it implies.


Replies

_heitoo · yesterday at 11:36 PM

It makes sense in scenarios where a model needs >200k tokens to answer a single prompt. You're shackled to a single session, and if the model hits compaction limits, it'll get lobotomized and give a shitty answer, so higher limits, even with degraded performance, are still an improvement.
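
A minimal sketch of that tradeoff, assuming tiktoken's o200k_base encoding approximates this model's tokenizer (the thread doesn't say which tokenizer it uses, and the 200k boundary is from the thread, not official docs): count the prompt's tokens to see whether it fits in the standard window at all, or whether the extended tier is the only option.

    # Sketch, not official behavior: o200k_base is an assumption
    # about the tokenizer; 200k is the boundary discussed in the thread.
    import tiktoken

    STANDARD_WINDOW = 200_000

    def needs_extended_context(prompt: str) -> bool:
        enc = tiktoken.get_encoding("o200k_base")
        n_tokens = len(enc.encode(prompt))
        # A prompt that alone exceeds the standard window can't be split
        # across turns without compaction losses, so the extended tier,
        # even with degraded benchmark scores, beats a truncated answer.
        return n_tokens > STANDARD_WINDOW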

Tiberium · yesterday at 6:57 PM

They don't actually seem to charge more for >200k tokens on the API: neither OpenRouter nor OpenAI's own API docs mention increased pricing for >200k context for GPT-5.4. I think the 2x usage-limit consumption for higher context is specific to using the model through a Codex subscription.
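
If that reading is right, the cost shows up as quota consumption rather than a per-token price. A hedged sketch of how a 2x usage multiplier might be accounted: the multiplier is taken from this comment, while the function name, the accounting scheme, and the 400k extended-window figure are all hypothetical.

    # Hypothetical sketch: 2x multiplier is from the comment above;
    # the extended-window size and this accounting scheme are assumptions.
    STANDARD_WINDOW = 200_000

    def quota_units(tokens_used: int, context_window: int) -> int:
        multiplier = 2 if context_window > STANDARD_WINDOW else 1
        return tokens_used * multiplier

    # The same 150k-token request counts double against a subscription's
    # limits when the extended context window is enabled.
    print(quota_units(150_000, context_window=400_000))  # 300000
    print(quota_units(150_000, context_window=200_000))  # 150000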

simianwords · yesterday at 6:43 PM

[flagged]
