Here is the pricing per M tokens.

algorithm314 • yesterday at 5:05 PM • 2 replies • view on HN

Why is GLM 5 more expensive than GLM 4.7 even when using sparse attention?

There is also a GLM 5-code model.

I think it's likely more expensive because they have more activated parameters, which kind of outweighs the benefits of DSA?

l5870uoo9y • yesterday at 5:32 PM

It's roughly three times cheaper than GPT-5.2-codex, which in turn reflects the difference in energy cost between US and China.

➕ show 2 replies

alt Hacker News