Here is the pricing per M tokens. https://docs.z.ai/guides/overview/pricing
Why is GLM 5 more expensive than GLM 4.7 even when using sparse attention?
There is also a GLM 5-code model.
It's roughly three times cheaper than GPT-5.2-codex, which in turn reflects the difference in energy cost between US and China.
I think it's likely more expensive because they have more activated parameters, which kind of outweighs the benefits of DSA?