I'm using OpenRouter for GLM5.2, but if there's a cheaper option out there I'd love to know about it!
In the few days I've been using it, my expenses have been higher than prorating my Claude subscription to 20 working days per month.
My experience with GLM5.2 is that it doesn't overthink nearly as much as Claude Code, has better and far more concise responses (I'm so siiiiick of 10 paragraph Claude babble trying to fill out some sort of answer length target by going on tangents I'm uninterested in... I'm sure that performs better on whatever eval they're doing, but apparently their evals don't include SNR?)
I think that there are some subscriptions to go by. Z.ai subscription might still be interesting. I once haggled with kimi to get it for 1$ per month. I can only help in providing pointers:
If you wish to go Non-API but rather subscription route: Z.Ai subscription/ Kimi subscription / MiniMax subscriptions are good. You could also take a look at ollama subscription and opencode subscriptions.
If you wish to go API route: Deepseek v4 pro /mimo v2.5 pro models are comparably good if your work can do that. Codex for all its failure and for as much respect that I had within Anthropic when they had fought against the govt. which Anthropic is slowly losing again by doing some pretty dystopian actions again so Codex subscription might make sense as well.
It depends on multiple things but hopefully i am able to provide some interesting things
If you wish to run models locally, unless you are specifically buying gigs for running them locally which is almost always about privacy rather than costs, then you are always better off with qwen models so if you got a 64-128GB laptop for example. You could run Qwen models and see where things go.
Hope this helps ya!