Let us hope this only accelerates the proliferation of local models
It will. Moves like this will only lead to a drift of brains and talents to tweak & tune open harnesses and open models.
There is the undocumented 3rd option of simply shrugging and moving on without LLMs, you know, business as usual.
Serving barely useful GLM 5.2 costs what? $15k? Actually useful is like $50k? You’ll never recoup the cost unless you ‘locally’ means ‘inference provider is not the model provider’?