Any cloud vendor offering this model? I would like to try it.
We don't have lot of GPUs available right now, but it is not crazy hard to get it running on our MI300x. Depending on your quant, you probably want a 4x.
ssh admin.hotaisle.app
Yes, this should be made easier to just get a VM with it pre-installed. Working on that.
The model literally came out less than a couple hours ago, it's going to take people a while in order to tool it for their inference platforms.
z.ai itself, or Novita fow now, but others will follow soon probably
https://openrouter.ai/z-ai/glm-4.7-flash/providers