logoalt Hacker News

epolanskiyesterday at 3:38 PM5 repliesview on HN

Any cloud vendor offering this model? I would like to try it.


Replies

PhilippGilleyesterday at 3:45 PM

z.ai itself, or Novita fow now, but others will follow soon probably

https://openrouter.ai/z-ai/glm-4.7-flash/providers

show 2 replies
latchkeyyesterday at 4:32 PM

We don't have lot of GPUs available right now, but it is not crazy hard to get it running on our MI300x. Depending on your quant, you probably want a 4x.

ssh admin.hotaisle.app

Yes, this should be made easier to just get a VM with it pre-installed. Working on that.

show 1 reply
xenayesterday at 3:41 PM

The model literally came out less than a couple hours ago, it's going to take people a while in order to tool it for their inference platforms.

show 1 reply