logoalt Hacker News

PhilippGilleyesterday at 3:45 PM2 repliesview on HN

z.ai itself, or Novita fow now, but others will follow soon probably

https://openrouter.ai/z-ai/glm-4.7-flash/providers


Replies

sdrinfyesterday at 8:39 PM

Note: I strongly recommend against using Novita -their main gig is serving quantized versions of the model to offer it for cheaper / at better latency; but if you ran an eval against other providers vs novita, you can spot the quality degradation. This is nowhere marked, or displayed in their offering.

Tolerating this is very bad form from openrouter, as they default-select lowest price -meaning people who just jump into using openrouter and do not know about this fuckery get facepalm'd by perceived model quality.

epolanskiyesterday at 3:52 PM

Interesting, it costs less than a tenth than Haiku.

show 1 reply