From my own experience, GLM-5.2 generally cost more tokens and much more slow.

jklmnopqrstuvw • today at 6:19 PM • 3 replies • view on HN

Replies

I use GLM 5.2 Fast from Fireworks and its very fast. Where are you using it from?

Which inference provider do you use? (Admittedly, I currently use K2.7 a lot more currently.)

Tokens and speed are a factor but does it require less back and forth to get things right? Being "fast and cheap but wrong" still has a cost that an otherwise "expensive and slow" exchange does not

alt Hacker News

Replies