Hacker News

jklmnopqrstuvw today at 6:19 PM (3 replies)

In my own experience, GLM-5.2 generally uses more tokens and is much slower.


Replies

pimeys today at 6:57 PM

I use GLM 5.2 Fast from Fireworks and it's very fast. Where are you using it from?

microtonal today at 6:21 PM

Which inference provider do you use? (Admittedly, I currently use K2.7 a lot more.)

james2doyle today at 6:24 PM

Tokens and speed are a factor, but does it require less back-and-forth to get things right? Being "fast and cheap but wrong" still has a cost that an otherwise "expensive and slow" exchange does not.