alt
Hacker News
generalizations
•
today at 4:50 PM
•
0 replies
•
view on HN
Presumably a deepswe benchmark, which IIRC puts GLM 5.2 between opus 4.8 and fable.