Blog post and hugging face link are out.
See related thread: https://news.ycombinator.com/item?id=46977210
[1] https://z.ai/blog/glm-5
[2] https://huggingface.co/zai-org/GLM-5
Why did they have to tweak sampling parameters so much for the benchmarks? Looks like rerun hacking.
Why did they have to tweak sampling parameters so much for the benchmarks? Looks like rerun hacking.