This user has also done a bunch of good quants:

https://huggingface.co/unsloth/GLM-4.7-GGUF
Yes, I usually run Unsloth models, but you're linking to the big model (355B-A32B), which I can't run on my consumer hardware.
The flash model in this thread is more than 10x smaller (30B).
I find it hard to trust post-training quantizations. Why don't they run benchmarks to see how much performance degrades? It sketches me out, because automatically running a suite of benchmarks should be the easiest thing to do.
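Even a crude perplexity comparison against the full-precision weights would go a long way. Here's a rough sketch of what that automation could look like, assuming llama.cpp's llama-perplexity tool and a local wikitext-style eval file (the file names and the output-parsing regex are my guesses, not anything the quant uploaders actually ship):

    #!/usr/bin/env python3
    # Rough sketch (not the uploader's pipeline): compare perplexity across
    # GGUF quants using llama.cpp's llama-perplexity binary. The paths and
    # the output format being parsed are assumptions on my part.
    import re
    import subprocess
    from pathlib import Path

    EVAL_TEXT = Path("wiki.test.raw")        # hypothetical eval corpus
    QUANTS = [Path("model-F16.gguf"),        # hypothetical local quant files
              Path("model-Q8_0.gguf"),
              Path("model-Q4_K_M.gguf")]

    def perplexity(model: Path) -> float:
        """Run llama-perplexity on one quant and parse its final PPL estimate."""
        out = subprocess.run(
            ["llama-perplexity", "-m", str(model), "-f", str(EVAL_TEXT)],
            capture_output=True, text=True, check=True)
        # llama.cpp prints a line like "Final estimate: PPL = 6.1234 +/- ..."
        m = re.search(r"PPL = ([0-9.]+)", out.stdout + out.stderr)
        if m is None:
            raise RuntimeError(f"could not parse PPL for {model}")
        return float(m.group(1))

    baseline = perplexity(QUANTS[0])
    print(f"{QUANTS[0].name}: PPL {baseline:.4f} (baseline)")
    for q in QUANTS[1:]:
        ppl = perplexity(q)
        delta = 100 * (ppl - baseline) / baseline
        print(f"{q.name}: PPL {ppl:.4f} ({delta:+.2f}% vs baseline)")

Perplexity isn't a full benchmark suite, but it's cheap enough to run per quant and would at least flag the ones that fall off a cliff.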