Thanks, already suspected as much. Also gives context to the other comment here that says it is basi...

sigmoid10 • today at 8:06 AM • 1 reply • view on HN

Thanks, already suspected as much. Also gives context to the other comment here that says it is basically equivalent in accuracy to Qwen3.5-4B. Essentially seems to be a very good quantization of that model, not a new BitNet.

Replies

yorwba • today at 8:22 AM

It's a good-per-byte-but-not-in-absolute-terms quantization of Qwen3-8B that's comparable in accuracy to Qwen3.5-4B at 4-bit quantization (which makes the 4B model larger in terms of storage, though the lower number of parameters and hybrid attention give it a speed advantage if you're not bottlenecked on memory bandwidth for the model weights.)

alt Hacker News

Replies