Hacker News

gardnr, yesterday at 4:02 PM

The new model they just released has impressive benchmark results: https://huggingface.co/microsoft/bitnet-b1.58-2B-4T

Except on GSM8K and math...


Replies

xenonite, yesterday at 10:44 PM

Thanks, but where did you actually find the new model? The newest one seems to be 11 months old, from Apr 15, 2025.

naasking, yesterday at 4:44 PM

Thanks for the link. The GSM8K result actually leads the pack in that table, but math is indeed underwhelming. Qwen 2.5 is in the lead, but BitNet isn't far behind, takes 1/6th as much memory during inference, and was trained on less than 1/4 the number of tokens. Pretty cool.