logoalt Hacker News

152334Htoday at 12:56 PM1 replyview on HN

but there is no trained 100b param model? "can run a 100B BitNet" is about the inference implementation, not about the existence of any such model


Replies

webXLtoday at 3:39 PM

I think they used a dummy model or else they would have linked to it. Just google '1-bit 100b model' and you'll only see references to this project without any download links.