logoalt Hacker News

mathisfun123today at 3:07 AM1 replyview on HN

this is the pcmasterrace equivalent of being all upper body and with scrawny legs lol


Replies

tempoponettoday at 3:24 AM

It's fine for dense models where you need them in VRAM, less so for MoE where you're offloading layers to ram. But 32/32 is pretty good for both in the popular ~30b range right now.