logoalt Hacker News

zozbot234yesterday at 5:49 PM1 replyview on HN

It's a MoE model so I'd assume a cheaper MBP would simply result in some experts staying on CPU? And those would still have a sizeable fraction of the unified memory bandwidth available.


Replies

pitchedyesterday at 6:04 PM

I haven’t tried this myself yet but you would still need enough non-vram ram available to the cpu to offload to cpu, right? This is a fully novice question, I have not ever tried it.

show 1 reply