logoalt Hacker News

Rotary GPU: Exploring Local Execution for Large MoE Models Under Limited VRAM

26 pointsby dryarzegyesterday at 9:05 PM4 commentsview on HN

Comments

martinaldtoday at 1:18 AM

Why is this a paper? It's just using the n-cpu-moe option on llama.cpp? What am I missing here?

show 2 replies
sandworm101today at 1:06 AM

Um, doesn't the 4060 laptop card have the ability to share system memory?

Wait... My mistake. Google AI says the 4060 mobile can access system memory but tech sheets say no.