Hacker News

weird-eye-issue today at 12:42 AM

Yeah, just run an LLM with over 100 billion parameters on a CPU.


Replies

kristjansson today at 12:50 AM

200 GB is an unfathomable amount of main memory for a CPU

(With apologies for the snark.) Give gpt-oss-120b a try. It’s not fast at all, but it can generate on CPU.
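
For anyone who wants the arithmetic behind the 200 GB figure, here is a rough sketch. It assumes the number comes from 100 billion parameters stored at 16 bits each, and it counts weights only (no KV cache, activations, or runtime overhead). The function name `weight_memory_gb` and the list of bit widths are illustrative choices, not taken from any particular library.

```python
def weight_memory_gb(n_params: float, bits_per_param: float) -> float:
    """Approximate memory for model weights only, in decimal gigabytes.

    Assumption: every parameter is stored at the same bit width, and
    nothing besides the weights is counted.
    """
    return n_params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    n_params = 100e9  # "over 100 billion parameters", rounded down
    # Hypothetical storage widths, chosen only to show how the total scales.
    for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
        print(f"{label}: {weight_memory_gb(n_params, bits):.0f} GB")
```

Under those assumptions the weights alone come to about 200 GB at 16 bits and shrink in proportion as the bit width drops, which is why quantized releases are the ones people usually try on CPU.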