$12,000 for the base model is insane. I have an Apple M3 Max with 128GB RAM that can run 120B parame...

alexfromapex • today at 12:50 AM • 2 replies • view on HN

$12,000 for the base model is insane. I have an Apple M3 Max with 128GB RAM that can run 120B parameter models using like 80 watts of electricity at about 15-20 tokens/sec. It's not amazing for 120B parameter models but it's also not 12 grand.

Replies

Thaxll • today at 1:03 AM

M3 max tflops is tiny compared to the 12k box. It's not even comparable.

➕ show 2 replies

segmondy • today at 12:54 AM

it's for fools. i bought 160gb of vram for $1000 last year. 96gb of p40 VRAM can be had for under $1000. And it will run gpt-oss-120b Q8 at probably 30tk/sec

➕ show 1 reply

alt Hacker News

Replies