Hacker News

tcdent, yesterday at 8:01 PM

We're not yet at the point where a single PCIe device will get you anything meaningful; IMO 128 GB of RAM available to the GPU is essential.

So while you don't need a ton of compute on the CPU, you do need the ability to address multiple PCIe lanes. A relatively low-spec AMD EPYC processor is fine if the motherboard exposes enough lanes.
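
A rough sketch of the arithmetic behind this, assuming every card gets a full x16 link and using illustrative per-card VRAM sizes (neither figure comes from the comment above, and `lanes_needed` is a hypothetical helper):

```python
import math

def lanes_needed(target_vram_gb, vram_per_card_gb, lanes_per_card=16):
    """Return (cards, lanes) needed to reach a total VRAM target.

    Assumes each card is wired at full width (x16 by default); real boards
    may run cards at x8 or share lanes through a switch.
    """
    cards = math.ceil(target_vram_gb / vram_per_card_gb)
    return cards, cards * lanes_per_card

# Illustrative card sizes only, not recommendations.
for per_card in (24, 32, 48):
    cards, lanes = lanes_needed(128, per_card)
    print(f"{per_card} GB cards: {cards} cards, {lanes} lanes")
```

The point is only that the lane count scales with the number of cards, which is why the platform's lane budget matters more than CPU compute here.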


Replies

skhameneh, yesterday at 8:24 PM

There is plenty that can run within 32/64/96 GB of VRAM. IMO models like Phi-4 are underrated for many simple tasks. Some quantized Gemma 3 variants are quite good as well.

There are larger/better models as well, but those tend to really push the limits of 96 GB.

FWIW, when you start pushing into 128 GB+, the ~500 GB models really start to become attractive, because at that point you probably want just a bit more out of everything.
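
For a back-of-the-envelope check of what fits where, here is a minimal sketch that estimates weight memory from parameter count and quantization width. The parameter counts, the 1.2x overhead factor for KV cache and activations, and the helper names are all assumptions for illustration, not figures from this thread:

```python
def weight_gb(params_billions, bits_per_weight):
    """Approximate weight storage in GB (10^9 bytes): params * bits / 8."""
    return params_billions * bits_per_weight / 8

def fits(params_billions, bits_per_weight, vram_gb, overhead=1.2):
    """Rough fit check; `overhead` is an assumed allowance for KV cache etc."""
    return weight_gb(params_billions, bits_per_weight) * overhead <= vram_gb

# Hypothetical model sizes (billions of parameters) at common bit widths.
for params in (14, 27, 70):
    for bits in (16, 8, 4):
        print(f"{params}B @ {bits}-bit: ~{weight_gb(params, bits):.1f} GB of weights")
```

Actual usage depends on context length, runtime, and quantization format, so treat the output as a lower bound rather than a sizing guarantee.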

p1necone, yesterday at 11:13 PM

I'm holding out for someone to ship a GPU with DIMM slots on it.
