Hacker News

singularity2001, yesterday at 11:38 PM

Why are there so few 32, 64, 128, 256, or 512 GB models that could run on current consumer hardware? And why is the maximum RAM on the Mac Studio M4 128 GB?


Replies

eldenring, today at 1:59 AM

The only real benefit of running models locally is privacy, which 99.9% of people don't care about. Almost all serving metrics (cost, throughput, TTFT) are better with large GPU clusters. Latency is usually hidden by prefill cost.

jameslk, today at 12:04 AM

128 GB should be enough for anybody (just kidding). I hope the M5 Max will have higher RAM limits.
