logoalt Hacker News

gpmtoday at 6:32 PM8 repliesview on HN

An interesting implication of this is that AI inference and training has a path to a ~3x hardware cost reduction (and maybe ~2x total cost reduction) without any technical innovation whatsoever, we just need to wait for dram supply to meet demand (either by manufacturing scaling or just waiting for the current rate of manufacturing to fill the demand spike).


Replies

radialstubtoday at 7:23 PM

The memory makers will not expand demand drastically. It is in the nature of their business to keep the market under-supplied, otherwise the following oversupply will kill them. Instead, supply is just rerouted from less profitable segments such as mobile and personal computing.

show 6 replies
overfeedtoday at 7:29 PM

It sure looks like Sam Altman's masterful gambit to corner the memory market has had unforeseen consequences.

show 1 reply
Waterluviantoday at 6:51 PM

What’s the lifespan/refurbishability of the capex elements like the “GPU” modules or even the DRAM soldered into them?

show 1 reply
andrepdtoday at 7:15 PM

I wonder if we will see an adoption of alternative floating point formats. IEEE floats are notoriously terrible at lower widths (<= 16 bits). Floating point formats such as posits do much better at 16 or 8 bits. If you could train at 16 bits per value instead of 32, and suffer a much smaller inaccuracy penalty than you would from IEEE32 to IEEE16...

cubefoxtoday at 7:10 PM

For some reason I still haven't heard any predictions on when new fabs will come online to meet the current demand. This shouldn't be too hard to find out, since the building time of fabs is very predictable process.

The difficult question is more whether foreseeable memory demand will remain at the current level, grow even further, or shrink again.

sandworm101today at 7:10 PM

Supply will not meet demand. What incentive do the handful of dram manufacturers have to end the party? This is what happens when legal monopolies finally win control. Dont't worry. The patents will expire in a few decades. Our grandkids will see DDR5 get cheap again. The system functions as intended.

show 3 replies
shevy-javatoday at 7:19 PM

> a path to a ~3x hardware cost reduction

Really?

How long do we have to wait until that ... cost reduction hits us?

show 1 reply
eldenringtoday at 7:08 PM

2-3x is completely dwarfed by the remaining improvements in training which is still in its infancy relatively

show 2 replies