The power-constrained part of compute is data movement, not the elementary arithmetic per se. Anywa...

zozbot234 • today at 8:35 AM • 0 replies • view on HN

The power-constrained part of compute is data movement, not the elementary arithmetic per se. Anyway, it's very possible to tweak the underlying design to increase throughput a lot for any given power budget at the cost of high latency. This seems especially useful for training workloads where we don't really care about latency as much.

alt Hacker News