100 times more chips for equivalent memory, sure.

villgax • today at 3:53 PM • 3 replies • view on HN

Replies

Check the specs again. Per chip, TPU 7x has 192GB of HBM3e, whereas the NVIDIA B200 has 186GB.

While the B200 wins on raw FP8 throughput (~9000 vs 4614 TFLOPs), that makes sense given NVIDIA has optimized for the single-chip game for over 20 years. But the bottleneck here isn't the chip—it's the domain size.

NVIDIA's top-tier NVL72 tops out at an NVLink domain of 72 Blackwell GPUs. Meanwhile, Google is connecting 9216 chips at 9.6Tbps to deliver nearly 43 ExaFlops. NVIDIA has the ecosystem (CUDA, community, etc.), but until they can match that interconnect scale, they simply don't compete in this weight class.

➕ show 2 replies

croon • today at 4:08 PM

Ironwood is 192GB, Blackwell is 96GB, right? Or am i missing something?

NaomiLehman • today at 3:55 PM

I think it's not about the cost but the limits of quickly accessible RAM

alt Hacker News

Replies