Hacker News

HarHarVeryFunny today at 3:58 PM

Regardless of architecture (which is in any case essentially the same across LLMs), the computational needs of modern neural networks are fairly generic, centered on operations like matrix multiplication, which is exactly what the TPU provides. There is even TPU support for PyTorch (via the PyTorch/XLA backend), so it is not just a proprietary interface that Google uses internally.
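To illustrate the point that the heavy computation is generic, here is a minimal sketch (in NumPy, with made-up shapes) of single-head self-attention, the core LLM operation: nearly every expensive step is a plain matrix multiply, the primitive a TPU's systolic array accelerates.

```python
import numpy as np

def self_attention(x, w_q, w_k, w_v):
    """Single-head self-attention. Each heavy step is a matmul (@);
    only the softmax is elementwise."""
    q = x @ w_q                                   # matmul: project queries
    k = x @ w_k                                   # matmul: project keys
    v = x @ w_v                                   # matmul: project values
    scores = (q @ k.T) / np.sqrt(q.shape[-1])     # matmul: attention scores
    # Numerically stable softmax over each row (elementwise, cheap)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v                            # matmul: weighted values

rng = np.random.default_rng(0)
seq_len, d_model = 4, 8                           # toy dimensions
x = rng.standard_normal((seq_len, d_model))
w_q, w_k, w_v = (rng.standard_normal((d_model, d_model)) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)
print(out.shape)
```

Because the workload reduces to these dense matmuls, any hardware that does matrix multiplication fast (TPU, GPU tensor cores) can run it, independent of the specific model architecture.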