logoalt Hacker News

written-beyondyesterday at 1:31 PM2 repliesview on HN

I thought these TPUs were primarily used for inference?


Replies

vlovich123yesterday at 1:59 PM

TPU8t is for training. But even still, once you’ve trained, you need to run the model too. And these kinds of models already have a huge latency hit so there’s not much hurting running it away from the trading switches.

knowaveragejoeyesterday at 2:11 PM

As the article states, there's both training and inference dedicated chips.