Google has been releasing a new TPU generation every year since 2023 and the eight generation consists of a training and an inference optimized design.
Google's eight generation TPU inference chip has 384 MB of on-chip SRAM vs 500 MB for Groq's third generation LPU.