I think inference costs are now higher than training costs.
Nvidia is king of general-purpose training chips. But inference can run on specialized chips.