Nvidia said in March that they're working on specialized inference hardware, but they don't have any right now. You can run inference on Nvidia's current hardware, but it's not as efficient.
AMD has been making inference chips for many years and is the leader in HPC.
https://www.amd.com/en/products/accelerators/instinct.html