How many do you need to run inference for 1 user on a model like Opus 4.5?
8x 3090.
Actually better make it 8x 5090. Or 8x RTX PRO 6000.
8x 3090.
Actually better make it 8x 5090. Or 8x RTX PRO 6000.