Hacker News

woggy, yesterday at 8:57 PM

How many do you need to run inference for 1 user on a model like Opus 4.5?


Replies

ronsor, yesterday at 9:00 PM

8x 3090.

Actually better make it 8x 5090. Or 8x RTX PRO 6000.
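
For a rough sense of scale, here is a back-of-the-envelope sketch of the aggregate VRAM in each of those setups. The per-card figures are the published specs as I understand them (24 GB for the 3090, 32 GB for the 5090, and 96 GB for the RTX PRO 6000, assuming the Blackwell card is meant). The names `VRAM_GB` and `total_vram_gb` are made up for illustration. The size of Opus 4.5 is not public, so this says nothing about whether any of these would actually be enough.

```python
# Per-card VRAM in GB. Assumption: "RTX PRO 6000" means the 96 GB Blackwell card.
VRAM_GB = {"RTX 3090": 24, "RTX 5090": 32, "RTX PRO 6000": 96}


def total_vram_gb(card: str, count: int = 8) -> int:
    """Aggregate VRAM across `count` identical cards, ignoring all overhead."""
    return VRAM_GB[card] * count


for card in VRAM_GB:
    print(f"8x {card}: {total_vram_gb(card)} GB")
```

This only adds up raw memory; it ignores KV cache, activation memory, and interconnect bandwidth, all of which matter for real inference.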
