Hacker News

tgrowazay yesterday at 8:08 PM

Just pick up any >240GB VRAM GPU at your local Best Buy to run a quantized version.

> The full Kimi K2.5 model is 630GB and typically requires at least 4× H200 GPUs.


Replies

CamperBob2 yesterday at 10:00 PM

You could run the full, unquantized model at high speed with 8 RTX 6000 Blackwell boards.

I don't see a way to put together a decent system of that scale for less than $100K, given RAM and SSD prices. A system with 4x H200s would cost more like $200K.
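
As a rough sanity check on the 8-board figure above, here is a minimal sketch of the memory arithmetic. It assumes 96 GB of VRAM per RTX 6000 Blackwell board (taken from the public spec sheet, not from this thread) and uses the 630 GB model size quoted above. The helper name `fits_in_vram` is hypothetical, and the check covers weights only, ignoring KV cache, activations, and framework overhead.

```python
def fits_in_vram(model_gb, gpus, vram_per_gpu_gb):
    """Return (total VRAM in GB, whether the weights alone fit).

    Weights-only estimate: KV cache, activations, and runtime
    overhead are not counted, so real headroom is smaller.
    """
    total_gb = gpus * vram_per_gpu_gb
    return total_gb, total_gb >= model_gb


# Assumption: 96 GB per RTX 6000 Blackwell board; 630 GB is the quoted model size.
total_gb, fits = fits_in_vram(model_gb=630, gpus=8, vram_per_gpu_gb=96)
print(f"{total_gb} GB total, weights fit: {fits}")  # 768 GB total, weights fit: True
```

Under those assumptions the eight boards leave about 138 GB for everything besides the weights, so usable context length would depend on how much of that the KV cache consumes.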
