Hacker News

sourcecodeplz · yesterday at 1:28 PM · 1 reply

$1.5k per month for SOTA. With 128 GB you can run DSV4 Flash.


Replies

pqtyw · yesterday at 6:38 PM

What's the point of running it locally, though? Inference for open models is already quite cheap. They could just self-host anyway. The experience of running LLMs locally will be excruciatingly bad in comparison, at least for the near future.