$1.5kpm for SOTA. 128gb you run DSV4 Flash.

sourcecodeplz • yesterday at 1:28 PM • 1 reply • view on HN

Replies

What's the point of running it locally though? Inference for open models is quite cheap already. They could just selfhost, anyway. The experience of running LLMs locally will be excruciatingly bad in comparison at least for the near future.

alt Hacker News

Replies