2001zhaozhao • today at 6:22 PM

I'm kind of interested in a setup where you buy local hardware specifically to run a crap ton of small-to-medium LLMs 24/7 at high throughput. These models might now be smart enough to make all kinds of autonomous agent workflows viable cheaply, as long as there's a good queue prioritization system for queries to keep the hardware fully utilized.
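
To make the queue prioritization idea a bit more concrete, here's a rough sketch of what I have in mind. Everything in it is made up for illustration (`PriorityBatcher`, `submit`, `next_batch`, the priority numbers, the batch size); it doesn't talk to any real inference server, it just shows urgent queries jumping ahead of background work while batches still get filled.

```python
import heapq
import itertools
from dataclasses import dataclass, field


@dataclass(order=True)
class Job:
    priority: int                       # lower number = more urgent (assumed convention)
    seq: int                            # tie-breaker so equal priorities stay FIFO
    prompt: str = field(compare=False)  # payload, not used for ordering


class PriorityBatcher:
    """Hypothetical in-memory queue that hands out batches by priority."""

    def __init__(self, max_batch_size=4):
        self.max_batch_size = max_batch_size
        self._heap = []
        self._counter = itertools.count()

    def submit(self, prompt, priority):
        # Push onto a min-heap keyed by (priority, arrival order).
        heapq.heappush(self._heap, Job(priority, next(self._counter), prompt))

    def next_batch(self):
        # Pop up to max_batch_size jobs, most urgent first. Background jobs
        # fill any leftover slots so the hardware is not left idle.
        batch = []
        while self._heap and len(batch) < self.max_batch_size:
            batch.append(heapq.heappop(self._heap).prompt)
        return batch

    def __len__(self):
        return len(self._heap)


if __name__ == "__main__":
    q = PriorityBatcher(max_batch_size=2)
    q.submit("nightly summarization", priority=5)
    q.submit("interactive chat reply", priority=0)
    q.submit("agent tool-call step", priority=1)
    print(q.next_batch())  # the two most urgent prompts
    print(q.next_batch())  # whatever background work is left
```

A real version would also need something to stop low-priority jobs from starving (e.g. bumping priority with wait time), but that's beyond this sketch.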
