
bastawhiz, today at 2:02 PM

That's the point: why would you buy a device that's specifically not optimized to be used for 24/7 inference? It's expensive hardware that's not designed to be used in that situation! The power use for inference isn't especially good and you're not getting even a fraction of the benefit from the hardware that you're paying for.


Replies

apf6, today at 4:46 PM

Good question, but people are doing it anyway. It's a fact that right now tons of people are buying Mac Minis specifically for this use case, to treat them as their personal data center for agents. The concept of "power use for inference" is foreign to them. I think those people are the ones who motivated this blog post.

dist-epoch, today at 4:32 PM

> why would you buy a device that's specifically not optimized to be used for 24/7 inference

Because it costs $1k-$2k instead of $10k-$30k+ for optimized devices.