logoalt Hacker News

dist-epochtoday at 1:02 PM2 repliesview on HN

using it 24/7 brings the average cost down, not up.

the less you use local LLM, the less sense it makes since you paid a lot for hardware you don't use


Replies

bastawhiztoday at 2:02 PM

That's the point: why would you buy a device that's specifically not optimized to be used for 24/7 inference? It's expensive hardware that's not designed to be used in that situation! The power use for inference isn't especially good and you're not getting even a fraction of the benefit from the hardware that you're paying for.

groundzeros2015today at 1:30 PM

The hardware has multiple uses for the same cost. The pay-per-use server does not.