logoalt Hacker News

Dylan16807today at 1:04 AM1 replyview on HN

That's a silly reason. For non-agent use cases what kind of utilization are you going to average on your own GPU, 5-10%? And that's without batching.

Even with overhead and scaling for peak use and a large profit margin, any company with an ounce of competition will be vastly cheaper than self-hosting. And for models you can run yourself, there will be plenty of competition.


Replies

LtWorftoday at 4:53 AM

I think you are calculating with current prices. Try to extrapolate the price in one year, seeing the current trends instead.

show 1 reply