logoalt Hacker News

andybaktoday at 10:35 AM2 repliesview on HN

I'm yet to see any convincing argument that inference is subsidised in any substantial way. Training and speculative expansion are where the spend is from what I can see.


Replies

rjh29today at 10:47 AM

A few days ago Gemini redid their rate limits, making images/audio/video generation much more expensive, shrunk limits across the board (including a new weekly limit) and added more expensive tiers.

At the moment you can pay $20/month to do thousands of expensive queries a month (involving file uploads, the Pro model, extended thinking), and evidence suggests that heavy users are not profitable.

show 1 reply
automatic6131today at 10:47 AM

If inference was profitable - they'd tell us. Msft, goog, public companies. They'd break out the numbers and show us, if they were good.

But instead, all we get is known liars going on podcasts and repeating "stylized facts" that aren't literally true about their supposed profitability on inference, from companies losing billions per year in a situation where they don't have to tell the truth.

That is VERY far from a convincing argument that they are profitable. So I can & will safely conclude that the opposite is true.