logoalt Hacker News

drakythelast Monday at 1:52 PM5 repliesview on HN

Anthropic also recently tweaked their usage limits to discourage use during peak hours. Why would they do that if inference was profitable?


Replies

infectolast Monday at 1:54 PM

Don’t confuse inference (api usage) with the consumer plan products. When people say inference is profitable they are referring to the cost to serve a token via the API. The consumer products are absolutely a question mark on profitability and as we see with most of the business and enterprise plans, going away for pure on demand use (api cost) full time.

strangegeckolast Monday at 2:42 PM

Profitability doesn't imply infinite ability to scale. Of course they will want to prioritize their most profitable customers when they hit capacity issues.

aurareturnlast Monday at 7:35 PM

They do it because their demand is higher than the compute that they have available to them. Their GPUs must be melting during peak hours so they're encouraging people who move their workload to off peak hours if possible.

This is the opposite of an AI bubble burst.

paulddraperlast Monday at 6:40 PM

Those are subscription plans. They tweaked the limits/periods included in the subscription. Having higher limits for subscription plans didn't give them any more revenue.

financltravstylast Monday at 5:04 PM

Their infra team is very understaffed and they are reacting to the public backlash of "no 9s?"