logoalt Hacker News

benlivengoodtoday at 5:23 PM2 repliesview on HN

I dunno, GPT-OSS and Llama and QWEN and any half dozen of other large open-weight models?

I really can't imagine OpenAI or Anthropic turning off inference for a model that my workplace is happy to spend >$200*person/month on. Google still has piles of cash and no reason to turn off Gemini.

The thing is, if inference is truly heavily subsidized (I don't think it is, because places like OpenRouter charge less than the big players for proportionally smaller models) then we'd probably happily pay >$500 a month for the current frontier models if everyone gave up on training new models because of some oddball scaling limit.


Replies

crimsoneertoday at 5:47 PM

Yeah, this is silly. Plenty of companies are hosting their own now, sometimes on prem. This isn't going away

iLoveOncalltoday at 5:35 PM

> we'd probably happily pay >$500 a month for the current frontier models

Try $5,000. OpenAI loses hundreds of billions a year, they need a 100x, not 2x.

show 4 replies