For me it depends on who you listen to
If you're following a bunch of people who are from LLM labs, you're going to be more incentivised to tokenmaxx because it's in the Lab's best interest tonget you to behave that way.
Practically, many companies aren't labs with endless runway. Companies hopefully follow a PnL model. And when you look at things with that lens, many of the times the LLM use case falls apart.
You're seeing a bunch of companies starting to realise that tokenmaxing yields very little ROI.
Even the LLM labs, the guy that spent $1+mil tokens has nothing to show for it in terms of revenue to the company. And you have to keep sinking that much into AI for ... "features".
There are some good use cases for AI. I ended up with a positive ROI on a greenfield project myself, albeit on a small scale.
The way that AI has been making people have totally irrational decisions on executive, pure business and technical standpoints is simply mindblowing. I don't understand how people can't take a step back and see what's actually happening from a macro perspective.
I might have a different take, I’m happy with this price per token so only those who’re using it for value would use for what they want.
There are so many useless cases such as people bragging about their token consumption that has no product and no value add, or those with OpenClaw doing useless automation that could be a Python script.
Useful context for this is that token usage keeps rising at an exponential pace. I mean, we don't have numbers for the big labs, but Openrouter's numbers are quite telling (can't post link because corporate decided to block all "non-validated AI tools"), and I think they're probably representative of the global trend. +500% year to date, +50% over the month of May alone. It's unsurprising that providers are struggling to find and pay for the compute.
Perhaps advanced AI isn't cheaper than humans.
Assuming the intelligence of a model continuously improves with scale, the token price of the best model will become increasingly expensive.
I know that tokens are currently experiencing rapid price drops, but they will eventually encounter physical limitations.
> Kirsten: All of this, to me, illustrates how quickly things are moving. I mean, when you really think about it, the whole tokenmaxxxing thing has become a thing, peaked, and now is seen disfavorably, within six months
Pretty sure from inception the phrase “tokenmaxxing” was never seen in a positive light…
I'm guessing it's going to be an absolute banger of a month for forge[1] and the like.
We need cheap tokens from China...
Probably the cost model for LLM providers for consumers will be somewhat subsidised by providers basically linking up extremely specific profiles about users and using these to sell products directly in an agentic pipeline which includes agentic commerce. Maybe it's less one click purchases and more one prompt purchases. Of course this stands to be pretty bad for consumers in a lot of ways (deeply invasive marketing, possibly being missold products).
Of course the question remains, who is supposed to be buying products through this system if AI systems continue to displace jobs?
Isn't it weird not to see a wave of C-level firings for making such a basic mistake?
Even on the consumer side AI providers are enshitifying the plans. Everyone saw this coming three months ago plus.
The corporate side seems to be well... stupid? Execs asking their people to burn tokens do not understand the politics and cadence of business. Corporations do not actually demand more work to be completed in the way we traditionally think. Creating a lot of stuff in a corporation tends to naturally banish most of it to the void because that stuff requires other people to exist and engage with it in order to use it, deploy it, get customers using it, etc. AI does not take up that slack in the way that we are being told because it lacks agency. For most people in corporations the problem is not that they can't do their work, their real jobs are mostly being political nodes in a vast system. There is no solution on the table to change that at all.
Betteridge's law [0] - NO
Anecdotal experience - my coworkers will use the "max-think" and the most expensive model on every change they do with Claude, pumping out 100k's of tokens just because they can (and brag about hitting the limits).
I suspect this kind of behaviour will need to change in the very near future.
[0] - https://en.wikipedia.org/wiki/Betteridge%27s_law_of_headline...
This might help Mistral sell more on-prem solutions. Not only do you get to keep your data, it might make more financial sense too.
it's simple, how much dollars you get out for every dollar put into tokens
as Jensen said, get ready for $1000 per mil token
those for which this price makes sense will push out those for which it doesn't - to lower models or to local models
but those who want to run local models need to compete for hardware with the data centers, which have strong scale effects thus will always be able to out price local hardware allocations - can already be seen now as hardware makers get out of retail business
I see at least two potential positives in this:
- The frontier AI companies have realized they won't be able to count on gaining ground and earning more in the future through sheer moat. They have to start earning right now.
- The playing field on the market got a whole lot more even as a result. Now everyone is competing on cost and quality - while there are still a lot of competition. AI can't easily get away with subsidizing their own product and enshittify later.
I might be missing something obvious here? It feels to me that if the frontier AI companies thought they could gain a lot more moat they wouldn't raise their prices this much this early? And their current moats/head start doesn't seem insurmountable?
The linked Reddit thread is quite hilarious. Earlier this year my company hired a new CEO and his first company address was solely to tell everyone to use AI or they’d lose their job and become unemployable in general.
I knew right there and then that he was a moron. There’s something about American companies where the best and brightest rarely show up in senior management. It seems to be populated by some weird class of golf playing NPCs that figured out how to game the system and bring all their cult members along for the ride.
My own company spent 2+ years enforcing extreme austerity, to the point of firing the very people who built everything, only to run wild with AI spending and seeing little results from it.
Surely, out there in the wilderness, there is a company staffed by intelligent, skilled people. Right?
it'd be really funny if we got the RSI -> ASI world, all human labor became worthless, etc., but everyone with any money in the labs also lost their shirts because OSS is maximally good for most inference anyway.
How would one position one's portfolio if one was worried things are about to start crashing hard?
> “Can these AI labs collapse that cost [and] progress the tech enough in a way that it eventually meets in the middle with customers’ appetite for spending?” Sean wondered.
It depends where you buy the tokens from. Jevon's paradox exists in China and not in the US for now.
> In just a few months, companies became obsessed with “tokenmaxxxing,” then turned against it due to the high costs.
Casinos (in the US) telling customers to spend more on tokens, introduces free spins, discounts, resetting limits on peak hours. Then introduces new slot-machine that promises to give better odds to the gamblers, but instead is more expensive to use.
The ones in China did the opposite and made their discount on tokens permanent.
All this 'tokenmaxxing' was an outright scam. Now the AI companies want you 'tokenmaxxing' your agents on loops as the token prices increase.
I think it is easy to make some ragebait doomer articles with eye catching headlines. There are a lot of people who are ready to eat up AI catastrophism because something about apocalyptic predictions and catastrophism seems to attract certain kind of doomer-pilled people.
Here are my concrete predictions
1. Token costs will come down and performance will go up
2. Everyone will spend even more on LLMs not less - the article points at small blips but if anyone thinks it will go down from now, you are mistaken
3. AI Companies will be profitable
If anyone wants to counter bet on me, please go ahead.
[flagged]
One thing that seems under-discussed is how quickly token cost changes product behavior once people move from demos to recurring workflows.
When the interaction is exploratory, the marginal cost feels invisible: ask again, summarize again, try another agent. In a business workflow, the same pattern becomes a metering problem. You have to decide which parts actually need a frontier model, which can use a smaller/local model, and which should not be generated at all.
That probably pushes AI products away from "chat with everything" and toward much narrower tools with explicit ROI: less open-ended generation, more constrained pipelines, caching, evaluation, and human review at the points where mistakes are expensive.
With no details, a bird told me of a project which estimated using several millions of tokens per day to automate a team work which got laid off. The operation is now a mess, there is no one willing to be considered liable and since the cheap model they used is about to be retired the company is going to see a 4x increase in price at least.
I have the feeling that the age of 'i can't be blamed by AI stuff' will be a "this was the computer guy mistake" for a moment.
PS. I've been using Claude opus 4.8 and it is worse than 4.6 and I will say that even sonnet 4.6 is better. PhD. Level of software and engineering I believe! I know many PhD who never coded or worked anyway