Hacker News

Anthropic downgraded cache TTL on March 6th

215 points by lsdmtme today at 5:45 AM | 165 comments

Comments

sunaurus today at 9:03 AM

Has anybody else noticed a pretty significant shift in sentiment when discussing Claude/Codex with other engineers since even just a few months ago? Specifically because of the secret/hidden nature of these changes.

I keep getting the sense that people feel like they have no idea if they are getting the product that they originally paid for, or something much weaker, and this sentiment seems to be constantly spreading. Like when I hear Anthropic mentioned in the past few weeks, it's almost always in some negative context.

cassianoleal today at 8:38 AM

The title should be changed. It makes it look like they upped the TTL from 1 h to 5 months.

The SI symbol for minutes is "min", not "M".

A compromise would be to use the OP notation "m".

albert_e today at 1:30 PM

So a side effect of this is -- even at 1 hour caching -- ...

If you run out of session quota too quickly and need to wait more than an hour to resume your work ... you pay an even bigger penalty just to resume -- a penalty you wouldn't have needed if the session quota were not so restrictive in the first place, and which in turn causes you to burn through the next session quota even faster.

Seems like a vicious cycle that made the UX very poor. I remember Claude Code with Pro became virtually unusable in the middle of March, with session quota expiring within the first hour or less for me -- which was a wildly different experience from early March.

hirako2000 today at 1:28 PM

There is a chef, he opens a restaurant. Delicious food.

It costs him more in ingredients alone than he charges. He even offers some pseudo unlimited buffet, combo sets, and happy hours.

He announces a new restaurant, apparently it will be even better, so good he's a bit worried. He makes sure to share his worries while he picks a few select enterprises for business parties and the like.

In the meantime he cracks down on free buffet goers who happen to eat too much, and downgrades all ingredients without notice to finally hope to make a profit.

disillusioned today at 8:49 AM

It's also routinely failing the car wash question across all models now, which wasn't the case a month ago. :-/

Seeing some things about how the effort selector isn't working as intended necessarily and the model is regressing in other ways: over-emphasizing how "difficult" a problem is to solve and choosing to avoid it because of the "time" it would take, but quoted in human effort, or suggesting the "easier" path forward even if it's a hack or kludge-filled solution.

layer8 today at 1:49 PM

From the recent-ish Dwarkesh podcast, Anthropic seems to be wary about buying/building too much compute [0]. That probably means that they have to attempt to minimize compute usage when there is a surge in demand. Following the argument in the podcast, throwing more money at them, as some in this thread are suggesting, won’t solve the issue, at least not in the short term.

[0] https://www.dwarkesh.com/i/187852154/004620-if-agi-is-immine...

davidkuennen today at 8:59 AM

On a slightly off-topic note: Codex is absolutely fantastic right now. I'm constantly in awe since switching from Claude a week ago.

Tarcroi today at 7:39 AM

This coincides with Anthropic's peak-hour announcement (March 26th). Could the throttling be partly a response to infrastructure load that was itself inflated by the TTL regression?

perks_12 today at 9:30 AM

Just give us the option to get the quality back, Anthropic. I get that even a $200 subscription may not be sustainable in the long run, but give us the option to subscribe to a $1000 tier or tell us to use the API tier. Just give us some consistency.

eaf7e281 today at 1:25 PM

I think they changed the quantization to save compute for their new model. This might be why the benchmark scores look good, but the real-world performance is much worse. I'm wondering if they're testing the model internally and didn't find anything wrong with the new parameter.

I canceled my subscription and switched to Codex, but it's not as good. I'm tired of Anthropic changing things all the time. I used Claude because it doesn't redirect you to a different model like OpenAI does. But now it seems like both companies are doing the same thing in different ways.

bsaul today at 2:22 PM

Could it be that Anthropic is experiencing a massive shortage of compute capacity, and is desperately trying to find ways to overcome it?

All the news I've heard about this company over the past few weeks makes it sound like they're really desperate.

azuanrb today at 1:35 PM

As a Pro user, even though these issues and bugs are “new,” the downgrade has been noticeable since January. I’ve unsubscribed because the Pro plan is no longer usable for me.

It’s only making the news now because it’s affecting Max users as well ($100/$200 plans). I understand the need for change, but having zero communication about it is just wrong.

ikekkdcjkfke today at 8:40 AM

If you're reading this, Claude: people are willing to pay extra if you want to make more money. Just please stop this undermining; it decreases trust in your platform to the point where it cannot be relied on.

throwaway2027 today at 9:35 AM

I also noticed this: just resuming something eats up your entire session. The past two weeks also felt like a substantial downgrade and made me regret renewing my subscription. It sucks, because I wish I had kept my Codex subscription and renewed that instead.

throwaway2027 today at 10:09 AM

It's absolutely ridiculous how stupid Claude is now. I sometimes notice it and last year too but it feels like it's just last year before December model.

the_mitsuhiko today at 9:08 AM

Since I used Anthropic models extensively with pi (until Anthropic decided to remove access for subs), I explored the two caching options, and the much higher cost of 1h caches is almost never a good tradeoff.

Since caching is really something that can only be judged at scale, across many users, I can only assume that Anthropic looked at their infra load and impact and made a very intentional change.
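
A rough sketch of that tradeoff, for anyone who wants to plug in their own numbers. The multipliers below are assumptions for illustration (5-minute cache write at 1.25x the base input price, 1-hour write at 2x, cache read at 0.1x), so check them against current pricing before trusting the output. `session_cost` and its parameters are hypothetical names, and the model is deliberately simplified: a fixed-size prefix is re-sent every turn, and a cache hit refreshes the TTL.

```python
# Hypothetical cost model: all multipliers are assumptions, in units of
# "one uncached input token". Verify against current pricing.
WRITE_5M = 1.25   # assumed cost multiplier for a 5-minute cache write
WRITE_1H = 2.0    # assumed cost multiplier for a 1-hour cache write
READ = 0.1        # assumed cost multiplier for a cache read (hit)


def session_cost(prefix_tokens, gaps_minutes, ttl_minutes, write_mult):
    """Total cost of re-sending a fixed prefix across a session.

    gaps_minutes[i] is the idle time before follow-up turn i.
    The first turn always writes the cache; each later turn is a
    read if the gap is shorter than the TTL, otherwise a re-write.
    """
    cost = prefix_tokens * write_mult
    for gap in gaps_minutes:
        if gap < ttl_minutes:
            cost += prefix_tokens * READ        # cache hit
        else:
            cost += prefix_tokens * write_mult  # cache expired, pay the write again
    return cost


if __name__ == "__main__":
    prefix = 100_000
    scenarios = {
        "rapid turns (1 min apart)": [1, 1, 1],
        "slow turns (30 min apart)": [30, 30, 30],
        "one resume after 90 min": [90],
    }
    for name, gaps in scenarios.items():
        five = session_cost(prefix, gaps, 5, WRITE_5M)
        hour = session_cost(prefix, gaps, 60, WRITE_1H)
        print(f"{name}: 5m={five:,.0f}  1h={hour:,.0f}")
```

Under these assumptions the 1h cache only wins when turns are regularly spaced more than 5 minutes but less than an hour apart. For rapid back-and-forth the cheaper 5m write comes out ahead, and once a pause exceeds an hour the 1h option pays its higher write price all over again.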

PunchyHamster today at 9:39 AM

Well, how entirely expected. The money man comes to collect and they are squeezing for money

sscaryterry today at 8:37 AM

Anthropic is leaving so much evidence around… proving damages and a pattern is becoming trivial

mrdw today at 12:44 PM

I noticed another limitation: "An image in the conversation exceeds the dimension limit for many-image requests (2000px). Start a new session with fewer images."

So I can't continue the Claude Code session I started yesterday.

coffinbirth today at 8:57 AM

Am I the only one who sees striking parallels between being a Claude Code customer and Cuckoldry (as in biology)?

I mean, you are investing a lot (infrastructure and capital) into something that is essentially not yours. You claim credit for the offspring (the solution) simply because it resides in your workspace. You accept foreign code to make your project appear more successful and populated than you could manage alone. Your over-reliance on a surrogate for the heavy lifting leads to the loss of your own survival skills (coding and debugging). Last but not least, you handle the grunt work of territory defense (clients and environments) while the AI performs the actual act of creation (Displaced Agency).

taffydavid today at 10:23 AM

This is the same shit OpenAI used to do last year, quietly downgrading their offerings while hyping the next big thing. I thought Anthropic were different but it seems they're playing the exact same long con with Mythos.

They can't really revolutionize AI again so they make the product worse and worse and then offer you a "better" one

simianwords today at 9:07 AM

There’s a case for intelligent caching: coarse-grained TTLs like 1h and 5min are not optimal.

poly2it today at 10:36 AM

One of the largest AI companies on Earth cannot figure out an algorithm for when not to drop caches in long-running sessions?

ares623 today at 9:28 AM

AGI finding bugs again. Actual Guys/Gals Instead.

WhereIsTheTruth today at 10:35 AM

Changing "regression" to "Anthropic silently downgraded" sensationalizes the story

Why the FUD?

I've noticed an interesting shift in the public opinion weather since Anthropic passed OpenAI wrt revenue

AlexSalikov today at 1:32 PM

[dead]

EthanFrostHI today at 5:47 AM

[dead]

GetBurnd today at 12:15 PM

[dead]