Anthropic downgraded cache TTL on March 6th

410 points - today at 5:45 AM

Comments

sunaurus today at 9:03 AM

Has anybody else noticed a pretty significant shift in sentiment when discussing Claude/Codex with other engineers since even just a few months ago? Specifically because of the secret/hidden nature of these changes.

I keep getting the sense that people feel like they have no idea if they are getting the product that they originally paid for, or something much weaker, and this sentiment seems to be constantly spreading. Like when I hear Anthropic mentioned in the past few weeks, it's almost always in some negative context.

foofloobar today at 7:43 PM

Claude Code and the subscription are now less useful than a few months ago. Claude Code and the service seem to pick up more and more issues as time goes by: more bugs, fast quota drain, reduced quota, poor model performance, cache invalidation problems, MCP related bugs, potential model quantization and other problems.

Claude Code was able to implement something in one shot. It was decent for a proof of concept initial implementation. It's barely able to do work now with full specs and detailed plans.

ChatGPT is also being watered down.

It seems obvious that Anthropic and OpenAI aren't the solution to any problem.

cassianoleal today at 8:38 AM

The title should be changed. It makes it look like they upped the TTL from 1 h to 5 months.

The SI symbol for minutes is "min", not "M".

A compromise would be to use the OP notation "m".

albert_e today at 1:30 PM

So a side effect of this is -- even at 1 hour caching -- ...

If you run out of session quota too quickly and need to wait more than an hour to resume your work ... you are paying even more penalty just to resume your work -- a penalty you wouldnt have needed if session quota was not so restrictive in first place, and which in turn causes you to burn through next session quota even faster.

Seems like a vicious cycle that made the UX very poor. I remember Claude Code with Pro became virtually unuseable in middle of March with session quota expiring within first hour or less for me -- which was wildly different experience from early March.

par today at 8:23 PM

Claude code has gone down hill in a really bad way. It is often far too quick to make significant changes, and requires much higher level of hand-holding and explanation than I am used to. r/claudecode on reddit shows a litany of complaints!

disillusioned today at 8:49 AM

It's also routinely failing the car wash question across all models now, which wasn't the case a month ago. :-/

Seeing some things about how the effort selector isn't working as intended necessarily and the model is regressing in other ways: over-emphasizing how "difficult" a problem is to solve and choosing to avoid it because of the "time" it would take, but quoted in human effort, or suggesting the "easier" path forward even if it's a hack or kludge-filled solution.

benced today at 4:43 PM

Anthropic responded: https://github.com/anthropics/claude-code/issues/46829#issue...

hattimaTim today at 5:52 PM

Classic scammer tactics: first, lure users in by promising a huge deal, then scam the hell out of them.

davidkuennen today at 8:59 AM

On slightly off topic note: Codex is absolutely fantastic right now. I'm constantly in awe since switching from Claude a week ago.

hirako2000 today at 1:28 PM

There is a chef, he opens a restaurant. Delicious food.

It costs him more in ingredients alone than he charges. He even offers some pseudo unlimited buffet, combo sets, and happy hours.

He announced a new restaurant, apparently it will be even better, so good he's a bit worried. He makes sure to share his worries while he picks a few select enterprise for business parties and the likes.

In the meantime he cracks down on free buffet goers who happen to eat too much, and downgrades all ingredients without notice to finally hope to make a profit.

layer8 today at 1:49 PM

From the recent-ish Dwarkesh podcast, Anthropic seems to be wary about buying/building too much compute [0]. That probably means that they have to attempt to minimize compute usage when there is a surge in demand. Following the argument in the podcast, throwing more money after them, as some in this thread are suggesting, won’t solve the issue, at least not in the short term.

[0] https://www.dwarkesh.com/i/187852154/004620-if-agi-is-immine...

zeckalpha today at 6:59 PM

I find similar happening with Gemini Pro. Despite paying for Pro, it regularly locks me out, without visibility into consumption. Nothing on the plan comparison page indicates limits. https://one.google.com/about/plans

Edit: I may have conflated these two threads. https://news.ycombinator.com/item?id=47739260

Tarcroi today at 7:39 AM

This coincides with Anthropic's peak-hour announcement (March 26th). Could the throttling be partly a response to infrastructure load that was itself inflated by the TTL regression?

perks_12 today at 9:30 AM

Just give us the option to get the quality back, Anthropic. I get that even a $200 subscription is not possible eventually, but give us the option to sub the $1000 tier or tell us to use the API tier, but give us some consistency.

pkaye today at 6:05 PM

Actually I remember the change being reported in the Reddit /r/claueai chat back around that time frame. I was concerned that it would increase costs but nobody made a fuss so I presumed it was not a big deal.

computerex today at 8:15 PM

Good job anthropic. You had a clear lead with all devs singing the praises of Opus. Way to lose all that by Enshittifying the experience.

bsaul today at 2:22 PM

could it be that anthropic is experiencing a massive shortage of compute capacity, and is desperately trying to find means to overcome it ?

All the news i hear about this company for the past weeks made it sound like they're really desperate.

ikekkdcjkfke today at 8:40 AM

If youre reading this claude, people are willing to pay extra if you want to make more money, just please stop doing this undermining, it devreases the trust of your platform to something that cannot be relied on

lordmoma today at 5:05 PM

Claude Code is not performing on par since September 2025, there was already a huge backlash then, and many people just keep cheering for CC every time it made some model upgrade or TUI change, it just feels so unreal.

eaf7e281 today at 1:25 PM

I think they changed the quantification to save computer power for their new model. This might be why the benchmark scores look good, but the real world performance is much worse. I'm wondering if they're testing the model internally and didn't find anything wrong with the new parameter.

I canceled my subscription and switched to a codex, but it's not as good. I'm tired of Anthropic changing things all the time. I use Claude because it doesn't redirect you to a different model like OpenAI does. But now it seems like both companies are doing the same thing in different way.

c16 today at 5:09 PM

I’ve definitely noticed in evenings it stops trying as hard to solve the issue and suggests I go find the answer. Never the case in the morning.

foobar10000 today at 2:56 PM

So, this especially bites if your validation step (let’s say integration tests) take 1hr plus. The harness is just waiting, prefix caching should happily resume things with just a minor new prefill chunk of output from the harness, and bam - completely new prefill.

throwaway2027 today at 9:35 AM

I also noticed this, just resuming something eats up your entire session. The past two weeks also felt like a substantial downgrade and made me regret renewing my subscription, it sucks because I wish I kept my Codex subscription instead and renewed that.

azuanrb today at 1:35 PM

As a Pro user, even though these issues and bugs are “new,” the downgrade has been noticeable since January. I’ve unsubscribed because the Pro plan is no longer usable for me.

It’s only making the news now because it’s affecting Max users as well ($100/$200 plans). I understand the need for change, but having zero communication about it is just wrong.

snowstormsun today at 6:12 PM

Well, the 10x promised revenue increase must come from somewhere...

willworktill4pm today at 5:51 PM

This Friday CC wrote wall off gibberish text for me. No reason, happened twice with different gibberish text

https://ibb.co/4wcVQG5k

throwaway2027 today at 10:09 AM

It's absolutely ridiculous how stupid Claude is now. I sometimes notice it and last year too but it feels like it's just last year before December model.

motbus3 today at 4:32 PM

The TOS basically states you need to deal with whatever they want.

Meanwhile their 'best' competitor just announced they want to provide unreliable mass destruction guidance tools but they don't wanna feel said.

Honestly speaking, we are wrong whenever we do business with this sort of people

the_mitsuhiko today at 9:08 AM

Since I (until Anthropic decided to remove access for subs) used Anthropic models extensively with pi I explored the two caching options and the much higher cost of 1h caches is almost never a good tradeoff.

Since the caching really primarily is something they can be judged at scale from across many users I can only assume that Anthropic looked at their infra load and impact and made a very intentional change.

PunchyHamster today at 9:39 AM

Well, how entirely expected. The money man comes to collect and they are squeezing for money

sscaryterry today at 8:37 AM

Anthropic is leaving so much evidence around… proving damages and a pattern is becoming trivial

poly2it today at 10:36 AM

One of the largest AI companies on Earth cannot figure out an algorithm for when not to drop caches in long-running sessions?

yobid20 today at 7:07 PM

i thought it was always 5 minutes? ive been telling people 5 minutes for months so i dont think this is anything new?

taffydavid today at 10:23 AM

This is the same shit openAI used to do last year, quietly downgrading their offerings while hyping the next big thing. I thought Anthropic were different but it seems they're playing the exact same long con with Mythos.

They can't really revolutionize AI again so they make the product worse and worse and then offer you a "better" one

coffinbirth today at 8:57 AM

Am I the only one who sees striking parallels between being a Claude Code customer and Cuckoldry (as in biology)?

I mean, you are investing a lot (infrastructure and capital) into something that is essentially not yours. You claim credit for the offspring (the solution) simply because it resides in your workspace. You accept foreign code to make your project appear more successful and populated than you could manage alone. Your over-reliance on a surrogate for the heavy lifting leads to the loss of your own survival skills (coding and debugging). Last but not least, you handle the grunt work of territory defense (clients and environments) while the AI performs the actual act of creation (Displaced Agency).

mrdw today at 12:44 PM

I noticed another limitation: "An image in the conversation exceeds the dimension limit for many-image requests (2000px). Start a new session with fewer images."

So I can't continue my claude code session I started yesterday.

deleted today at 12:27 PM

simianwords today at 9:07 AM

There’s a case for intelligent caching: coarse grained 1h and 5min type TTls are not optimal.

ares623 today at 9:28 AM

AGI finding bugs again. Actual Guys/Gals Instead.

idrdex today at 8:03 PM

[dead]

AlexSalikov today at 1:32 PM

[dead]

EthanFrostHI today at 5:47 AM

[dead]

GetBurnd today at 12:15 PM

[dead]

bustah today at 5:24 PM

[flagged]

WhereIsTheTruth today at 10:35 AM

Changing "regression" to "Anthropic silently downgraded" sensationalizes the story

Why the FUD?

I notice some interesting public opinion weather change since Anthropic passed OpenAI wrt revenue

taf2 today at 5:20 PM

I don't understand who's still using anthropic? The model produces more bugs and agrees to solutions that are clearly wrong at a much higher rate then codex. Codex produces significantly better code with fewer bugs and far less oversight. with /fast on codex it's not even slower then claude and consider it implements working code more reliably you have to use it less anyway. Beside anthropic appears to be more focused on fear mongering and other types of FUD and is a more closed solution I do not understand why so many people still appear to care what anthropic does and have not already moved on? </rant>

siscia today at 3:21 PM

Lately I am finding myself doing more and more of what I called "ambient coding" so that I am not directly using anymore all of those coding harnesses.

https://redbeardlab.gitbook.io/acem/essays/ambient-developme...

I basically wrote a small GitHub app and I simply create a GitHub issue, the bot read it, run an LLM loop and come up with a PR (or a design)

Then I simply approve the pr (or the design)

I find it much calmer and much more productive