Literally "cancelled" my Anthropic subscription this morning (meaning disabled renewal), annoyed hitting Opus limits again. Going to enable billing again.
The neat thing is that Anthropic might be able to do this as they massively moving their models to Google TPUs (Google just opened up third party usage of v7 Ironwood, and Anthropic planned on using a million TPUs), dramatically reducing their nvidia-tax spend.
Which is why I'm not bullish on nvidia. The days of it being able to get the outrageous margins it does are drawing to a close.
Anthropic are already running much of their workloads on Amazon Inferentia, so the nvidia tax was already somewhat circumvented.
AIUI everything relies on TSMC (Amazon and Google custom hardware included), so they're still having to pay to get a spot in the queue ahead of/close behind nvidia for manufacturing.
The behavioral modeling is the product
I was one of you two, too.
After a frustrating month on GPT Pro and a half a month letting Gemini CLI run a mock in my file system I’ve come back to Max x20.
I’ve been far more conscious of the context window. A lot less reliant on Opus. Using it mostly to plan or deeply understand a problem. And I only do so when context low. With Opus planning I’ve been able to get Haiku to do all kinds of crazy things I didn’t think it was capable of.
I’m glad to see this update though. As Sonnet will often need multiple shots and roll backs to accomplish something. It validates my decision to come back.