I can easily get Claude Code to run for 8-10 hours unsupervised without stopping with sub-agents entirely within Claude Code.
I think it is more likely that if you stick with Claude Code, then you are more likely to stick with Opus/Sonnet, whereas if you use a third party CLI you might be more likely to mix and match or switch away entirely. It's in their interest to get you invested in their tooling.
I've yet to come up with a workflow where I would want Claude to do this much work... unless I had an extremely detailed spec defined for it. How do you ensure it doesn't go off the rails?
On the flip side I started using Claude with other LLMs (openai) because my Pro sub gets maxed out quickly and I want a cheaper alternative to finish a project.
I just use claude code proxy or litellm and set the ANTHROPIC_BASE_URL to my proxy and chose another LLM.
Multi model is the way of the future though as much as I like and prefer Anthropic.
> if you use a third party CLI you might be more likely to mix and match or switch away entirely.
I really like doing this, be it with OpenCode or Copilot or Cline/RooCode/KiloCode: I do have a Cerebras Code subscription (50 USD a month for a lot of tokens but only an okayish model) whereas the rest I use by paying per-token.
Monthly spend ends up being somewhere between 100-150 USD total, obviously depending on what I do and the proportion of simple vs complex tasks.
If Sonnet isn’t great for a given task, I can go for GPT-5 or Gemini 3.