Not to sound like an LLM, but that seems exactly right to me. Use it as a cheaper, high-functioning...

mwigdahl • today at 6:24 PM • 1 reply • view on HN

Not to sound like an LLM, but that seems exactly right to me. Use it as a cheaper, high-functioning task subagent and lower reasoning for a master Opus session. As long as not every portion of your task requires maximum intelligence, you should come out ahead.

Replies

user43928 • today at 7:12 PM

Won't any input be charged uncached, and the output of the small model charged again as uncached input to the bigger model?

I don't know whether that comes out ahead compared to just staying with the better model in the first place.

➕ show 1 reply

alt Hacker News

Replies