logoalt Hacker News

rao-vyesterday at 9:43 PM1 replyview on HN

In a month or three we’ll have the sensible approach, which is smaller cheaper fast models optimized for looking at a query and identifying which skills / context to provide in full to the main model.

It’s really silly to waste big model tokens on throat clearing steps


Replies

Calavaryesterday at 9:49 PM

I thought most of the major AI programming tools were already doing this. Isn't this what subagents are in Claude code?

show 1 reply