logoalt Hacker News

alextillmantoday at 12:57 PM0 repliesview on HN

This is because the current AI approach relies on AI to be a glorified search engine – know everything about everything requiring enormous, ever growing models, and demanding search-engine like near instant responses requiring bigger more complex chips and sprawling data centers to run them in. This leads to a loop demanding ever bigger models, updated at a more and more expensive cost, and chipsets that become much more expensive to deploy.

If you move those things to software and utilize tools that are cheap at scale (databases, web search etc.) the hardware arms race ends and the price becomes sustainable. With the right tools preparing dynamic context for a conversation, models are used for their reasoning and not for their knowledge. And waiting even a minute or two for a model to prepare a response, evaluate it, and iterate to improve quality makes a huge difference.