I mean, this really doesn't sound useful even if LLMs worked that way.
First, if you know nothing you don't even know what you're missing or what to search for.
Then, without unlimited context, you have to do research for every task all over again every time.
The model they built knows a fair bit apparently. You can't get 94.3 on AIME26 knowing nothing.
> First, if you know nothing you don't even know what you're missing or what to search for.
RAG on the initial prompt would be the first thing to try.
> Then, without unlimited context, you have to do research for every task all over again every time.
Thing is, we're really really good at building very fast search engines. Doing research all over again every time shouldn't be a problem.