> LLMs have dramatically worse performance on basic algebra questions when you add in irrelevant information
"Attention is all you need" /
(It is part of the general problem solving process to evaluate what is relevant and what is not.)
Differential attention that filters out noise is all you need :)
Differential attention that filters out noise is all you need :)