I'd be interested in a way to give LLMs a large set of simple tool calls (Anthropic recently had something about this, not sure if it would apply) so that they know to _never_ attempt math themselves, because that's not what they're for. Giving a model a bunch of tools for things like arithmetic, date math, and other Wolfram-style queries, and making sure it always leans on those when appropriate, would be fantastic.
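
To make that concrete, here's a rough sketch of what I mean by a pile of small deterministic tools. Everything in it is made up for illustration: the tool names (`arithmetic`, `days_between`), the `{"name": ..., "arguments": ...}` call shape, and the `dispatch` helper are assumptions, not any vendor's actual API.

```python
import ast
import operator
from datetime import date

# Hypothetical tool set: names and call format are assumptions for illustration.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.USub: operator.neg,
}


def _eval(node):
    """Evaluate a parsed expression, allowing only numbers and + - * /."""
    if (
        isinstance(node, ast.Constant)
        and isinstance(node.value, (int, float))
        and not isinstance(node.value, bool)
    ):
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
        return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
    if isinstance(node, ast.UnaryOp) and type(node.op) in _OPS:
        return _OPS[type(node.op)](_eval(node.operand))
    raise ValueError("unsupported expression")


def arithmetic(expression: str):
    """Exact arithmetic on a plain expression string, without using eval()."""
    return _eval(ast.parse(expression, mode="eval").body)


def days_between(start: str, end: str) -> int:
    """Whole days from start to end, both given as ISO dates (YYYY-MM-DD)."""
    return (date.fromisoformat(end) - date.fromisoformat(start)).days


# Registry the model's tool calls are routed through.
TOOLS = {"arithmetic": arithmetic, "days_between": days_between}


def dispatch(call: dict):
    """Run one tool call of the assumed shape {"name": ..., "arguments": {...}}."""
    return TOOLS[call["name"]](**call["arguments"])


if __name__ == "__main__":
    print(dispatch({"name": "arithmetic", "arguments": {"expression": "1234 * 5678"}}))
    print(dispatch({"name": "days_between",
                    "arguments": {"start": "2024-01-01", "end": "2024-03-01"}}))
```

The tools themselves are the easy part. Getting the model to reach for them every single time instead of guessing at the answer is the piece I don't have a good answer for, and this sketch doesn't address it.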