the decent models will (mostly) decide when they need to write code for problem solving themselves.
either way a reply with a bogus answer is the fault of the provider and model, not the question-asker -- if we all need to carry lexicons around to remember how to ask the black box a question we may as well just learn a programming language outright.
Yes, Sonnet 4.5 tried like 10min until it had it. Way too long though.