You're not making any sense. The best guess will often be a refusal if they see enough refusals in the training data, so of course it is down to training.
And I literally saw the effect of this first-hand: the project I worked on was actively part of training this behaviour into a major model.
As for your assertion that they don't learn the more complex dynamics, that was trite, and already untrue, several years ago.