Most likely, models will "box" questionable stuff for radiologist review, and the boxing threshold will probably be set low enough that radiologists stay sharp (though we probably won't do this at first, so skills may atrophy for a bit).
This is also a free source of training data over time, so the market incentives are there.
The reverse is far more likely: people care about this right now, but once the model-agreement rate reaches 99%, the obvious thing to do will be to save money and raise the threshold so fewer cases get sent for review.
There isn't a human in the loop if the loop determines whether to involve a human.