The difference in fields is key here: AI models are going to have a very different impact in fields where ground truth is available instantly (does the generated code have the expected output?) or takes years of manual verification.
(Not a binary -- ground truth is available enough for AI to be useful to lots of programmers.)
> does the generated code have the expected output?
That's many times not easy to verify at all ...