AI needs evolutionary pressures beyond a simple reward algo. IRL is extremely data rich and nuanced. Current learning is just ingesting semantics and that's it.
There's the beginnings of it with things like icot to force it to internalise basic reasoning but I have a few ideas for more things and I'm sure actual ML researchers do, too.