Yes - how exactly are you going to conjure new data to increase the model’s real-world fidelity?
The most common-sense approach will always need humans in the loop, because humans are the judges.
When I try to entertain the idea of a human-less loop, I end up with something like an AI creating real-world products, selling them, then tracking their usage and popularity. Essentially, creating and launching a firm and a product, only to update its weights?
Perhaps there are some subsets of tasks that can be recursively self-improved - and for those tasks: holy hell, that's awesome!
For general tasks? How are you going to get that data validated?