Like my grandfather used to say "They let anyone on TV now", They let anyone be called an expert now.
Love talking about 'recursive self-improvement' of a prediction engine. What about 'recursive self-degradation'?
Regression tests
Yes - how exactly are you going to conjure new data to increase the model’s real world fidelity?
The most common sense way will always need to have humans in the loop, because humans are being used as judges.
If you try and entertain the idea of a human-less loop, I landed up with something like an AI creating real world products, selling them, then tracking the usage and popularity of the product. Essentially, creating and launching a firm and product, only to update its weights?
Perhaps there are some subsets of tasks that can be regressively self improved - and for those tasks: Holy hell thats awesome!
For general tasks? How are you going to get that data validated?
They don't throw away the weights. If it gets worse just use an older model.