AI companies have been using synthetic data for ages now. The data doesn't need to yield new insights to be useful for training.