Good method for generating synthetic training data, but it only works in domains where validation can be scaled up.
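
To make the scaling point concrete, here is a minimal sketch of a generate-then-validate loop. Everything in it is an assumption for illustration and not the method under review: `noisy_generator` is a hypothetical stand-in for a model that emits addition problems with occasional wrong answers, and `validate` is a toy automatic checker. The approach stays cheap only because the check can run on every candidate without a human in the loop.

```python
import random
from typing import Callable, Iterable, Iterator, List, Tuple

# A sample is (question, proposed answer). Hypothetical format for this sketch.
Sample = Tuple[str, int]


def noisy_generator(rng: random.Random) -> Iterator[Sample]:
    """Hypothetical stand-in for a model: emits addition problems,
    with roughly one in four answers off by one."""
    while True:
        a, b = rng.randint(0, 99), rng.randint(0, 99)
        error = rng.choice([0, 0, 0, 1])
        yield f"{a} + {b}", a + b + error


def validate(sample: Sample) -> bool:
    """Automatic validator: recompute the sum and compare with the answer."""
    question, answer = sample
    left, right = question.split(" + ")
    return int(left) + int(right) == answer


def build_dataset(
    candidates: Iterable[Sample],
    validator: Callable[[Sample], bool],
    size: int,
) -> List[Sample]:
    """Keep only candidates that pass validation until `size` are collected."""
    kept: List[Sample] = []
    for sample in candidates:
        if validator(sample):
            kept.append(sample)
            if len(kept) == size:
                break
    return kept


dataset = build_dataset(noisy_generator(random.Random(0)), validate, size=100)
```

In a domain with no programmatic equivalent of `validate`, every candidate would need manual review, and the filtering step becomes the bottleneck.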