Isn't there a difference between distilling specific AI input/output pairs and scraping whatever random AI output you find (with unknown input)?