the entire internet, books, news, regardless of license.
The companies using distillation are still training on all that data too, aren't they?
The companies using distillation are still training on all that data too, aren't they?