You're playing word games to justify something which is clearly ethically wrong.
You can't re-stream free over-the-air network TV.
That one company with the datacenter full of TV tuners tried and was sued out of existence.
Over the air TV also isn’t public domain. It’s licensed to a station for broadcast. The output of an LLM has been deemed ineligible for copyright. Until you square that pickle your circle isn’t circling.
Why is the ethical line specifically on model distillation for you?
Was it ethical for Anthropic/OpenAI to train their models by gobbling a treasure trove of copyrighted material?
Free over-the-air network TV is (generally) copyrighted.
The output of LLMs cannot be copyrighted. This isn't a semantic game; it's literally the case that Anthropic cannot seek relief for people duplicating the output of an LLM.
Using a bunch of nonsensical/irrelevant analogies to somehow make a point seems worse than these “word games”? What does streaming copyrighted content have to do with LLM outputs (which are public domain)?
> clearly ethically wrong
Ethics are subjective. That’s why we have courts judge based on the law and not ethics
If I understand your argument it's ethically ok to destill huge swathes of copyrighted work into a model without compensation, but then it is ethically wrong to use that model without compensation (well actually reduced pricing)?
I don't get the moral framework that you're applying. Could you elaborate?