Do I have to publish my book for free because I got inspiration from 100's of other books I read during my life?
false equivalence because machines are not human beings
a lossy compression algorithm is not "inspired" when it is fed copyrighted input
Issue to me is that I or someone else bought those books. Or in case of local libraries the authors got money for my borrowing copy.
And I can not copy paste myself to discuss with thousands or millions of users at time.
To me clear solution is to make some large payment to each author of material used in traing per training of model say 10k to 100k range.
If you are plagiarizing, “for free” doesn’t even save you.
If your book reproduces something 95% verbatim, you won't even be able to publish it.
Humans are punished for plagiarism all the time. Myriad examples exist of students being disenrolled from college, professionals being fired, and personal reputations tarnished forever.
When a LLM is trained on copyright works and regurgitates these works verbatim without consent or compensation, and then sells the result for profit, there is currently no negative impact for the company selling the LLM service.