At some point, there will be a successful copyright infringement suit against an LLM user who redistributes infringing output generated by an LLM. It could be the NYTimes suit, or it could be another, but it's coming — after which the industry will face a Napster-style reckoning.
What comes next? Perhaps it won't be that hard to assemble a proprietary licensed corpus and get decent performance out of it. Look at all the people already willing to license their voices.
OpenAI's valuation is more than basically all traditional media companies combined. Nvidia could buy the NYTimes with a month's worth of profits. The top 8 companies in the S&P 500 all benefit more from LLMs being successful than strict copyright enforcement. Congress has very broad power over copyright law. If a suit is successful there is a lot of money and power to be deployed to change copyright law.
You are comparing the fight between a p2p program and the entire music industry with the fight between the entire LLM industry and a newspaper. Notice how the order seems inconsistent.
And what happened after Napster? Filesharing totally stopped, right?
With the chinese in the mix it wont stop ai. It probably will change Copyright.
We will see such attempts first against weaker target. Users who are not having the enterprise indemnifications.
The law exists to protect the elite and punish the underclass. We’re not in a Hollywood movie. Nothing will happen.
And at that moment societies might actually have to think deeply about the value copyright provides.
Because having access to the condensed knowledge of humanity might be more valuable for society then having access to Lars Ulrich's shitty drumming.
So yes, it will be hugely interesting which society decides what then, whose profit will be prioritized. And societies won't easily find good answers.