From my understanding, Meta's use of the pirated book was accepted as fair use and the plaintiffs admitted to no harm. In the case of pirated music and films, neither of those points are made. Copyright holders assume people who pirate would have bought the content, usually even assuming that one download is one lost sale. And I am not aware of a single case where watching or listening to pirated content was accepted as fair use.
It is interesting to follow how this plays out for Meta and how that will impact future cases.
One of the underlying issues is that punitive damages seem to be the norm in US courts.
In the UK you can only claim for the actual damages incurred, which at most will be the profit you would've made on the sale of that book. Which makes most claims for private infringement uneconomical for corporations.
The use of the pirated book is a totally separate action than acquiring the pirated book.
We consumers just need BiTorrent clients that come with LLM training code incorporated, as that transforms the downloads into fair use (according to the very expensive Meta legal team).
I don’t get that, the use of these books was instrumental and necessary for the success of the training run. The expected value of these training runs is high as the build out of 100 billion+ infrastructure demonstrates, so the book publishers should at a minimum be paid a licensing fee, a small fraction of every inference run revenue or whatever they decide. The fact that authors and publishers didn’t get any say under what conditions their intellectual property can be used is pretty outrageous.