It would be incredible for LLMs. Searching it, using it as training data, etc. Would probably have to be done in Russia or some other country that doesn't respect international copyright though.
> Would probably have to be done in Russia or some other country that doesn't respect international copyright though.
Incredible, several years of major American AI companies showing that flaunting copyright only matters if it's college kids torrenting shows or enthusiasts archiving bootlegs on whatcd, but if it's big corpos doing it it's necessary for innovation.
Yet some people still believe "it would have to be done in evil Russia".
> or some other country that doesn't respect international copyright though.
Like the US? OpenAI et al. don't give a shit.
Do you have a reason to believe this ain't already being done? I would assume that the big guys like openai are already training on basically all text in existence.