logoalt Hacker News

namlemlast Wednesday at 7:46 AM4 repliesview on HN

It would be incredible for LLMs. Searching it, using it as training data, etc. Would probably have to be done in Russia or some other country that doesn't respect international copyright though.


Replies

jxjnskkzxxhxlast Wednesday at 7:54 AM

Do you have a reason to believe this ain't already being done? I would assume that the big guys like openai are already training on basically all text in existence.

show 2 replies
andrepdlast Wednesday at 1:41 PM

> Would probably have to be done in Russia or some other country that doesn't respect international copyright though.

Incredible, several years of major American AI companies showing that flaunting copyright only matters if it's college kids torrenting shows or enthusiasts archiving bootlegs on whatcd, but if it's big corpos doing it it's necessary for innovation.

Yet some people still believe "it would have to be done in evil Russia".

show 3 replies
executesorder66last Wednesday at 8:26 AM

> or some other country that doesn't respect international copyright though.

Like the US? OpenAI et al. don't give a shit.

show 2 replies
sam_lowry_last Wednesday at 7:53 AM

LLMs already use it, dude )

show 1 reply