It would be incredible for LLMs. Searching it, using it as training data, etc. Would probably have t...

namlem • last Wednesday at 7:46 AM • 4 replies • view on HN

It would be incredible for LLMs. Searching it, using it as training data, etc. Would probably have to be done in Russia or some other country that doesn't respect international copyright though.

Replies

jxjnskkzxxhx • last Wednesday at 7:54 AM

Do you have a reason to believe this ain't already being done? I would assume that the big guys like openai are already training on basically all text in existence.

➕ show 2 replies

andrepd • last Wednesday at 1:41 PM

> Would probably have to be done in Russia or some other country that doesn't respect international copyright though.

Incredible, several years of major American AI companies showing that flaunting copyright only matters if it's college kids torrenting shows or enthusiasts archiving bootlegs on whatcd, but if it's big corpos doing it it's necessary for innovation.

Yet some people still believe "it would have to be done in evil Russia".

➕ show 3 replies

executesorder66 • last Wednesday at 8:26 AM

> or some other country that doesn't respect international copyright though.

Like the US? OpenAI et al. don't give a shit.

➕ show 2 replies

sam_lowry_ • last Wednesday at 7:53 AM

LLMs already use it, dude )

➕ show 1 reply

alt Hacker News

Replies