Do you have a reason to believe this ain't already being done? I would assume that the big guys...

jxjnskkzxxhx • last Wednesday at 7:54 AM • 2 replies • view on HN

Do you have a reason to believe this ain't already being done? I would assume that the big guys like openai are already training on basically all text in existence.

Replies

IlikeKitties • last Wednesday at 8:25 AM

In fact, facebook torrented annas archive and got busted for it, because of course they did:

https://torrentfreak.com/meta-torrented-over-81-tb-of-data-t...

➕ show 1 reply

ar_lan • last Wednesday at 5:00 PM

Wasn't this confirmed what Meta does?

https://www.forbes.com/sites/danpontefract/2025/03/25/author...

alt Hacker News

Replies