Hacker News

rtbruhan00 yesterday at 11:47 AM, 3 replies

It's generous of them to ask for permission.


Replies

gizajob yesterday at 2:38 PM

They wanted access to a faster pipe to slurp 500 terabytes, and that access comes at a cost. It wasn’t about permission.

And yeah, they should be sued into the next century for copyright infringement. A $4 trillion company illegally downloading the entire corpus of published literature for reuse is clearly infringement. It’s an absurdity to say that it’s fair use just to look for statistical correlations when training LLMs that will be used to render human authors worthless. One or two books is fair use. Every single book published is not.

breakingcups yesterday at 2:39 PM

It wasn't about permission, it was about high-speed access. They needed Anna's Archive to facilitate that for them because scraping was too slow. It's incredible that they were allowed to continue even after Anna's Archive themselves explicitly pointed out that the material was acquired illegally.

kristofferR yesterday at 3:40 PM

It's not permission, it's a service they offer:

https://annas-archive.li/llm