literalAardvark • today at 1:30 PM • 4 replies

All of it belongs to Anna's Archive. They may not have the rights to have it, but the data is there no less.

They're asking for support to cover archival and bandwidth.

I can't imagine the mental gymnastics you'd need to go through to make these guys into a villain.

Replies

If you genuinely can't imagine how anyone would object to somebody taking other people's creative output and distributing it for free against their wishes then you probably need to work on your imagination a little bit.

notachatbot123 • today at 1:37 PM

Anna's Archive themselves scraped together all this data from other sources. See the notes of origin, for example: often they are from zlib or libgen, et cetera.

plaidfuji • today at 1:55 PM

It’s the exact same mental gymnastics that cause people to accuse model providers of large-scale plagiarism.

That is to say, not that much gymnastics. Like a cartwheel at most.

petcat • today at 1:36 PM

I don't really care about Anna's Archive, but let's not make them out to be some kind of Robin Hood story.

They have (illegally) scraped and re-hosted mountains of proprietary data and are now deliberately prompt-injecting unwitting LLM users in order to steal money from them too.
