imdavidsantiago • last Wednesday at 5:21 AM • 3 replies

As far as I know, no one has fully implemented full-text search directly over Anna's Archive. Technically it’s feasible with tools like Meilisearch, Elasticsearch, or Lucene, but the main challenges are:

    Converting all documents (PDFs, EPUBs, etc.) to clean plaintext.

    Indexing at scale efficiently.

    Managing potential legal issues.

Z-Library does something similar, but it’s smaller in scope and doesn't integrate AA’s full catalog.
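
To make the first two challenges concrete, here is a minimal sketch using only the Python standard library. It is not how Anna's Archive or any of the engines named above work; it only illustrates the two steps on a toy scale. The names `epub_to_text`, `InvertedIndex`, and `make_demo_epub` are hypothetical helpers invented for this example, and the EPUB handling is deliberately naive (it reads the XHTML files in filename order rather than following the book's spine).

```python
import io
import re
import zipfile
from collections import defaultdict
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def epub_to_text(epub_bytes):
    """Extract rough plaintext from an EPUB, which is a ZIP of XHTML files.

    Assumption: files are read in sorted filename order, not spine order.
    """
    chunks = []
    with zipfile.ZipFile(io.BytesIO(epub_bytes)) as zf:
        for name in sorted(zf.namelist()):
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                parser = _TextExtractor()
                parser.feed(zf.read(name).decode("utf-8", errors="replace"))
                parser.close()  # flush any buffered text
                chunks.append(" ".join(parser.parts))
    # Collapse runs of whitespace into single spaces.
    return re.sub(r"\s+", " ", " ".join(chunks)).strip()


def tokenize(text):
    """Lowercase and split into ASCII alphanumeric terms (very crude)."""
    return re.findall(r"[a-z0-9]+", text.lower())


class InvertedIndex:
    """Toy in-memory index mapping each term to the set of doc ids containing it."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, text):
        for term in tokenize(text):
            self.postings[term].add(doc_id)

    def search(self, query):
        """Return sorted doc ids that contain every query term (AND semantics)."""
        terms = tokenize(query)
        if not terms:
            return []
        result = None
        for term in terms:
            docs = self.postings.get(term, set())
            result = set(docs) if result is None else result & docs
        return sorted(result)


def make_demo_epub(body_html):
    """Build a tiny EPUB-like ZIP in memory, for demonstration only."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("mimetype", "application/epub+zip")
        zf.writestr("OEBPS/ch1.xhtml", f"<html><body>{body_html}</body></html>")
    return buf.getvalue()


if __name__ == "__main__":
    index = InvertedIndex()
    index.add("demo", epub_to_text(make_demo_epub("<p>Call me Ishmael.</p>")))
    print(index.search("ishmael"))
```

A set-per-term index held in memory like this stops being practical long before archive scale, which is the reason for reaching for a dedicated engine such as the ones discussed in this thread. PDFs would also need a separate extraction path, often including OCR.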

Replies

bendangelo • last Wednesday at 5:48 AM

I’ve done something like this before. Meilisearch will not be viable, because it indexes very slowly and takes up a lot of space.

In my experience only Tantivy can index this much data. Check out Lnx.

_ache_ • last Wednesday at 11:28 AM

To manage the legal issues, you just have to put AI on the search. "AI search".

samuel_shadrach • yesterday at 5:17 PM

[dead]
