logoalt Hacker News

imdavidsantiagolast Wednesday at 5:21 AM3 repliesview on HN

As far as I know, no one has fully implemented full-text search directly over Anna's Archive. Technically it’s feasible with tools like Meilisearch, Elasticsearch, or Lucene, but the main challenges are:

    Converting all documents (PDFs, EPUBs, etc.) to clean plaintext.

    Indexing at scale efficiently.

    Managing potential legal issues.
Z-Library does something similar, but it’s smaller in scope and doesn't integrate AA’s full catalog.

Replies

bendangelolast Wednesday at 5:48 AM

I’ve done something like this before. Meilisearch will not be viable, because it indexes very slow and it takes up a lot of space.

In my experience only Tantivy can index this much data. Check out Lnx.

show 1 reply
_ache_last Wednesday at 11:28 AM

To manage the legal issues, you just have to put AI on the search. "AI search".

samuel_shadrachyesterday at 5:17 PM

[dead]