logoalt Hacker News

podgorniylast Wednesday at 8:08 AM2 repliesview on HN

There is a search solution for zipped fb2 files. Not exactly what you need, but it has potential.

The project has similar story to Anna's archive. There is 0.5 TB of archived books, and the project creates index of all the books with text, title and aruthor search capabilities, gives html UI for search and reading. On weak machine it takes about 2 hours to build that index.

So if you have zipped archives of fb2, you can use the project to create web UI with search for those files. Without need of enough space to unpack all the files.

You'll have to translate some russian though to get instructions on how to set it up.

https://gitlab.com/opennota/fb2index/-/blob/master/README.ru...


Replies

tanguslast Wednesday at 8:34 AM

But fb2 files are marked up text, which is (relatively) trivial to index. The bulk of Anna's Archive's books are made of from scanned images.

jmb99last Wednesday at 7:29 PM

Worth mentioning that 0.5TB is tiny compared to Anna’s, which currently sits around 1.1PB.