logoalt Hacker News

nine_kyesterday at 12:42 AM1 replyview on HN

Decompress, scan as you go, discard. Having to read a few hundred GB and scan a terabyte is a nuisance. Not having to write a terabyte is priceless.


Replies

mikepurvisyesterday at 12:53 AM

Could also maintain an in-memory index so that you can go back after the fact and extract individual files.

show 1 reply