logoalt Hacker News

sdellisyesterday at 4:16 PM2 repliesview on HN

According to the Wikipedia page, it seems that copies of the archive are stored around the world.

LOCKSS is a decentralized strategy for preservation which includes archival copies at remote sites. It has been in use for a very long time. I feel like preservation via IPFS would introduce quite a bit of risk to the goal.


Replies

bityardyesterday at 5:02 PM

Yeah, there is some popular misunderstanding about what IPFS is... a lot of people seem to think its essentially free or subsidized distributed cloud storage. But the more you dig into it, the more you realize it's just a fairly inefficient caching system.

LOCKSS looks interesting but it seems like it's exclusively for libraries.

badlibrarianyesterday at 5:40 PM

I can find no current public document from the Internet Archive explaining what is backed up, where, or at what redundancy level.

From a 2016 blog post:

"Do you do backups too, for example to guard against corrupt data getting mirrored across both copies, or accidental deletion?"

John Gonzalez, the author and IA infrastructure lead, replied:

"We have done experiments to confirm that we can back up large portions of our corpus... but this is not a regular practice for us at this time."

https://blog.archive.org/2016/10/25/20000-hard-drives-on-a-m...