Computer Science Colloquium
Tuesday, March 25th,  2003
4:15 PM
B17 Upson Hall

Brian Cooper
Stanford University

Data preservation in self-supervising networks

 

An ever increasing amount of information is being stored digitally, and people are becoming more and more dependent on it. However, very little is understood about how to preserve digital information for long time periods. Media failures, natural disasters and bankruptcy all conspire to cause information loss over decades or centuries. I will talk about a distributed digital archive I have developed to address this problem. This system is based on a community of archives that trade data under the principle of "I'll preserve your data if you preserve mine." These trades result in an archive network that self-organizes into a reliable system, self-tunes to improve efficiency, and self-heals after a failure. I'll discuss the architecture of the system, and techniques for making trades to achieve the highest reliability. I'll also sketch work I have done on information discovery through peer-to-peer searches of the archive network.