Department of Computer Science 


CS 6464: Spring 2009

Advanced Distributed Storage Systems

   
    
* Home
* Information
* Schedule
* Labs
* Project
 

Schedule for course.

Note that papers are subject to change

#

Presenter

Date

Topic

Required Reading

0.

Hakim
PPTX PDF
01/20 Introduction None

1.

Hakim
PPTX PDF
01/22 Cloud Computing Cumulus: Filesystem Backup to the Cloud. Michael Vrable, Stefan Savage, and Geoffrey M. Voelker. Appears in Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST), February 2009

2.

Hakim
PPTX PDF
01/27 Network File Systems Design and Implementation of the Sun Network File System. Russel Sandberg, David Goldberg, Steve Kleiman, Dan Walsh, and Bob Lyon. Appears in Proceedings of the 7th USENIX Annual Technical Conference, 1985

Additional Reading
Implementing Remote Procedure Calls. Andrew D. Birrell and Bruce Jay Nelson. Appears in ACM Transaction on Computer Systems (TOCS), 1984

Reference reading
RFC 1813 (NFS v3), RFC 1832 (XDR), RFC 1831 (RPC)

3.

Hakim
PDF
01/29 Network File Systems A Toolkit for User-Level File Systems. David Mazières. Appears in Proceedings of the USENIX Annual Technical Conference, June 2001

Due: Lab 0

4.

Hakim
("Guest" speaker,
Werner Vogels)
02/03 Consistency, Availability, Partition Tolerance Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services, Seth Gilbert and Nancy Lynch. Appears in ACM SIGACT News, 2002.

Eventually Consistent, Werner Vogels. Appears in Communications of ACM (CACM), January 2009.

5.

Hakim
PPTX PDF
02/05 Consistency, Availability, Partition Tolerance Smoke and Mirrors: Reflecting Files at a Geographically Remote Location Without Loss of Performance, Hakim Weatherspoon, Lakshmi Ganesh, Tudor Marian, Mahesh Balakrishnan, Ken Birman. Appears in Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST), February 2009.

Additional Reading
Message Logging: Pessimistic, Optimistic, Causal, and Optimal, Lorenzo Alvisi and Keith Marzullo/ Appears in IEEE Transactions on Software Engineering, February, 1998.

6.

Guy
PPTX PDF
02/10 Wide-area Peer to Peer Storage Wide-area cooperative storage with CFS, Frank Dabek, M. Frans Kaashoek, David Karger, Robert Morris, and Ion Stoica. Appears in Proceedings of 18th ACM SIGOPS Symposium on Operating Principles (SOSP), October, 2001.

Additional Reading
Chord: A Peer-to-Peer Lookup Service for Internet Applications, Ion Stoica, Robert Morris, David Karger, Frans Kaashoek, Hari Balakrishnan. Appears in Proceedings of the ACM SIGCOMM Conference, September, 2001.

Due: Lab 1

7.

Hussam
PPT PDF
02/12 Datacenter Peer to Peer Storage Dynamo: Amazon's Highly Available Key-value Store, Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner Vogels. Appears in Proceedings of the 21st ACM Symposium on Operating Principles (SOSP), October, 2007.

Additional Reading
Manageability, availability and performance in Porcupine: a highly scalable, cluster-based mail service, Yasushi Saito, Brian N. Bershad, and Henr y M. Levy. Appears in Proceedings of the 17th ACM Symposium on Operating Principles (SOSP), December, 1999.

    03/16  


Due: Lab 2 -- Design Doc

8.

Lonnie
PDF
02/17   Secure untrusted data repository (SUNDR), Jinyuan Li, Maxwell Krohn, David Mazières, and Dennis Shasha. Appears in Proceedings of the 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI), December 2004.

Additional Reading
Fast and secure distributed read-only filesystem (aka SFSRO), Kevin Fu, M. Frans Kaashoek, and David Mazières. Appears in Proceedings of the 4th USENIX Symposium on Operating Systems Design and Implementation (OSDI), October, 2000.

9.

Hussam
PPT PDF
02/19   Attested append-only memory: making adversaries stick to their word, Byong-Gon Chun, Petros Maniatis, Scott Shenker, and John Kubiatowicz. Appears in Proceedings of the 21st ACM Symposium on Operating Principles (SOSP), October, 2007.

Additional Reading
Simulating authenticated broadcasts to derive simple fault-tolerant algorithms , T. K. Srikanth and Sam Toueg. Appears in Distributed ComputingVol 2, No2, June, 1987.

Due: Lab 2 -- A

10.

Rakesh
PPTX PDF
02/24 Weakly Consistent Storage Systems (i.e. Eventual Consistency) Managing Update Conflicts in Bayou, a Weakly Connected Replicated Storage System, Douglas B. Terry, Marvin M. Theimer, Karin Petersen, Alan J. Demers, Mike J. Spreitzer and Carl H. Hauser. Appears in Proceedings of the 15th ACM Symposium on Operating Systems Principles (SOSP), December, 1995.

Additional Reading
Epidemic algorithms for replicated database maintenance, Alan Demers, Dan Greene, Carl Hauser, Wes Irish, John Larson, Scott Shenker, Howard Sturgis, Dan Swinehart, and Doug Terry. Appears in Proceedings of the 6th ACM Symposium on Principles of Distributed Computing (PODC), 1987.

11.

Darshan
PPTX PDF
02/26 Weakly Consistent Storage Systems (i.e. Eventual Consistency) Cimbiosys: A Platform for Content-based Partial Replication, Venugopalan Ramasubramanian, Thomas L. Rodeheffer, and Douglas B. Terry, Meg Walraed-Sullivan, Ted Wobber and Cathy Marshall, Amin Vahdat. Appears in Proceedings of the 6th USENIX Symposium on Networked Systems Design and Implementation (NSDI), April, 2009.

Additional Reading
PRACTI Replication, Nalini Belaramani, Mike Dahlin, Lei Gao, Amol Nayate, Arun Venkataramani, Praveen Yalagandula, Jiandan Zheng. Appears in Proceedings of the 3rd USENIX Symposium on Networked Systems Design and Implementation (NSDI), April, 2006.

Due: Lab 2 -- B

12.

Rahul 03/03 Replication and Fault Tolerance Replication in the Harp File System, Barbara Liskov, Sanjay Ghemawat, Robert Gruber, Paul Johnson, Liuba Shrira, Michael Williams. Appears in Proceedings of the 13th ACM Symposium on Operating Systems Principles (SOSP), October, 1991.

Additional Reading
Implementing fault-tolerant services using the state machine approach: A tutorial. Fred B. Schneider. Appears in ACM Computing Surveys 22, 4 (December 1990), 299--319.

13.

Yeounoh
PPT PDF
03/05 Replication, Fault Tolerance, and Maliciousness Practical Byzantine Fault Tolerance (aka PBFT), Miguel Castro and Barbara Liskov. Appears in Proceedings of the 3rd USENIX Symposium on Operating Systems Design and Implementation (OSDI), February, 1999.

Additional Reading
The Byzantine Generals Problem, Leslie Lamport, Robert Shostak, Marshall Pease. Appears in ACM Transactions on Programming Languages and Systems (TOPLAS), Vol 4, No 3, pages 382-401, July, 1982.

14.

Michael S.
PPT PDF
03/10 Scalability and Fault Tolerance The Google File System, Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. Appears in Proceedings of the 19th ACM Symposium on Operating System Principles (SOSP), October, 2003.

Additional Reading
Serverless Network File Systems (aka xFS), Thomas Anderson, Michael Dahlin, Jeanna Neefe, David Patterson, Drew Roselli, and Randolph Wang. Appears in Proceedings of the 15th ACM Symposium on Operating System Principles (SOSP), December, 1995.

Due: Initial Project Proposal

15.

Petko
PPT PDF
03/12 Scalability and Fault Tolerance The Chubby lock service for loosely-coupled distributed systems, Mike Burrows. Appears in Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI), November, 2006.

Paxos Made Simple, Leslie Lamport. Appears in ACM SIGACT News (Distributed Computing Column), Vol. 32, No. 4 (December 2001), pages 51-58.

Alternatively, Paxos Made Practical by David Mazières

    03/17  

Spring Break

    03/19  

Spring Break

16.

Robert B.
PDF
03/24 Scalability Bigtable: A Distributed Storage System for Structured Data, Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber. Appears in Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI), November, 2006.

Additonal Reading
Boxwood: Abstractions as the foundation for storage infrastructureJohn MacCormick, Nick Murphy, Marc Najork, Chandramohan A. Thekkath, and Lidong Zhou. Appears in Proceedings of the 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI), December, 2004.

17.

Avinash
PPTX PDF
03/26 Scalability FAB: Building Distributed Enterprise Disk Arrays from Commodity Components, Yasushi Saito, Svend Frĝlund, Alistair Veitch, Arif Merchant, Susan Spence. Appears in Proceedings of the 13th ACM International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS), October, 2004.

Migrating enterprise storage to SSDs: analysis of tradeoffs, Dushyanth Narayanan, Eno Thereska, Austin Donnelly, Sameh Elnikety, Antony Rowstron. Appears in Proceedings of the 4th ACM European Conference on Computer Systems (EuroSys), March/April, 2009.

18.

Aaron
PPTX PDF
03/31   MapReduce: Simplified Data Processing on Large Clusters, Jeffrey Dean and Sanjay Ghemawat. Appears in Proceedings of the 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI), December, 2004.

Active disks for large-scale data processing, Erik Riedel, Christos Faloutsos, Garth A. Gibson, and David Nagle. Appears in IEEE Computer, pages 68-74, June 2001.

Due: Project Midterm Survey Paper
(i.e. motivation and related work, and implementation and evaluation plan for project paper)

19.

Mike G.
PDF LATEX.TGZ
04/02 Energy-Based Storage Systems DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language, Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Ulfar Erlingsson, Pradeep Kumar Gunda, and Jon Currey. Appears in Proceedings of the 8th USENIX Symposium on Operating Systems Design and Implementation (OSDI), December, 2008.

Additional Reading
Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks, Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis Fetterly. Appears in Proceedings of the 2nd ACM European Conference on Computer Systems (EuroSys), March, 2007.

20.

Guy
PPTX PDF
04/07 Energy-Based Storage Systems Write Off-loading: Practical power management for enterprise storage, Dushyanth Narayanan, Austin Donnelly and Antony Rowstron. Appears in Proceedings of the 6th USENIX Conference on File and Storage Technologies (FAST), February, 2008.

Hibernator: helping disk arrays sleep through the winter, Qingbo Zhu, Zhifeng Chen, Lin Tan, Yuanyuan Zhou, Kimberly Keeton, John Wilkes. Appears in Proceedings of the 20th ACM Symposium on Operating System Principles (SOSP), October, 2005.

Additional Reading
Optimizing Power Consumption in Large Scale Storage Systems, Lakshmi Ganesh, Hakim Weatherspoon, Mahesh Balakrishnan, Ken Birman. Appears in Proceedings of the 11th USENIX Workshop on Hot Topics in Operating Systems (HotOS), May 2007.

21.

Yeounoh
PPT PDF
04/09 Wide-area Peer-to-Peer Storage Systems Pond: the OceanStore Prototype, Sean Rhea, Patrick Eaton, Dennis Geels, Hakim Weatherspoon, Ben Zhao, and John Kubiatowicz. Appears in Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST), March 2003

Additional Reading
OceanStore: An Architecture for Global-Scale Persistent Storage, John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels, Ramakrishna Gummadi, Sean Rhea, Hakim Weatherspoon, Westley Weimer, Chris Wells, and Ben Zhao. Appears in Proceedings of the Ninth international Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), November 2000

22.

Robert B.
PDF
04/14 Wide-area Peer-to-Peer, Content-Addressable, Storage Systems Antiquity: Exploiting a Secure Log for Wide-Area Distributed Storage, Hakim Weatherspoon, Patrick Eaton, Byung-Gon Chun, and John Kubiatowicz. Appears in Proceedings of the 2nd ACM European Conference on Computer Systems (Eurosys), March 2007.

Additional Reading
OpenDHT: A Public DHT Service and Its Uses, Sean Rhea, Brighten Godfrey, Brad Karp, John Kubiatowicz,Sylvia Ratnasamy, Scott Shenker, Ion Stoica, and Harlan Yu. Appears in Proceedings of the ACM SIGCOMM Conference, August, 2005.

23.

Rakesh
PPTX PDF
04/16 Wide-area file systems A Low-bandwidth Network File System (aka LBFS), Athicha Muthitacharoen, Benjie Chen, and David Mazières. Appears in Proceedings of the 18th ACM Symposium on Operating System Principles (SOSP), October, 2001.

Shark: Scaling file servers via cooperative caching, Siddhartha Annapureddy, Michael J. Freedman, David Mazières. Appears in Proceedings of the 2nd USENIX Symposium on Networked Systems Design and Implementation (NSDI), May, 2005.

24.

Llanos
PPT PDF
04/21 Scalable File Systems Frangipani: a scalable distributed file system, Chandramohan A. Thekkath,Timothy Mann, and Edward K. Lee. Appears in Proceedings of the 16th ACM Symposium on Operating Principles (SOSP), September, 1997.

Additional Reading
Petal: Distributed virtual disks, Edward K. Lee and Chandramohan A. Thekkath. Appears in Proceedings of the 7th ACM International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS), October, 1996.

25.

Avinash 04/23 Distributed RAID The Zebra striped network file system, John H. Hartman and John K. Ousterhout. Appears in Proceedings of the 14th ACM Symposium on Operating Principles (SOSP), December, 1993.

Additional Reading
The HP AutoRAID hierarchical storage system, John Wilkes, Richard Gelding, Carl Staelin, and Tim Sullivan. Appears in Proceedings of the 15th ACM Symposium on Operating Principles (SOSP), December, 1995

26.

Hakim
PPT PDF
04/28   Efficient Replica Maintenance for Distributed Storage Systems, Byung-Gon Chun, Frank Dabek, Andreas Haeberlen, Emil Sit, Hakim Weatherspoon, M. Frans Kaashoek, John Kubiatowicz, and Robert Morris. Appears in Proceedings of the 3rd USENIX Symposium on Networked Systems Design and Implementation (NSDI), May 2006.

Unreliable Failure Detectors for Reliable Distributed Systems , Tushar Deepak Chandra and Sam Toueg. Appears in Journal of the ACM, 43:2, March 1996, 225-267.

27.

  04/30 Final Final Presentations/Demo's

Due: Final Project Demo