My UW
|
UW Search
Computer Science Home Page
> ~dusseau
A. Arpaci-Dusseau Home
CS739 Home
Reading List
Schedule
Questions
C.S. Dept. Home Page
|
 |

|
CS739 Spring 2008: Reading List
Distributed Operating Systems
- Survey : Distributed
Operating Systems
Andrew S. Tanenbaum and Robbert Van Renesse
ACM Computing Surveys, Volume 17, Issue 4 (December 1985)
- Sprite vs. Amoeba :
A Comparison of Two Distributed Systems: Amoeba and Sprite
Fred Douglis, M. Frans Kaashoek, John K. Ousterhout, Andrew S. Tanenbaum.
Computing Systems, Vol. 4, No. 3, pp. 353-384, December 1991.
Networked File Systems
- NFS
- State : The Role
of Distributed State
John Ousterhout
1991
- NFSv3 : NFS Version 3: Design and Implementation
Brian Pawlowski and Chet Juszczak and Peter Staubach and Carl Smith
and Diane Lebel and Dave Hitz
In USENIX Summer 1994
- NFSv4 : The NFS Version 4 Protocol
Brian Pawlowski and Spencer Shepler and Carl Beame and Brent Callaghan
and Michael Eisler and David Noveck and David Robinson and Robert
Thurlow
Proceedings of the 2nd international system administration and networking conference (SANE2000)",
- NFS Background:
Design and Implementation of the Sun Network Filesystem
Sandberg, R., Goldberg, D., Kleiman, S., Walsh, D., and Lyon, B.
Proceedings of the Summer 1985 USENIX Conference, Portland OR, June
1985, pp. 119-130.
- SpriteFS Optional: Caching in
the Sprite network file system
Michael N. Nelson, Brent B. Welch, John K. Ousterhout
ACM Transactions on Computer Systems (TOCS) Volume 6 , Issue 1
(February 1988) Pages: 134 - 154
- Coda : Disconnected Operation in the Coda File System
James J. Kistler, M. Satyanarayanan
13th Symposium on Operating Systems Principles, Asilomar, California, pp. 213-225. October 1991.
- AFS Background : Scale and Performance in a Distributed File System
Howard, J.H., Kazar, M.L., Menees, S.G., Nichols, D.A., Satyanarayanan, M., Sidebotham, R.N., and West, M.J.
ACM Transactions on Computer Systems, Vol. 6, No. 1, February 1988, pp. 51-81.
- LBFS : A Low-Bandwidth Network
File System
Athicha Muthitacharoen, Benjie Chen (MIT), David Mazieres
(NYU), SOSP'01
Theory: Time and Order
- Time and Order
Analysis of Distributed Systems
- Centera : Deconstructing Commodity Storage Clusters
Haryadi Gunawi, Nitin Agrawal, Andrea Arpaci-Dusseau, Remzi
Arpaci-Dusseau
ISCA'05
- Group Presentation
- Black-Box : Performance Debugging for
Distributed Systems of Black Boxes
Marcos K. Aguilera, Jeffrey C. Mogul, Janet L. Wiener,
Patrick Reynolds, Athicha Muthitacharoen
(HP Labs, Duke, and MIT), SOSP'03
- Paths : Path-Based Failure and Evolution Management
Mike Y. Chen, University of California, Berkeley; Anthony
Accardi, Tellme; Emre Kiciman, Stanford University; Dave
Patterson, University of California, Berkeley; Armando Fox,
Stanford University; Eric Brewer, University of California,
Berkeley, NSDI'04
- Pip: Pip:
Detecting the Unexpected in Distributed Systems
Patrick
Reynolds, Janet L. Wiener, Jeffrey C. Mogul, Mehul A. Shah, Charles
Killian, and Amin Vahdat
Proceedings of the 3rd ACM/USENIX Symposium on Networked Systems Design and Implementation (NSDI), San Jose, CA,
May 2006.
- X-Trace : X-Trace: A
Pervasive Network Tracing Framework
Rodrigo Fonseca, George Porter, Randy H. Katz, Scott Shenker, and Ion
Stoica
4th USENIX Symposium on Networked Systems Design & Implementation
(NSDI'07)
Programming Environments
- DSM: Towards Transparent and Efficient
Software Distributed Shared Memory
D.J. Scales and K. Gharachorloo
16th Symposium on Operating
Systems Principles, Saint Malo, France, October 1997, pp. 157-169.
- MapReduce : MapReduce: Simplified Data Processing on Large Clusters
Jeffrey Dean and Sanjay Ghemawat
OSDI'04
- Group Presentation
Process Migration
- Group Presentation
- V Migration : Preemptable Remote
Execution Facility for the V-System
M. Theimer, K. Lantz, and D. Cheriton
10th Symposium on Operating
Systems Principles, Orcas Island, WA, December 1985, pp. 2-12.
- Zap :The Design and Implementation of Zap: A System for Migrating Computing Environments
Steven Osman, Dinesh Subhraveti, Gong Su, and Jason Nieh, Columbia
University,
OSDI'04
-
VMmigration : Live
Migration of Virtual Machines
Christopher Clark, Keir Fraser, and Steven Hand, University of
Cambridge Computer Laboratory; Jacob Gorm Hansen and Eric Jul,
University of Copenhagen; Christian Limpach, Ian Pratt, and Andrew
Warfield, University of Cambridge
Symposium on Networked Systems Design and Implementation
(NSDI'05), May 2005
-
Policy : Exploiting
Process Lifetime Distributions for Dynamic Load Balancing
Mor Harchol-Balter and Allen Downey.
Proceedings of ACM Sigmetrics '96 Conference on Measurement and
Modeling
of Computer Systems , (SIGMETRICS 96), May 23-26 1996, Philadelphia,
PA.
- SpriteMigration : Transparent
Process Migration: Design Alternatives and the Sprite
Implementation
Fred Douglis and John K. Ousterhout
Software - Practice and Experience, Volume 21, Number 8, 1991,
Pages 757-785.
Specialized Distributed Services
- Group Presentation
- Grapevine :
Grapevine: An Exercise in Distributed Computing
Andrew D. Birrell,
Roy Levin,
Roger M. Needham,
Michael D. Schroeder,
Communications of the ACM,
25
4,
April 1982,
pp. 260-274.
- TACC : Cluster-Based Scalable Network Services
A. Fox, S. Gribble, Y. Chawathe and E. A. Brewer.
Proceedings of SOSP '97, St. Malo, France, October 1997.
- NOWSort:High-Performance Sorting on Networks of Workstations
Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, David E. Culler,
Joseph M. Hellerstein, David A. Patterson.
SIGMOD '97, Tucson, Arizona, May, 1997.
- Porcupine: Manageability, Availability and Performance in
Porcupine: A Highly Scalable Internet Mail Service
Yasushi Saito, Brian Bershad, and Hank Levy
17th ACM Symposium on
Operating Systems Principles, Dec 1999, Kiawah Island Resort
Cluster-Based Distributed File Systems ( Project Related)
- GoogleFS :
The Google File System
Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung
SOSP'03
Theory: Agreement
- Byzantine
-
FailStop : Byzantine generals in action:
Implementing fail-stop processors.
Fred B. Schneider
TOCS 2, 2 (May 1984), 145:154
Recovery
- Microreboot : MicrorebootA Technique for Cheap Recovery
George Candea, Shinichi Kawamoto, Yuichi Fujiki, Greg Friedman,
and Armando Fox, Stanford University, OSDI'04
P2P Systems
- CFS : Wide-Area Cooperative
Storage with CFS
Frank Dabek, M. Frans Kaashoek, David Karger, Robert Morris (MIT),
Ion Stoica (UC Berkeley), SOSP'01
- Dynamo : Dynamo: Amazon's Highly Available Key-Value
Store
Giuseppe DeCandia, Deniz Hastorun, Madan Jampani,
Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swami
Sivasubramanian, Peter Vosshall and Werner Vogels
Proceedings of the 21st ACM Symposium on Operating Systems
Principles, Stevenson, WA, October 2007.
-
Pangaea : Taming Aggressive Replication in the Pangaea Wide-Area File System
Yasushi Saito, Christos Karamanolis, Magnus Karlsson, and Mallik
Mahalingam, HP Labs, OSDI'02
- LOCKSS : Preserving Peer Replicas By Rate-Limited Sampled Voting
Petros Maniatis, Mema Roussopoulos, TJ Giuli, David
S. H. Rosenthal, Mary Baker, Yanto Muliadi
SOSP'03
-
SUNDR : Secure Untrusted Data Repository (SUNDR)
Jinyuan Li, Maxwell Krohn, David Mazières, and Dennis Shasha, New York University, OSDI'04
- Speculator : Speculative execution in a distributed file system
Edmund B. Nightingale, Peter M. Chen, Jason Flinn
Proceedings of the twentieth ACM symposium on Operating systems
principles (SOSP'05), pages 191 - 205.
Cluster-Based Distributed File Systems
Petal + Frangipani
Measurement : An
Analysis of Internet Content Delivery Systems
Stefan Saroiu, Krishna P. Gummadi, Richard J. Dunn, Steven D. Gribble,
and Henry M. Levy, (University of Washington), OSDI'02
Slides
- Additional Related Papers (Not Required)
- Measurement, Modeling, and
Analysis of a Peer-to-Peer File-Sharing Workload
Krishna P. Gummadi, Richard J. Dunn, Stefan Saroiu, Steven D. Gribble,
Henry M. Levy, John Zahorjan (University of Washington), SOSP'03
- Measurement and Analysis of Spyware in a University Environment
Stefan Saroiu, Steven D. Gribble, and Henry M. Levy, University of
Washington, NSDI'04
Additional Papers
- Mistakes: Understanding and Dealing with Operator Mistakes in Internet Services
Kiran Nagaraja, Fábio Oliveira, Ricardo Bianchini, Richard P. Martin, and Thu D. Nguyen, Rutgers University, OSDI'04
-
Boxwood : Boxwood: Abstractions as the Foundation for Storage Infrastructure
John MacCormick, Nick Murphy, Marc Najork, Chandramohan A. Thekkath, and Lidong Zhou, Microsoft Research Silicon Valley, OSDI'04
- Sensor : TAG: a Tiny AGgregation Service for Ad-Hoc Sensor Networks.
Samuel Madden, Michael Franklin, Joseph Hellerstein, Wei Hong.
In Proceedings of OSDI, 2002.
-
FUSE: Lightweight Guaranteed Distributed Failure Notification
John Dunagan, Microsoft Research; , Nicholas
J. A. Harvey, Massachusetts Institute of Technology; Michael
B. Jones, Microsoft Research; Dejan Kostic, Duke
University; Marvin Theimer and Alec Wolman, Microsoft
Research
OSDI'04
-
Pastiche: Making Backup Cheap and Easy
Landon P. Cox, Christopher D. Murray, and Brian D. Noble,
University of Michigan, OSDI'02
-
FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment
Atul Adya, William J. Bolosky, Miguel Castro, Gerald Cermak, Ronnie
Chaiken, John R. Douceur, Jon Howell, Jacob R. Lorch, Marvin Theimer,
and Roger P. Wattenhofer, Microsoft Research, OSDI'02
-
Ivy: A Read/Write Peer-to-Peer File System
Athicha Muthitacharoen, Robert Morris, Thomer M. Gil, and Benjie Chen,
Massachusetts Institute of Technology, OSDI'02
-
Paxos
Made Simple
Leslie Lamport
November 2001
- The Part-Time Parliament
Leslie Lamport
ACM Transactions on Computer Systems, Vol. 16, No. 2, May 1998
- Next Century Challenges: Scalable Coordination in Sensor Networks
Deborah Estrin, Ramesh Govindan, John Heidemann, Satish Kumar
Mobile Computing and Networking, 1999.
xFS : Serverless Network File Systems
Tom Anderson, Mike Dahlin, Jeanna Neefe, David Patterson, Drew Roselli, Randy Wang.
SOSP 15, December 1995.
The LOCUS Distributed Operating System
Bruce Walker,
Gerald Popek,
Robert English,
Charles Kline,
Greg Thiel,
9th Symposium on Operating Systems Principles (SOSP),
Bretton Woods, New Hampshire,
November 1983,
pp. 49-70.
DEMOS/MP: The Development of a Distributed Operating System
Barton P. Miller,
David L. Presotto,
Michael L. Powell,
Software-Practice & Experience
17
4,
April 1987,
pp. 277-290.
Plan 9 from Bell Labs
Rob Pike,
David L. Presotto,
Sean Dorward,
Bob Flandrena,
Ken Thompson,
Howard Trickey,
Phil Winterbottom,
Computer Systems
8, 3, Summer 1995, pp. 221-254.
The ITC Distributed File System: Principles and Design
M. Satyanarayanan,
John H. Howard,
David A. Nichols,
Robert N. Sidebotham,
Alfred Z. Spector,
Michael J. West,
10th Symposium on Operating Systems Principles (SOSP),
Orcas Island, Washington,
pp. 35-50.
December 1985.
Magp
ie : Using Magpie for Request Extraction and Workload Modelling
Paul Barham, Austin Donnelly, Rebecca Isaacs, and Richard Mortier,
Microsoft Research, Cambridge, UK, OSDI'04
River : Run-Time Adaptation in River
Remzi H. Arpaci-Dusseau
Transactions on Computing Systems (TOCS), February, 2003, v. 21:1, pp. 36-86
Linda: The S/Net's Linda Kernel
N. Carriero and D. Gelernter
ACM Trans. on Computer Systems 4, 2, May 1986, pp. 110-129.
Survey : Process migration
Dejan S. Milojicic and Fred Douglis and Yves Paindaveine and Richard Wheeler and Songnian Zhou
ACM Comput. Surv. 32, 3, 2000.
Scalable, Distributed Data Structures for Internet Service Construction
Steven D. Gribble, Eric A. Brewer, Joseph M. Hellerstein, and
David Culler , UC Berkeley
OSDI 2000
|
Communication
Comparison
- RPC : Performance of the Firefly RPC
M. D. Schroeder and M. Burrows
ACM Trans. on Computer Systems, 8 1, February 1990, pp. 1-17.
- U-Net : U-Net: A User-Level Network Interface
for Parallel and Distributed Computing
Thorsten von Eicken, Anindya Basu, Vineet Buch, Werner Vogels
Proceedings of the 15th ACM Symposium on Operating Systems Principles,
Copper Mountain Resort, Colorado, December 1995, 40-53.
|
|
 |