CS739 Spring 2009: Reading List
Distributed Operating Systems
- Survey : Distributed
Operating Systems
Andrew S. Tanenbaum and Robbert Van Renesse
ACM Computing Surveys, Volume 17, Issue 4 (December 1985)
- Sprite vs. Amoeba :
A Comparison of Two Distributed Systems: Amoeba and Sprite
Fred Douglis, M. Frans Kaashoek, John K. Ousterhout, Andrew S. Tanenbaum.
Computing Systems, Vol. 4, No. 3, pp. 353-384, December 1991.
Networked File Systems
- NFS
- State : The Role
of Distributed State
John Ousterhout
1991
- NFSv3 : NFS Version 3: Design and Implementation
Brian Pawlowski and Chet Juszczak and Peter Staubach and Carl Smith
and Diane Lebel and Dave Hitz
In USENIX Summer 1994
- NFSv4 : The NFS Version 4 Protocol
Brian Pawlowski and Spencer Shepler and Carl Beame and Brent Callaghan
and Michael Eisler and David Noveck and David Robinson and Robert
Thurlow
Proceedings of the 2nd International System Administration and Networking Conference (SANE2000)
- NFS Background:
Design and Implementation of the Sun Network Filesystem
Sandberg, R., Goldberg, D., Kleiman, S., Walsh, D., and Lyon, B.
Proceedings of the Summer 1985 USENIX Conference, Portland OR, June
1985, pp. 119-130.
- SpriteFS Optional: Caching in
the Sprite network file system
Michael N. Nelson, Brent B. Welch, John K. Ousterhout
ACM Transactions on Computer Systems (TOCS), Volume 6, Issue 1
(February 1988), pp. 134-154
- Coda : Disconnected Operation in the Coda File System
James J. Kistler, M. Satyanarayanan
13th Symposium on Operating Systems Principles, Asilomar, California, pp. 213-225. October 1991.
- AFS Background : Scale and Performance in a Distributed File System
Howard, J.H., Kazar, M.L., Menees, S.G., Nichols, D.A., Satyanarayanan, M., Sidebotham, R.N., and West, M.J.
ACM Transactions on Computer Systems, Vol. 6, No. 1, February 1988, pp. 51-81.
- LBFS : A Low-Bandwidth Network
File System
Athicha Muthitacharoen, Benjie Chen (MIT), David Mazieres
(NYU), SOSP'01
- Speculator : Speculative execution in a distributed file system
Edmund B. Nightingale, Peter M. Chen, Jason Flinn
Proceedings of the twentieth ACM symposium on Operating systems
principles (SOSP'05), pages 191 - 205.
Ordering of Events
- Theory: Time and Order
- Analysis of Distributed Systems
- Black-Box : Performance Debugging for
Distributed Systems of Black Boxes
Marcos K. Aguilera, Jeffrey C. Mogul, Janet L. Wiener,
Patrick Reynolds, Athicha Muthitacharoen
(HP Labs, Duke, and MIT), SOSP'03
- Paths : Path-Based Failure and Evolution Management
Mike Y. Chen, University of California, Berkeley; Anthony
Accardi, Tellme; Emre Kiciman, Stanford University; Dave
Patterson, University of California, Berkeley; Armando Fox,
Stanford University; Eric Brewer, University of California,
Berkeley, NSDI'04
- Centera : Deconstructing Commodity Storage Clusters
Haryadi Gunawi, Nitin Agrawal, Andrea Arpaci-Dusseau, Remzi
Arpaci-Dusseau
ISCA'05
Programming Environments
- MapReduce
- Dryad: Distributed
Data-Parallel Programs from Sequential Building Blocks
Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis
Fetterly
European Conference on Computer Systems (EuroSys), Lisbon, Portugal,
March 21-23, 2007
Migration
- OS Approaches
- SpriteMigration : Transparent
Process Migration: Design Alternatives and the Sprite
Implementation
Fred Douglis and John K. Ousterhout
Software - Practice and Experience, Volume 21, Number 8, 1991,
Pages 757-785.
- V Migration : Preemptable Remote
Execution Facility for the V-System
M. Theimer, K. Lantz, and D. Cheriton
10th Symposium on Operating
Systems Principles, Orcas Island, WA, December 1985, pp. 2-12.
- VMM-based Migration
- Zap : The Design and Implementation of Zap: A System for Migrating Computing Environments
Steven Osman, Dinesh Subhraveti, Gong Su, and Jason Nieh, Columbia
University,
OSDI'04
- VMmigration : Live
Migration of Virtual Machines
Christopher Clark, Keir Fraser, and Steven Hand, University of
Cambridge Computer Laboratory; Jacob Gorm Hansen and Eric Jul,
University of Copenhagen; Christian Limpach, Ian Pratt, and Andrew
Warfield, University of Cambridge
Symposium on Networked Systems Design and Implementation
(NSDI'05), May 2005
Specialized Distributed Services
- Porcupine: Manageability, Availability and Performance in
Porcupine: A Highly Scalable Internet Mail Service
Yasushi Saito, Brian Bershad, and Hank Levy
17th ACM Symposium on
Operating Systems Principles, Dec 1999, Kiawah Island Resort
- Grapevine : Grapevine: An Exercise in Distributed Computing
Andrew D. Birrell, Roy Levin, Roger M. Needham, Michael D. Schroeder
Communications of the ACM, 25, 4, April 1982, pp. 260-274.
- TACC : Cluster-Based Scalable Network Services
A. Fox, S. Gribble, Y. Chawathe and E. A. Brewer.
Proceedings of SOSP '97, St. Malo, France, October 1997.
Cluster-Based Distributed File Systems
- xFS : Serverless Network File Systems
Tom Anderson, Mike Dahlin, Jeanna Neefe, David Patterson, Drew Roselli, Randy Wang.
SOSP 15, December 1995.
- GoogleFS :
The Google File System
Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung
SOSP'03
Byzantine Agreement
- Byzantine : The Byzantine Generals Problem
Leslie Lamport, Robert Shostak, and Marshall Pease
ACM Transactions on Programming Languages and Systems, Vol 4,
No. 3, July 1982
- FailStop : Byzantine generals in action:
Implementing fail-stop processors.
Fred B. Schneider
TOCS 2, 2 (May 1984), pp. 145-154
- LOCKSS : Preserving Peer Replicas By Rate-Limited Sampled Voting
Petros Maniatis, Mema Roussopoulos, TJ Giuli, David
S. H. Rosenthal, Mary Baker, Yanto Muliadi
SOSP'03
- Practical : Practical
Byzantine Fault Tolerance
Miguel Castro and Barbara Liskov, MIT
OSDI'99
P2P Systems
- CFS : Wide-Area Cooperative
Storage with CFS
Frank Dabek, M. Frans Kaashoek, David Karger, Robert Morris (MIT),
Ion Stoica (UC Berkeley), SOSP'01
- Dynamo : Dynamo: Amazon's Highly Available Key-Value
Store
Giuseppe DeCandia, Deniz Hastorun, Madan Jampani,
Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swami
Sivasubramanian, Peter Vosshall and Werner Vogels
Proceedings of the 21st ACM Symposium on Operating Systems
Principles, Stevenson, WA, October 2007.
- Pangaea : Taming Aggressive Replication in the Pangaea Wide-Area File System
Yasushi Saito, Christos Karamanolis, Magnus Karlsson, and Mallik
Mahalingam, HP Labs, OSDI'02
- SUNDR : Secure Untrusted Data Repository (SUNDR)
Jinyuan Li, Maxwell Krohn, David Mazières, and Dennis Shasha, New York University, OSDI'04
Recovery
- Microreboot : Microreboot - A Technique for Cheap Recovery
George Candea, Shinichi Kawamoto, Yuichi Fujiki, Greg Friedman,
and Armando Fox, Stanford University, OSDI'04
Additional Papers
- DryadLINQ:
A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language
Yuan Yu, Michael Isard, Dennis Fetterly, and Mihai Budiu, Microsoft
Research Silicon Valley; Ulfar Erlingsson, Pradeep Kumar Gunda, and
Jon Currey
OSDI'08
- Policy : Exploiting
Process Lifetime Distributions for Dynamic Load Balancing
Mor Harchol-Balter and Allen Downey.
Proceedings of the ACM SIGMETRICS '96 Conference on Measurement and
Modeling of Computer Systems (SIGMETRICS 96), May 23-26 1996,
Philadelphia, PA.
- Analysis -- Pip: Pip:
Detecting the Unexpected in Distributed Systems
Patrick
Reynolds, Janet L. Wiener, Jeffrey C. Mogul, Mehul A. Shah, Charles
Killian, and Amin Vahdat
Proceedings of the 3rd ACM/USENIX Symposium on Networked Systems Design and Implementation (NSDI), San Jose, CA,
May 2006.
- Petal + Frangipani
- Measurement, Modeling, and
Analysis of a Peer-to-Peer File-Sharing Workload
Krishna P. Gummadi, Richard J. Dunn, Stefan Saroiu, Steven D. Gribble,
Henry M. Levy, John Zahorjan (University of Washington), SOSP'03
- Measurement and Analysis of Spyware in a University Environment
Stefan Saroiu, Steven D. Gribble, and Henry M. Levy, University of
Washington, NSDI'04
- Mistakes: Understanding and Dealing with Operator Mistakes in Internet Services
Kiran Nagaraja, Fábio Oliveira, Ricardo Bianchini, Richard P. Martin, and Thu D. Nguyen, Rutgers University, OSDI'04
- Boxwood : Boxwood: Abstractions as the Foundation for Storage Infrastructure
John MacCormick, Nick Murphy, Marc Najork, Chandramohan A. Thekkath, and Lidong Zhou, Microsoft Research Silicon Valley, OSDI'04
- Sensor : TAG: a Tiny AGgregation Service for Ad-Hoc Sensor Networks.
Samuel Madden, Michael Franklin, Joseph Hellerstein, Wei Hong.
In Proceedings of OSDI, 2002.
- FUSE: Lightweight Guaranteed Distributed Failure Notification
John Dunagan, Microsoft Research; Nicholas
J. A. Harvey, Massachusetts Institute of Technology; Michael
B. Jones, Microsoft Research; Dejan Kostic, Duke
University; Marvin Theimer and Alec Wolman, Microsoft
Research
OSDI'04
- Pastiche: Making Backup Cheap and Easy
Landon P. Cox, Christopher D. Murray, and Brian D. Noble,
University of Michigan, OSDI'02
- FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment
Atul Adya, William J. Bolosky, Miguel Castro, Gerald Cermak, Ronnie
Chaiken, John R. Douceur, Jon Howell, Jacob R. Lorch, Marvin Theimer,
and Roger P. Wattenhofer, Microsoft Research, OSDI'02
- Ivy: A Read/Write Peer-to-Peer File System
Athicha Muthitacharoen, Robert Morris, Thomer M. Gil, and Benjie Chen,
Massachusetts Institute of Technology, OSDI'02
- Paxos Made Simple
Leslie Lamport
November 2001
- The Part-Time Parliament
Leslie Lamport
ACM Transactions on Computer Systems, Vol. 16, No. 2, May 1998
- Next Century Challenges: Scalable Coordination in Sensor Networks
Deborah Estrin, Ramesh Govindan, John Heidemann, Satish Kumar
Mobile Computing and Networking, 1999.
- The LOCUS Distributed Operating System
Bruce Walker, Gerald Popek, Robert English, Charles Kline, Greg Thiel
9th Symposium on Operating Systems Principles (SOSP),
Bretton Woods, New Hampshire, November 1983, pp. 49-70.
- DEMOS/MP: The Development of a Distributed Operating System
Barton P. Miller, David L. Presotto, Michael L. Powell
Software - Practice and Experience, 17, 4, April 1987, pp. 277-290.
- Plan 9 from Bell Labs
Rob Pike, David L. Presotto, Sean Dorward, Bob Flandrena, Ken Thompson,
Howard Trickey, Phil Winterbottom
Computing Systems, 8, 3, Summer 1995, pp. 221-254.
- The ITC Distributed File System: Principles and Design
M. Satyanarayanan, John H. Howard, David A. Nichols, Robert N. Sidebotham,
Alfred Z. Spector, Michael J. West
10th Symposium on Operating Systems Principles (SOSP),
Orcas Island, Washington, December 1985, pp. 35-50.
- Magpie : Using Magpie for Request Extraction and Workload Modelling
Paul Barham, Austin Donnelly, Rebecca Isaacs, and Richard Mortier,
Microsoft Research, Cambridge, UK, OSDI'04
- River : Run-Time Adaptation in River
Remzi H. Arpaci-Dusseau
ACM Transactions on Computer Systems (TOCS), 21, 1, February 2003, pp. 36-86
- Linda: The S/Net's Linda Kernel
N. Carriero and D. Gelernter
ACM Trans. on Computer Systems 4, 2, May 1986, pp. 110-129.
- Survey : Process migration
Dejan S. Milojicic and Fred Douglis and Yves Paindaveine and Richard Wheeler and Songnian Zhou
ACM Comput. Surv. 32, 3, 2000.
- Scalable, Distributed Data Structures for Internet Service Construction
Steven D. Gribble, Eric A. Brewer, Joseph M. Hellerstein, and
David Culler, UC Berkeley
OSDI 2000
- RPC : Performance of the Firefly RPC
M. D. Schroeder and M. Burrows
ACM Trans. on Computer Systems, 8, 1, February 1990, pp. 1-17.
- U-Net : U-Net: A User-Level Network Interface
for Parallel and Distributed Computing
Thorsten von Eicken, Anindya Basu, Vineet Buch, Werner Vogels
Proceedings of the 15th ACM Symposium on Operating Systems Principles,
Copper Mountain Resort, Colorado, December 1995, 40-53.
- RPC Background : Implementing Remote Procedure Calls
Andrew D. Birrell, Bruce Jay Nelson
ACM Transactions on Computer Systems, 2, 1, February 1984, pp. 39-59.