Distributed Systems To-read List
Scheduling
- Bikash Sharma, Victor Chudnovsky, Joseph L. Hellerstein, Rasekh Rifaat, and Chita R. Das. Modeling and synthesizing task placement constraints in Google compute clusters. In Proceedings of the 2nd ACM Symposium on Cloud Computing (SOCC '11). ACM, New York, NY, USA, Article 3, 14 pages.
- Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy Katz, Scott Shenker, and Ion Stoica. Mesos: a platform for fine-grained resource sharing in the data center. In Proceedings of the 8th USENIX conference on Networked systems design and implementation (NSDI'11). USENIX Association, Berkeley, CA, USA, 22-22.
Notes
- H. Nguyen et al. AGILE: elastic distributed resource scaling for Infrastructure-as-a-Service. In Proceedings of the 10th International Conference on Autonomic Computing (ICAC '13). USENIX Association, San Jose, CA, USA, 69-82.
Link
- Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, and John Wilkes. Omega: flexible, scalable schedulers for large compute clusters. In Proceedings of the 8th ACM European Conference on Computer Systems (EuroSys '13). ACM, New York, NY, USA, 351-364.
- Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, and Ion Stoica. Dominant resource fairness: fair allocation of multiple resource types. In Proceedings of the 8th USENIX conference on Networked systems design and implementation (NSDI'11). USENIX Association, Berkeley, CA, USA, 24-24.
Slide
- Kay Ousterhout, Patrick Wendell, Matei Zaharia, and Ion Stoica. Sparrow: distributed, low latency scheduling. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (SOSP '13). ACM, New York, NY, USA, 69-84.
Paper | Slide | Moderator | Video
File Systems
- Cary G. Gray and David R. Cheriton. Leases: An efficient fault-tolerant mechanism for distributed file cache consistency. In Proceedings of the 12th ACM Symposium on Operating Systems Principles, pages 202-210, December 1989.
- Mahadev Satyanarayanan. Scalable, secure, and highly available distributed file access. IEEE Computer, 23(5):9-21, May 1990.
Shared Memory
- Cristiana Amza, Alan L. Cox, Sandhya Dwarkadas, Pete Keleher, Honghui Lu, Ramakrishnan Rajamony, Weimin Yu, and Willy Zwaenepoel. TreadMarks: Shared Memory Computing on Networks of Workstations. Computer 29, 2 (February 1996), 18-28.
Time and Global States
- D. Jefferson, B. Beckman, F. Wieland, L. Blume, and M. Diloreto. Time warp operating system. In Proceedings of the eleventh ACM Symposium on Operating systems principles (SOSP '87). ACM, New York, NY, USA, 77-93.
- David R. Cheriton and Dale Skeen. Understanding the limitations of causally and totally ordered communication. In Proceedings of the fourteenth ACM symposium on Operating systems principles (SOSP '93). ACM, New York, NY, USA, 44-57.
- D.L. Mills. Internet time synchronization: the network time protocol. IEEE Transactions on Communications, vol. 39, no. 10, pp. 1482-1493, Oct 1991. (Project website)
- Vive la Différence: Paxos vs. Viewstamped Replication vs. Zab
- Jeffrey Dean and Luiz André Barroso. The tail at scale. Commun. ACM 56, 2 (February 2013), 74-80.
Directories and Naming
- Butler W. Lampson. Designing a global name service. In Proceedings of the fifth annual ACM symposium on Principles of distributed computing (PODC '86). ACM, New York, NY, USA, 1-10.
- David R. Cheriton and Timothy P. Mann. Decentralizing a global naming service for improved performance and fault tolerance. ACM Transactions on Computer Systems, 7(2):147-183, May 1989.
P2P Systems
- S.A. Baset and H.G. Schulzrinne. An Analysis of the Skype Peer-to-Peer Internet Telephony Protocol. In Proceedings of IEEE INFOCOM 2006, the 25th IEEE International Conference on Computer Communications, pp. 1-11, April 2006.
- Ion Stoica, Robert Morris, David Liben-Nowell, David R. Karger, M. Frans Kaashoek, Frank Dabek, and Hari Balakrishnan. Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications. IEEE/ACM Transactions on Networking (TON), 2003.