Databases Paper Reading List
Parallel and Distributed DBs
- Magdalena Balazinska, Hari Balakrishnan, Samuel Madden, and Michael Stonebraker. Fault-tolerance in the Borealis distributed stream processing system. In Proceedings of the 2005 ACM SIGMOD international conference on Management of data (SIGMOD '05). ACM, New York, NY, USA, 13-24.
- J. Baker et al. Megastore: Providing Scalable, Highly Available Storage for Interactive Services CIDR (2011), pp. 223-234.
- J. Shute et al. F1: A distributed SQL database that scales. PVLDB, 6(11), 2013.
Notes
- J. C. Corbett et al. Spanner: Google's Globally-Distributed Database. In OSDI, 2012.
- Daniel Peng and Frank Dabek. Large-scale incremental processing using distributed transactions and notifications. In Proceedings of the 9th USENIX conference on Operating systems design and implementation (OSDI'10). USENIX Association, Berkeley, CA, USA, 1-15.
- S. Melnik et al. Dremel: Interactive Analysis of Web-Scale Datasets. Proc. VLDB Endow. 3, 1-2 (September 2010), 330-339.
- A. Gupta et al. Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing. Proc. VLDB Endow. 7(12) (August 2014), 1259-1270.
- G. Malewicz et al. Pregel: a system for large-scale graph processing. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data (SIGMOD '10). ACM, New York, NY, USA, 135-146.
- A. Demers et al. Epidemic algorithms for replicated database maintenance. In Proceedings of the sixth annual ACM Symposium on Principles of distributed computing (PODC '87), Fred B. Schneider (Ed.). ACM, New York, NY, USA, 1-12.
- P. Helland. Life beyond Distributed Transactions: an Apostate's Opinion. In 3rd Biennial Conf. On Innov. Data Syst. Research (CIDR), Asilomar, CA, USA, pp. 132-141 (2007).
- T. Akidau et al. MillWheel: Fault-Tolerant Stream Processing at Internet Scale. Very Large Data Bases (2013), pp. 734-746.
-
Stephen Tu, Wenting Zheng, Eddie Kohler, Barbara Liskov, and Samuel Madden. Speedy transactions in multicore in-memory databases. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (SOSP '13). ACM, New York, NY, USA, 18-32.
[Slide]
[Code]
- Stefan Heule, Marc Nunkesser, and Alexander Hall. HyperLogLog in practice: algorithmic engineering of a state of the art cardinality estimation algorithm. In Proceedings of the 16th International Conference on Extending Database Technology (EDBT '13). ACM, New York, NY, USA, 683-692.
Stream Processing
- Apache Storm (blog)
- Twitter Heron. SIGMOD'15