Parallel Database Systems: The Future of High Performance Database Systems
David J. DeWitt and Jim Gray
University of Wisconsin and DEC
Scribe by: Zuyu Zhang
- Goals for parallelism
- linear speedup
- Speedup = (small-system elapsed time) / (big-system elapsed time)
- 2x hardware does the same problem in ½ time.
- linear: a system n times as large takes 1/n the time
- Superlinear speedup
- Possible, but tricky: the problem does not fit in one system's memory yet fits in the combined memory of two, so the speedup can exceed two.
- linear scale-up
- 2x hardware does problem 2x bigger in the same time.
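These two metrics can be illustrated with a small calculation (the timings below are made-up numbers, purely for illustration):

```python
def speedup(small_system_time, big_system_time):
    """Speedup = elapsed time on the small system / elapsed time on the big system."""
    return small_system_time / big_system_time

def scaleup(small_time_small_problem, big_time_big_problem):
    """Scaleup = (1x problem on 1x hardware) time / (Nx problem on Nx hardware) time.
    A value of 1.0 means linear scaleup."""
    return small_time_small_problem / big_time_big_problem

# Hypothetical timings: 1 processor takes 100 s; 2 processors take 50 s.
print(speedup(100.0, 50.0))   # 2.0 -> linear speedup
# 2x hardware on a 2x bigger problem still takes 100 s.
print(scaleup(100.0, 100.0))  # 1.0 -> linear scaleup
```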
- Relational operators
- select-project / scan
- aggregate: sum, min, max, or count
- insert, update, delete
- set operators: union, intersection, difference
- join, division
- Generic Barriers
- startup time
- time to initiate the parallel operation
- interference
- the slowdown each new process imposes on the others through contention for shared resources (an argument for shared-nothing over SMP)
- skew and load imbalance
- unevenly partitioned work
- Kinds of parallel systems
- shared memory (shared everything)
- Scales up only to a limited number of processors before the shared memory bus becomes a bottleneck
- shared disk, with interconnection between local memory and disks
- The interconnect bandwidth must at least equal the combined bandwidth of all the disks, or the network becomes a bottleneck.
- Scaleup lower than shared memory due to communication contention
- Data consistency issues arise because data may be cached in multiple places yet accessed by any node in the system at any time.
- shared nothing, with machines communicating over an interconnect via message passing
- Parallel database
- Typical relational systems execute queries as a collection of operations, with tuples streaming between them.
- Ex: σp1(R)⋈ σp2(S)
- hash join, and R fits in memory
- Option #1
- run each operator on a different processor
- bad, because
- not enough operators to keep many processors busy
- heavy skew in work across operators
- Option #2
- data parallel, partitioned execution of operators
- two new operators: split and merge (FIFO)
- three phases of repartition
- split: each node splits its portion of the table into fragments.
- shuffle: redistribute the fragments.
- merge: combine the shuffled fragments at their destinations.
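The three repartitioning phases, followed by a local hash join on each node, can be sketched in a single process; every function name here is invented for illustration, and each list plays the role of one node's data:

```python
from collections import defaultdict

def hash_split(table, key, n_nodes):
    """split: each node divides its tuples into one fragment per destination node."""
    fragments = [[] for _ in range(n_nodes)]
    for row in table:
        fragments[hash(row[key]) % n_nodes].append(row)
    return fragments

def shuffle_and_merge(fragments_per_node, n_nodes):
    """shuffle + merge: route fragment i of every node to node i, concatenating FIFO."""
    merged = [[] for _ in range(n_nodes)]
    for fragments in fragments_per_node:
        for dest, frag in enumerate(fragments):
            merged[dest].extend(frag)
    return merged

def local_hash_join(r_part, s_part, key):
    """Classic in-memory hash join: build a table on R's partition, probe with S's."""
    build = defaultdict(list)
    for r in r_part:
        build[r[key]].append(r)
    return [(r, s) for s in s_part for r in build.get(s[key], [])]

# Two "nodes", each holding part of R and part of S, joined on attribute 'k'.
R_nodes = [[{'k': 1}, {'k': 2}], [{'k': 3}]]
S_nodes = [[{'k': 2}], [{'k': 1}, {'k': 3}]]
n = 2
R_parts = shuffle_and_merge([hash_split(t, 'k', n) for t in R_nodes], n)
S_parts = shuffle_and_merge([hash_split(t, 'k', n) for t in S_nodes], n)
result = [pair for i in range(n)
          for pair in local_hash_join(R_parts[i], S_parts[i], 'k')]
```

After repartitioning, tuples with equal join keys land on the same node, so each node's join is independent of the others.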
- Data partition schemes
- range partition: it is not always clear how to pick the boundaries
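A sketch of range partitioning, with deliberately poor hand-picked boundaries to show why choosing them is hard (the data and boundary values are made up):

```python
import bisect

def range_partition(table, key, boundaries):
    """Assign each row to the partition whose range contains its key.
    boundaries must be sorted; n boundaries define n + 1 partitions."""
    parts = [[] for _ in range(len(boundaries) + 1)]
    for row in table:
        parts[bisect.bisect_right(boundaries, row[key])].append(row)
    return parts

rows = [{'id': v} for v in [1, 2, 3, 50, 51, 52, 53, 99]]
# Boundaries chosen without knowing the data distribution: load becomes skewed.
parts = range_partition(rows, 'id', [10, 90])
print([len(p) for p in parts])  # [3, 4, 1] -> unbalanced partitions
```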
- Distribution strategies for joining R and S
- Suppose R and S are already partitioned on join attribute (no re-partitioning needed)
- Suppose R is partitioned on join attribute, but S is not ⇒ repartition S
- Suppose neither R nor S are already partitioned, and R is small
- replicate R in every processor
- replicating a small R generates little traffic, so the network is unlikely to be the bottleneck
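The third case, replicating a small R to every processor and joining against each local S partition, can be sketched as follows (function and variable names are invented):

```python
def broadcast_join(small_r, s_partitions, key):
    """Replicate small R to every node; each node joins its local S partition."""
    build = {}
    for r in small_r:                  # R is small, so the build table is cheap
        build.setdefault(r[key], []).append(r)
    out = []
    for s_part in s_partitions:        # one iteration per "node"
        for s in s_part:
            for r in build.get(s[key], []):
                out.append((r, s))
    return out

R = [{'k': 1, 'x': 'a'}, {'k': 2, 'x': 'b'}]   # small table, replicated everywhere
S_parts = [[{'k': 1}], [{'k': 2}, {'k': 1}]]   # S left in its existing partitions
print(len(broadcast_join(R, S_parts, 'k')))    # 3
```

S never moves: only R crosses the network, once per node, which is why this strategy pays off when R is small.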
- Parallelism in index lookup
- partition on the index attribute
- may not speed up a single lookup, since it touches only one partition
- instead, parallelize a big query and spread independent transactions across many processors
- SELECT R.A, COUNT(B) FROM R GROUP BY R.A
- Option #1
- repartition on R.A
- do local aggregation
- Option #2
- do local grouping on R.A
- repartition groups
- combine groups to do final aggregation
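Option #2 (local pre-aggregation, then combine) can be sketched as below; partial counts combine by summation, and the sketch approximates COUNT(B) by counting rows per group (all names are invented):

```python
from collections import Counter

def local_count(rows, group_key):
    """Phase 1: each node counts rows per group in its own partition."""
    return Counter(row[group_key] for row in rows)

def combine(partials):
    """Phase 3: after repartitioning groups by A, sum the partial counts."""
    total = Counter()
    for p in partials:
        total.update(p)
    return total

node1 = [{'A': 'x'}, {'A': 'x'}, {'A': 'y'}]
node2 = [{'A': 'y'}, {'A': 'z'}]
result = combine([local_count(node1, 'A'), local_count(node2, 'A')])
print(dict(result))  # {'x': 2, 'y': 2, 'z': 1}
```

Pre-aggregating locally shrinks the data before the repartition step, which is why Option #2 usually moves far fewer tuples than Option #1.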