
MapReduce: Simplified Data Processing on Large Clusters

Jeffrey Dean and Sanjay Ghemawat
Google

One-line Summary

MapReduce is an easy-to-use parallel data processing model that hides the complexity of parallelization, scheduling, execution, and failure handling from users.
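
To make the division of labor concrete, here is a minimal single-process sketch of the programming model, using the word-count example. The names map_fn, reduce_fn, and run_mapreduce are hypothetical and chosen for illustration; run_mapreduce is only a stand-in for the framework and does none of the distribution, scheduling, or failure handling that the real system performs.

```python
from collections import defaultdict


def map_fn(doc_name, text):
    """User-supplied map: emit an intermediate (word, 1) pair per word."""
    for word in text.split():
        yield word, 1


def reduce_fn(word, counts):
    """User-supplied reduce: merge all values emitted for one key."""
    return sum(counts)


def run_mapreduce(inputs, map_fn, reduce_fn):
    """Illustrative stand-in for the framework (an assumption, not the
    paper's implementation). It runs everything in one process; a real
    system would split the input, run tasks on many machines, and
    re-execute tasks that fail."""
    groups = defaultdict(list)
    # Map phase: apply map_fn to every input record and group by key.
    for key, value in inputs.items():
        for out_key, out_value in map_fn(key, value):
            groups[out_key].append(out_value)
    # Reduce phase: apply reduce_fn once per distinct intermediate key.
    return {key: reduce_fn(key, values) for key, values in groups.items()}


documents = {"a.txt": "to be or not to be", "b.txt": "be quick"}
counts = run_mapreduce(documents, map_fn, reduce_fn)
```

The user writes only map_fn and reduce_fn; everything inside run_mapreduce is the part the model hides.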

Overview/Main Points

Relevance

Flaws