MapReduce in terms of SQL
I found a great explanation of MapReduce in terms of SQL in a blog post about Hadoop by Chris Stucchio:
SELECT G(...) FROM table GROUP BY F(...)
The only thing you are permitted to touch is
G(k,v), except of course for performance optimizations (usually not the fun kind!) at intermediate steps. Everything else is fixed.
Back to index: Old blog.
Tags: algorithm, SQL, programming, old blog