Interpreting the Data: Parallel Analysis with Sawzall

by Rob Pike, Sean Dorward, Robert Griesemer, Sean Quinlan, Google Inc
Venue: Scientific Programming Journal, Special Issue on Grids and Worldwide Computing Programming Models and Infrastructure
Citations: 205 - 0 self

Documents Related by Co-Citation

1734 MapReduce: Simplified Data Processing on Large Clusters – Jeffrey Dean, et al. - 2004
359 Pig Latin: A Not-So-Foreign Language for Data Processing – Christopher Olston, Benjamin Reed, Utkarsh Srivastava, Ravi Kumar, Andrew Tomkins
460 Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks – M Isard, M Budiu, Y Yu, A Birrell, D Fetterly - 2007
922 The Google File System – Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung - 2003
530 Bigtable: A distributed storage system for structured data – Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber - 2006
128 Map-reduce-merge: simplified relational data processing on large clusters – H-c Yang, A Dasdan, R-L Hsiao, D S Parker - 2007
193 Programming Parallel Algorithms – Guy E. Blelloch - 1996
169 DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language – Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Úlfar Erlingsson, Pradeep Kumar Gunda, Jon Currey
111 SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets – Ronnie Chaiken, Bob Jenkins, Per-Åke Larson, Bill Ramsey, Darren Shakib, Simon Weaver, Jingren Zhou
292 MapReduce: Simplified Data Processing on Large Clusters – Jeffrey Dean, Sanjay Ghemawat
696 Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals – Jim Gray, Adam Bosworth, Andrew Layman, Don Reichart, Hamid Pirahesh - 1996
97 HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads – Azza Abouzeid, Kamil Bajda-Pawlikowski, Daniel Abadi, Avi Silberschatz, Alexander Rasin
229 The Gamma database machine project – David J. Dewitt, Shahram Ghandeharizadeh, Donovan Schneider, Allan Bricker, Hui-i Hsiao, Rick Rasmussen - 1990
177 Encapsulation of Parallelism in the Volcano Query Processing System – G Graefe - 1990
138 Evaluating MapReduce for multi-core and multiprocessor systems – Colby Ranger, Ramanan Raghuraman, Arun Penmetsa, Gary Bradski, Christos Kozyrakis - 2007
3262 The Anatomy of a Large-Scale Hypertextual Web Search Engine – Sergey Brin, Lawrence Page - 1998
520 Parallel database systems: the future of high performance database systems – David J. Dewitt, Jim Gray - 1992
1132 A bridging model for parallel computation – L Valiant - 1990
46 Bigtable: A Distributed Storage System for Structured Data – F Chang, et al. - 2006