Interpreting the Data: Parallel Analysis with Sawzall

by Rob Pike , Sean Dorward , Robert Griesemer , Sean Quinlan , Google Inc
Venue:Scientific Programming Journal, Special Issue on Grids and Worldwide Computing Programming Models and Infrastructure
Citations:181 - 0 self

Documents Related by Co-Citation

1682 MapReduce: Simplified Data Processing on Large Clusters – Jeffrey Dean, et al. - 2004
348 Pig Latin: A Not-So-Foreign Language for Data Processing – Christopher Olston, Benjamin Reed, Utkarsh Srivastava, Ravi Kumar, Andrew Tomkins
455 Dryad: distributed data-parallel programs from sequential building blocks – M Isard, M Budiu, Y Yu, A Birrell, D Fetterly
908 The Google File System – Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung - 2003
506 Bigtable: A distributed storage system for structured data – Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber - 2006
123 Parker,"MapReduce-Merge: Simplified Relational Data Processing on Large Clusters – H chih Yang, A Dasdan, R-L Hsiao, D S - 2007
167 DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language – Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Úlfar Erlingsson, Pradeep Kumar, Gunda Jon Currey
193 Programming Parallel Algorithms – Guy E. Blelloch - 1996
109 SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets – Ronnie Chaiken, Bob Jenkins, Per-åke Larson, Bill Ramsey, Darren Shakib, Simon Weaver, Jingren Zhou
285 Sanjay Ghemawat. Mapreduce: Simplified data processing on large clusters – Jeffrey Dean - 2004
693 Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals – Jim Gray, Adam Bosworth, Andrew Layman, Don Reichart, Hamid Pirahesh - 1996
178 Encapsulation of parallelism in the volcano query processing system – G Graefe - 1990
231 The Gamma database machine project – David J. Dewitt, Shahram Ghandeharizadeh, Donovan Schneider, Allan Bricker, Hui-i Hsiao, Rick Rasmussen - 1990
93 HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads – Azza Abouzeid, Kamil Bajda-pawlikowski, Daniel Abadi, Avi Silberschatz, Er Rasin
133 Evaluating MapReduce for multi-core and multiprocessor systems – Colby Ranger, Ramanan Raghuraman, Arun Penmetsa, Gary Bradski, Christos Kozyrakis - 2007
3234 The Anatomy of a Large-Scale Hypertextual Web Search Engine – Sergey Brin, Lawrence Page - 1998
519 Parallel database systems: the future of high performance database systems – David J. Dewitt, Jim Gray - 1992
1126 A bridging model for parallel computation – Leslie G Valiant - 1990
46 et al. Bigtable: A Distributed Storage System for Structured Data. Google – Fay Chang - 2006