Documents Related by Co-Citation

1682 MapReduce: Simplified Data Processing on Large Clusters – Jeffrey Dean, et al. - 2004
455 Dryad: distributed data-parallel programs from sequential building blocks – M Isard, M Budiu, Y Yu, A Birrell, D Fetterly
348 Pig Latin: A Not-So-Foreign Language for Data Processing – Christopher Olston, Benjamin Reed, Utkarsh Srivastava, Ravi Kumar, Andrew Tomkins
167 DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language – Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Úlfar Erlingsson, Pradeep Kumar Gunda, Jon Currey
179 Improving MapReduce Performance in Heterogeneous Environments – Matei Zaharia, Andy Konwinski, Anthony D. Joseph, Randy Katz, Ion Stoica
19 DryadInc: Reusing work in large-scale computations – Lucian Popa, Mihai Budiu, Yuan Yu, Michael Isard
908 The Google File System – Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung - 2003
170 Pregel: A system for large-scale graph processing – Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, Grzegorz Czajkowski - 2010
311 Online Aggregation – Joseph M. Hellerstein, Peter J. Haas, Helen J. Wang - 1997
181 Interpreting the Data: Parallel Analysis with Sawzall – Rob Pike, Sean Dorward, Robert Griesemer, Sean Quinlan
62 HaLoop: Efficient Iterative Data Processing on Large Clusters – Yingyi Bu, Bill Howe, Magdalena Balazinska, Michael D. Ernst
97 Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling – Matei Zaharia, Khaled Elmeleegy, Dhruba Borthakur, Scott Shenker, Joydeep Sen Sarma, Ion Stoica - 2010
59 Spark: Cluster Computing with Working Sets – Matei Zaharia, Mosharaf Chowdhury, Michael J. Franklin, Scott Shenker, Ion Stoica
107 Hive - A Warehousing Solution Over a Map-Reduce Framework – Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zheng Shao, Prasad Chakka, Suresh Anthony, Hao Liu, Pete Wyckoff, Raghotham Murthy - 2009
506 Bigtable: A distributed storage system for structured data – Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber - 2006
22 Comet: batched stream processing for data intensive distributed computing – B He, M Yang, Z Guo, R Chen, B Su, W Lin, L Zhou - 2010
91 Twister: A runtime for iterative MapReduce – Jaliya Ekanayake, Hui Li, Bingjing Zhang, Thilina Gunarathne, Seung-hee Bae, Judy Qiu, Geoffrey Fox - 2010
24 Stateful Bulk Processing for Incremental Analytics – Dionysios Logothetis, Kevin C. Webb, Christopher Olston, Ken Yocum, Benjamin Reed
3234 The Anatomy of a Large-Scale Hypertextual Web Search Engine – Sergey Brin, Lawrence Page - 1998