MapReduce Programming and Cost-based Optimization? Crossing this Chasm with Starfish

by Herodotos Herodotou, Fei Dong, Shivnath Babu
Citations: 5 - 3 self

Documents Related by Co-Citation

21 No One (Cluster) Size Fits All: Automatic Cluster Sizing for Data-intensive Analytics – Herodotos Herodotou, Fei Dong, Shivnath Babu
1734 MapReduce: Simplified Data Processing on Large Clusters – Jeffrey Dean, et al. - 2004
112 Hive - A Warehousing Solution Over a Map-Reduce Framework – Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zheng Shao, Prasad Chakka, Suresh Anthony, Hao Liu, Pete Wyckoff, Raghotham Murthy - 2009
359 Pig Latin: A Not-So-Foreign Language for Data Processing – Christopher Olston, Benjamin Reed, Utkarsh Srivastava, Ravi Kumar, Andrew Tomkins
19 Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs – Herodotos Herodotou, Shivnath Babu - 2011
22 Automatic Optimization for MapReduce Programs – Eaman Jahani, Michael J. Cafarella, Christopher Ré
14 Query optimization for massively parallel data processing – S Wu, F Li, S Mehrotra, B C Ooi - 2011
3 Automated SQL Tuning through Trial and (Sometimes) Error – Herodotos Herodotou, Shivnath Babu
3 Xplus: A SQL-Tuning-Aware Query Optimizer – Herodotos Herodotou, Shivnath Babu
3 Query Optimization Techniques for Partitioned Tables – Herodotos Herodotou, Nedyalko Borisov, Shivnath Babu
23 Automatic Optimization of Parallel Dataflow Programs – Christopher Olston, Benjamin Reed, Adam Silberstein, Utkarsh Srivastava
57 Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience – Alan F. Gates, Olga Natkovich, Shubham Chopra, Pradeep Kamath, Shravan M. Narayanamurthy, Christopher Olston, Benjamin Reed, Santhosh Srinivasan, Utkarsh Srivastava - 2009
38 Optimizing Joins in a MapReduce Environment – Foto N. Afrati, Jeffrey D. Ullman - 2010
17 MRShare: Sharing Across Multiple Queries in MapReduce – Tomasz Nykiel, George Kollios, Michalis Potamias, Nick Koudas, Chaitanya Mishra
460 Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks – M Isard, M Budiu, Y Yu, A Birrell, D Fetterly - 2007
130 A comparison of approaches to large-scale data analysis – Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. Dewitt, Samuel Madden, Michael Stonebraker - 2009
39 Distributed Aggregation for Data-Parallel Computing: Interfaces and Implementations – Yuan Yu, Pradeep Kumar Gunda, Michael Isard
2 Automated Experiment Driven Management of (Database) Systems – S Babu, N Borisov, S Duan, H Herodotou, V Thummala - 2009
13 RIOT: I/O-Efficient Numerical Computing without SQL – Yi Zhang, Herodotos Herodotou, Jun Yang