Towards Optimal Resource Provisioning for Running MapReduce Programs in Public Clouds

by Fengguang Tian , Keke Chen
Citations:13 - 1 self

Documents Related by Co-Citation

1682 MapReduce: Simplified Data Processing on Large Clusters – Jeffrey Dean, et al. - 2004
18 ARIA: Automatic Resource Inference and Allocation for MapReduce Environments – Abhishek Verma, Ludmila Cherkasova, Roy H. Campbell - 2011
28 Starfish: A Self-tuning System for Big Data Analytics – Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma Bilgen Cetin, Shivnath Babu - 2011
35 ParaTimer: A Progress Indicator for MapReduce DAGs – Kristi Morton, Magdalena Balazinska, Dan Grossman
107 Hive- A Warehousing Solution Over a Map-Reduce Framework – Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zheng Shao, Prasad Chakka, Suresh Anthony, Hao Liu, Pete Wyckoff, Raghotham Murthy - 2009
27 Estimating the Progress of MapReduce Pipelines – Kristi Morton, Abram Friesen, Magdalena Balazinska, Dan Grossman
7 CoScan: Cooperative Scan Sharing in the Cloud – X Wang, C Olston, A Sarma, R Burns - 2011
19 No One (Cluster) Size Fits All: Automatic Cluster Sizing for Data-intensive Analytics – Herodotos Herodotou, Fei Dong, Shivnath Babu
15 FLEX: A Slot Allocation Scheduling Optimizer for MapReduce Workloads – J Wolf, D Rajan, K Hildrum, R Khandekar, V Kumar, S Parekh, K-L Wu, A Balmin - 2010
10 Resource provisioning framework for MapReduce jobs with performance goals – A Verma, L Cherkasova, R Campbell - 2011
455 Dryad: distributed data-parallel programs from sequential building blocks – M Isard, M Budiu, Y Yu, A Birrell, D Fetterly
38 Statistics-Driven Workload Modeling for the Cloud – Archana Ganapathi, Yanpei Chen, Armando Fox, Randy Katz, David Patterson - 2010
16 Performance-driven task co-scheduling for MapReduce environments РJ Polo, D Carrera, Y Becerra, J Torres, E Ayguad̩, M Steinder, I Whalley - 2010
54 Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience – Alan F. Gates, Olga Natkovich, Shubham Chopra, Pradeep Kamath, Shravan M. Narayanamurthy, Christopher Olston, Benjamin Reed, Santhosh Srinivasan, Utkarsh Srivastava - 2009
5 Automated Profiling and Resource Management of Pig Programs for Meeting Service Level Objectives – Zhuoyao Zhang, Ludmila Cherkasova, Boon Thau Loo, Abhishek Verma
4 Optimal Two- and Three-Stage – S Johnson - 1954
19 Towards Optimizing Hadoop Provisioning in the Cloud – Karthik Kambatla, Abhinav Pathak, Himabindu Pucha
18 What-if Analysis, and Costbased Optimization of MapReduce Programs – Profiling - 2011
97 Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling – Matei Zaharia, Khaled Elmeleegy, Dhruba Borthakur, Scott Shenker, Joydeep Sen Sarma, Ion Stoica - 2010