MapReduce Programming and Cost-based Optimization? Crossing this Chasm with Starfish

by Herodotos Herodotou, Fei Dong, Shivnath Babu

Active Bibliography

No One (Cluster) Size Fits All: Automatic Cluster Sizing for Data-intensive Analytics – Herodotos Herodotou, Fei Dong, Shivnath Babu
Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs – Herodotos Herodotou, Shivnath Babu
Purlieus: Locality-aware Resource Allocation for MapReduce in a Cloud – Balaji Palanisamy, Aameek Singh, Ling Liu, Bhushan Jain
PerfXplain: Debugging MapReduce Job Performance – Nodira Khoussainova, Magdalena Balazinska, Dan Suciu
Towards Optimal Resource Provisioning for Running MapReduce Programs in Public Clouds – Fengguang Tian, Keke Chen
Parallel Data Processing with MapReduce: A Survey – Kyong-ha Lee, Yoon-joon Lee, Hyunsik Choi, Yon Dohn Chung, Bongki Moon
An Optimization Framework for Map-Reduce Queries – Leonidas Fegaras, Chengkai Li, Upa Gupta
Processing Data-intensive Workflows in the Cloud – Zhuoyao Zhang, 2012 (posted at ScholarlyCommons, http://repository.upenn.edu/cis reports/970)
Automated Profiling and Resource Management of Pig Programs for Meeting Service Level Objectives – Zhuoyao Zhang, Ludmila Cherkasova, Boon Thau Loo, Abhishek Verma
HP Labs – Yongchul Kwon, Magdalena Balazinska, Bill Howe, Jerome Rolia
Efficient MapReduce in the Cloud – Michael Cardosa, Aameek Singh, Himabindu Pucha, Abhishek Chandra
CURRICULUM VITAE – Rares Vernica, Professor Michael J. Carey
Exploring MapReduce Efficiency with Highly-Distributed Data – Michael Cardosa, Chenyu Wang, Anshuman Nangia, Abhishek Chandra, Jon Weissman
The Performance of MapReduce: An In-depth Study – Dawei Jiang, Beng Chin Ooi, Lei Shi, Sai Wu
A Platform for Scalable One-Pass Analytics using MapReduce – Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGregor, Prashant Shenoy
Praveen Kumar Assistant Professor, – Piyush Saxena, Satyajit Padhy
MapReduce with Deltas – R. Lämmel, D. Saile
U N I V E R S – Calum Robert, William Clark
RDFPath: Path Query Processing on Large RDF Graphs with MapReduce – Martin Przyjaciel-Zablocki, Alexander Schätzle, Thomas Hornung, Georg Lausen