|
913
|
MapReduce: simplified data processing on large clusters
– Jeffrey Dean, Sanjay Ghemawat
- 2004
|
|
5
|
ARIA: automatic resource inference and allocation for MapReduce environments
– A Verma, L Cherkasova, R H Campbell
- 2011
|
|
5
|
FLEX: A Slot Allocation Scheduling Optimizer for MapReduce Workloads
– J Wolf, D Rajan, K Hildrum, R Khandekar, V Kumar, S Parekh, K-L Wu, A Balmin
- 2010
|
|
9
|
ParaTimer: A Progress Indicator for MapReduce DAGs
– Kristi Morton, Magdalena Balazinska, Dan Grossman
|
|
11
|
Starfish: A Self-tuning System for Big Data Analytics
– Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma Bilgen Cetin, Shivnath Babu
- 2011
|
|
265
|
Dryad: Distributed data-parallel programs from sequential building blocks
– M Isard, M Budiu, Y Yu, A Birrell, D Fetterly
- 2007
|
|
41
|
Hive- A Warehousing Solution Over a Map-Reduce Framework
– Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zheng Shao, Prasad Chakka, Suresh Anthony, Hao Liu, Pete Wyckoff, Raghotham Murthy
- 2009
|
|
3
|
Optimal Two- and Three-Stage
– S Johnson
- 1954
|
|
3
|
Performance-Driven Task Co-Scheduling for MapReduce Environments
– J Polo, D Carrera, Y Becerra, J Torres, E Ayguadé, M Steinder, I Whalley
- 2010
|
|
3
|
CoScan: Cooperative Scan Sharing in the Cloud
– X Wang, C Olston, A Sarma, R Burns
- 2011
|
|
27
|
Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience
– Alan F. Gates, Olga Natkovich, Shubham Chopra, Pradeep Kamath, Shravan M. Narayanamurthy, Christopher Olston, Benjamin Reed, Santhosh Srinivasan, Utkarsh Srivastava
- 2009
|
|
13
|
Statistics-Driven Workload Modeling for the Cloud
– Archana Ganapathi, Yanpei Chen, Armando Fox, Randy Katz, David Patterson
- 2010
|
|
158
|
An efficient data clustering method for very large databases
– T ZHANG, R RAMAKRISHNAN, M BIRCH LIVNY
- 1996
|
|
10
|
VISTA: Validating and refining clusters via visualization
– Keke Chen, Ling Liu, Keke Chen, Ling Liu
- 2004
|
|
1
|
CloudVista: Visual Cluster Exploration for Extreme Scale Data in the Cloud
– Keke Chen, Huiqi Xu, Fengguang Tian, Shumin Guo
|
|
1
|
Building a High-Level
– A Gates, O Natkovich, S Chopra, P Kamath, S Narayanam, C Olston, B Reed, S Srinivasan, U Srivastava
|
|
1
|
CoScan: Cooperative Scan Sharing
– X Wang, C Olston, A Sarma, R Burns
- 2011
|
|
65
|
SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets
– Ronnie Chaiken, Bob Jenkins, Per-åke Larson, Bill Ramsey, Darren Shakib, Simon Weaver, Jingren Zhou
|
|
7
|
Towards Optimizing Hadoop Provisioning in the Cloud
– Karthik Kambatla, Abhinav Pathak, Himabindu Pucha
|