## Scalable Hybrid Implementation of Graph Coloring using MPI and OpenMP (2012)

Venue: Proc. of PCO

### Citations

449 | A simple parallel algorithm for the maximal independent set problem.
- Luby
- 1986
Citation Context ...f vertices does not contain any two vertices having an edge between them; such a set is said to be maximal if no other vertices can be added to that set while keeping it independent. Luby’s algorithm [18] starts by assigning a random number to each vertex; then it finds a vertex such that its random number is larger than all of the neighbors, removes it and removes its neighbors. Instead of removing t...

150 | Estimation of sparse Jacobian matrices and graph coloring problems.
- Coleman, More
- 1984
Citation Context ...nt in many scientific computing applications such as automatic differentiation [1], printed circuit testing [2], parallel numerical computation problems [3], register allocation [4], and optimization [5]. Today’s large scientific computing applications are typically executed on large scale parallel machines for mainly two reasons: to reduce the execution time by leveraging parallelism, and to process...

78 | A parallel graph coloring heuristic,
- Jones, Plassmann
- 1993
Citation Context ... Instead of removing that vertex, one can give it the smallest color, and the algorithm becomes a parallel graph coloring algorithm. Many of distributed memory parallel graph coloring algorithms [3], [19], [20] relies on this technique. The second category of coloring algorithms relies on speculative coloring technique [21], [22], [6], [23]. In the simplest form [21], each processor tentatively colors...

66 | Linear degree extractors and the inapproximability of max clique and chromatic number,
- Zuckerman
- 2006
Citation Context ...lors assigned to the vertices. The problem has been known to be NP-Complete [13] and recently, it has shown that for all ɛ > 0, it is NP-Hard to approximate the graph coloring problem within |V|^(1−ɛ) [14]. Yet simple algorithms are known to provide almost optimal coloring for a majority of common graphs [5]. The sequential greedy coloring presented in Algorithm 1 is the most popular technique for grap...

59 | What color is your Jacobian? graph coloring for computing derivatives.
- Gebremedhin, Manne, et al.
- 2005
Citation Context ...t every two adjacent vertices have a different color. The graph coloring problem has been shown to be a critical ingredient in many scientific computing applications such as automatic differentiation [1], printed circuit testing [2], parallel numerical computation problems [3], register allocation [4], and optimization [5]. Today’s large scientific computing applications are typically executed on lar...

55 | An application of graph coloring to printed circuit testing.
- Garey, Johnson, et al.
- 1976
Citation Context ... have a different color. The graph coloring problem has been shown to be a critical ingredient in many scientific computing applications such as automatic differentiation [1], printed circuit testing [2], parallel numerical computation problems [3], register allocation [4], and optimization [5]. Today’s large scientific computing applications are typically executed on large scale parallel machines fo...

45 | Challenges in parallel graph processing
- Lumsdaine, Gregor, et al.
- 2007
Citation Context ...red for scalable hybrid processing, which will be the main focus of this work. The reason to undertake such a challenging task is that distributed systems are not ideal platforms for graph algorithms [10], furthermore, distributed memory graph coloring techniques (in fact almost all graph algorithms) suffer severe performance drawbacks when trying to use all the processing units of multicore clusters ...

41 | Iterated greedy graph coloring and the difficulty landscape.
- Culberson
- 1992
Citation Context ... vertex the smallest permissible color. The order of traversal of the graph is known to be of importance for reducing the number of colors used and many heuristics have been developed on that premise [17], [1]. Algorithm 1: Sequential greedy coloring. Data: G = (V, E) for each v ∈ V do for each w ∈ adj(v) do forbiddenColors[color[w]] ← v color[v] ← min{i > 0 : forbiddenColors[i] ≠ v} B. Parallel Gra...

40 | Graph Coloring Algorithms
- Matula, Marble, et al.
- 1972
Citation Context ...e algorithms are known to provide almost optimal coloring for a majority of common graphs [5]. The sequential greedy coloring presented in Algorithm 1 is the most popular technique for graph coloring [15], [16]. This algorithm simply visits the vertices of the graph in some order and assign to each vertex the smallest permissible color. The order of traversal of the graph is known to be of importance ...

31 | Scalable Parallel Graph Coloring Algorithms, Concurrency: Practice and Experience 12
- Gebremedhin, Manne
- 2000
Citation Context ...ithm. Many of distributed memory parallel graph coloring algorithms [3], [19], [20] relies on this technique. The second category of coloring algorithms relies on speculative coloring technique [21], [22], [6], [23]. In the simplest form [21], each processor tentatively colors parts of the graph independently of the other ones using the sequential greedy algorithm. Once the graph has been fully colore...

27 | A comparison of parallel graph coloring algorithms.
- Allwright, Bordawekar, et al.
- 1995
Citation Context ...roblem has been shown to be a critical ingredient in many scientific computing applications such as automatic differentiation [1], printed circuit testing [2], parallel numerical computation problems [3], register allocation [4], and optimization [5]. Today’s large scientific computing applications are typically executed on large scale parallel machines for mainly two reasons: to reduce the execution...

22 | Metrics and models for reordering transformations
- Strout, Hovland
Citation Context ...pn 1 ppn The experiments are run on six real-world graphs which come from various application areas including linear car analysis, finite element, structural engineering and automotive industry [22], [27]. They have been obtained from the University of Florida Sparse Matrix Collection and the Parasol project. The list of the graphs and their main properties are summarized in Table I. The number of co...

19 | Parallel heuristics for improved, balanced graph colorings.
- Gjertsen, Jones, et al.
- 1996
Citation Context ...ad of removing that vertex, one can give it the smallest color, and the algorithm becomes a parallel graph coloring algorithm. Many of distributed memory parallel graph coloring algorithms [3], [19], [20] relies on this technique. The second category of coloring algorithms relies on speculative coloring technique [21], [22], [6], [23]. In the simplest form [21], each processor tentatively colors parts...

14 | A framework for scalable greedy coloring on distributed memory parallel computers
- Bozdag, Gebremedhin, et al.
Citation Context ...ecute a sequential version of the algorithm. Or execute a distributed memory version of the graph algorithm. In many cases, the former is either infeasible due to memory limitations, or not efficient [6]. The advent of multicore architectures significantly increased the number of processing units within a single machine. Most supercomputers nowadays provide more than four processing cores per node, a...

9 | Challenges of scaling algebraic multigrid across modern multicore architectures, in:
- Baker, Gamblin, et al.
- 2011

5 | Overlapping computation and communication for advection on hybrid parallel computers
- White, Dongarra
Citation Context ... hardware specific intrinsics to be efficient which does not exists on the architectures targeted in this study. C. Hybrid Algorithms Many algorithms have been developed for hybrid systems [7], [25], [26], [8]. Baker et al. [7] experiments algebraic multigrid on a hybrid platform and discusses the challenges faced. They introduce the first comprehensive study of the performance of algebraic multigrid ...

5 | Classical coloring of graphs
- Kosowski, Manuszewski
- 2004
Citation Context ...rithms are known to provide almost optimal coloring for a majority of common graphs [5]. The sequential greedy coloring presented in Algorithm 1 is the most popular technique for graph coloring [15], [16]. This algorithm simply visits the vertices of the graph in some order and assign to each vertex the smallest permissible color. The order of traversal of the graph is known to be of importance for re...

4 | Parallel sparse matrix-vector multiplication as a test case for hybrid MPI+OpenMP programming
- Schubert, Hager, et al.
Citation Context ...cessing units within a single node. Hybrid systems have flourished in computation-intensive areas such as linear algebra [7], multiple sequence alignment [8] and parallel matrix-vector multiplication [9] which report significant performance improvements. To the best of our knowledge, graph algorithms have not been considered for scalable hybrid processing, which will be the main focus of this work. T...

4 | Parallel graph coloring algorithms using OpenMP
- Gebremedhin, I
- 1999
Citation Context ... algorithm. Many of distributed memory parallel graph coloring algorithms [3], [19], [20] relies on this technique. The second category of coloring algorithms relies on speculative coloring technique [21], [22], [6], [23]. In the simplest form [21], each processor tentatively colors parts of the graph independently of the other ones using the sequential greedy algorithm. Once the graph has been fully ...

3 | Improving graph coloring on distributed-memory parallel computers
- Sariyuce, Saule, et al.
- 2011
Citation Context ...y graph coloring techniques (in fact almost all graph algorithms) suffer severe performance drawbacks when trying to use all the processing units of multicore clusters using message passing libraries [11]. In this paper, we present the design and the development of a hybrid coloring algorithm. We provide a thorough experimental performance analysis of a careful implementation on a multicore cluster. O...

3 | New Multithreaded Ordering and Coloring Algorithms for Multicore Architectures
- Patwary, Gebremedhin, et al.
- 2011
Citation Context ...of distributed memory parallel graph coloring algorithms [3], [19], [20] relies on this technique. The second category of coloring algorithms relies on speculative coloring technique [21], [22], [6], [23]. In the simplest form [21], each processor tentatively colors parts of the graph independently of the other ones using the sequential greedy algorithm. Once the graph has been fully colored, all the ...

3 | Graph coloring algorithms for multi-core and massively multithreaded architectures. Parallel Computing
- Çatalyürek, Feo, et al.
- 2012
Citation Context ...explained in details in Section III. The third category, which includes most recent development in coloring algorithms, is dataflow coloring algorithms which has been originally designed for Cray XMT [24]. The main difference of dataflow coloring algorithm is how the coloring of a vertex is initiated or triggered (and in some versions, how these coloring tasks are assigned to a processing element). By...

2 | Hybrid MPI/OpenMP strategy for biological multiple sequence alignment with DIALIGN-TX in heterogeneous multicore clusters
- Macedo, Melo, et al.
Citation Context ...provement made possible by having multiple processing units within a single node. Hybrid systems have flourished in computation-intensive areas such as linear algebra [7], multiple sequence alignment [8] and parallel matrix-vector multiplication [9] which report significant performance improvements. To the best of our knowledge, graph algorithms have not been considered for scalable hybrid processing...

1 | Large scale complex network analysis using the hybrid combination of a MapReduce cluster and a highly multithreaded system
- Kang, Bader
Citation Context ...-level hardware specific intrinsics to be efficient which does not exists on the architectures targeted in this study. C. Hybrid Algorithms Many algorithms have been developed for hybrid systems [7], [25], [26], [8]. Baker et al. [7] experiments algebraic multigrid on a hybrid platform and discusses the challenges faced. They introduce the first comprehensive study of the performance of algebraic mult...
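
Several of the citation contexts above quote Algorithm 1, the sequential greedy coloring. The following is a minimal Python sketch of that pseudocode, not the paper's implementation. The function name `greedy_coloring` and the dictionary-based adjacency representation are assumptions made for illustration; colors are positive integers and 0 marks an uncolored vertex.

```python
def greedy_coloring(adj):
    """Sequential greedy coloring of an undirected graph.

    adj maps each vertex to an iterable of its neighbors (assumed
    representation). Returns a dict mapping each vertex to a color >= 1.
    """
    color = {v: 0 for v in adj}   # 0 means "not colored yet"
    forbidden = {}                # color -> vertex that last forbade it

    for v in adj:                 # visit vertices in the given order
        # Mark every color already used by a neighbor as forbidden for v.
        for w in adj[v]:
            forbidden[color[w]] = v
        # Pick the smallest color i > 0 that is not forbidden for v.
        c = 1
        while forbidden.get(c) == v:
            c += 1
        color[v] = c
    return color


if __name__ == "__main__":
    # A triangle (0, 1, 2) with a pendant vertex 3 attached to vertex 2.
    graph = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2]}
    print(greedy_coloring(graph))
```

Stamping `forbidden` with the current vertex, as in the quoted pseudocode, avoids clearing the array between vertices: an entry only counts as forbidden if it was written during the current vertex's pass. As the quoted text notes, the visiting order affects how many colors are used; here it is simply the iteration order of `adj`.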