Results 1 - 10
of
35
MagPIe: MPI’s Collective Communication Operations for Clustered Wide Area Systems
- Proc PPoPP'99
, 1999
"... Writing parallel applications for computational grids is a challenging task. To achieve good performance, algorithms designed for local area networks must be adapted to the differences in link speeds. An important class of algorithms are collective operations, such as broadcast and reduce. We have d ..."
Abstract
-
Cited by 138 (26 self)
- Add to MetaCart
Writing parallel applications for computational grids is a challenging task. To achieve good performance, algorithms designed for local area networks must be adapted to the differences in link speeds. An important class of algorithms are collective operations, such as broadcast and reduce. We have developed MAGPIE, a library of collective communication operations optimized for wide area systems. MAGPIE's algorithms send the minimal amount of data over the slow wide area links, and only incur a single wide area latency. Using our system, existing MPI applications can be run unmodified on geographically distributed systems. On moderate cluster sizes, using a wide area latency of 10 milliseconds and a bandwidth of 1 MByte/s, MAGPIE executes operations up to 10 times faster than MPICH, a widely used MPI implementation; application kernels improve by up to a factor of 4. Due to the structure of our algorithms, MAGPIE's advantage increases for higher wide area latencies.
Efficient Collective Communication in Distributed Heterogeneous Systems
- Journal of Parallel and Distributed Computing
, 1999
"... The Information Power Grid (IPG) is emerging as an infrastructure that will enable distributed applications – such as video conferencing and distributed interactive simulation – to seamlessly integrate collections of heterogeneous workstations, multiprocessors, and mobile nodes, over heterogeneous w ..."
Abstract
-
Cited by 71 (2 self)
- Add to MetaCart
The Information Power Grid (IPG) is emerging as an infrastructure that will enable distributed applications – such as video conferencing and distributed interactive simulation – to seamlessly integrate collections of heterogeneous workstations, multiprocessors, and mobile nodes, over heterogeneous wide-area networks. This paper introduces a framework for developing efficient collective communication schedules in such systems. Our framework consists of analytical models of the heterogeneous system, scheduling algorithms for the collective communication pattern, and performance evaluation mechanisms. We show that previous models, which considered node heterogeneity but ignored network heterogeneity, can lead to solutions which are worse than the optimal by an unbounded factor. We then introduce an enhanced communication model, and develop three heuristic algorithms for the broadcast and multicast patterns. The completion time of the schedule is chosen as the performance metric. The heuristic algorithms are FEF (Fastest Edge First), ECEF (Earliest Completing Edge First), and ECEF with look-ahead. For small system sizes, we find the optimal solution using exhaustive search. Our simulationexperiments indicate that the performance of our heuristic algorithms is close to optimal. For performance evaluation of larger systems, we have also developed a simple lower bound on the completion time. Our heuristic algorithms achieve significant performance improvements over previous approaches. 1.
Broadcast Scheduling Optimization for Heterogeneous Cluster Systems
, 2000
"... C.17D=174,>)EF=)+*F3(12-9= 24H3C#9>**D31 MD3<=1NIOMP QR 9U 1NIOMP QR [C(./\ =VC@ 29110-43290 * 2,- VC@ 29110-43290 IU(*VS^\a?&((G4&(11(&./b&(5?5J<3=7&(c9= 939> -40160 ?&((G4&(11( F=(<= 17 15600-39120 G4&(11(&./b&(5 oqphi Ers]r_"Ot uv40. 90-38070 k 27420-39120 ./b&(5?5J<3=7&(c9 ..."
Abstract
-
Cited by 33 (1 self)
- Add to MetaCart
C.17D=174,>)EF=)+*F3(12-9= 24H3C#9>**D31 MD3<=1NIOMP QR 9U 1NIOMP QR [C(./\ =VC@ 29110-43290 * 2,- VC@ 29110-43290 IU(*VS^\a?&((G4&(11(&./b&(5?5J<3=7&(c9= 939> -40160 ?&((G4&(11( F=(<= 17 15600-39120 G4&(11(&./b&(5 oqphi Ers]r_"Ot uv40. 90-38070 k 27420-39120 ./b&(5?5J<3=7&(c9= F= 90-38070 k 27420-39120 ./b&(5?5J<3=7&(c9= F=(<= MF3M 714?V./$F3 9=561|D= /$F3 15000-35970 0-37020 5 \ 561|D= /$F3 15000-35970 0-37020 5?5J<3=7&(c9= rs]r}12(8U^V./(_F39U561UD= &N1>&C* |&(1<= U561UD= *F=CF:rs$ry7H0939U (4%&(5?9>\./4 U H1*eI=\ >\./4 U *( 28829-29700 JF=D= U 5?5J<3=7&( O[ !@ 1.
Communication modeling of heterogeneous networks of workstations for performance characterization of collective operations
- In HCW’99, the 8th Heterogeneous Computing Workshop
, 1999
"... Abstract: Networks of Workstations (NOW) have become an attractive alternative platform for high performance computing. Due to the commodity nature of workstations and interconnects and due to the multiplicity of vendors and platforms, the NOW environments are being gradually redefined as Heterogene ..."
Abstract
-
Cited by 29 (0 self)
- Add to MetaCart
Abstract: Networks of Workstations (NOW) have become an attractive alternative platform for high performance computing. Due to the commodity nature of workstations and interconnects and due to the multiplicity of vendors and platforms, the NOW environments are being gradually redefined as Heterogeneous Networks of Workstations (HNOW). Having an accurate model for the communication in HNOW systems is crucial for design and evaluation of efficient communication layers for such systems. In this paper we present a model for point-to-point communication in HNOW systems and show how it can be used for characterizing the performance of different collective communication operations. In particular, we show how the performance of broadcast, scatter, and gather operations can be modeled and analyzed. We also verify the accuracy of our proposed model by using an experimental HNOW testbed. Furthermore, it is shown how this model can be used for comparing the performance of different collective communication algorithms. We also show how the effect of heterogeneity on the performance of collective communication operations can be predicted. 1
Bandwidth-efficient Collective Communication for Clustered Wide Area Systems
- In Proc. International Parallel and Distributed Processing Symposium (IPDPS 2000), Cancun
, 1999
"... Metacomputing infrastructures couple multiple clusters (or MPPs) via wide-area networks and thus allow parallel programs to run on geographically distributed resources. A major problem in programming such wide-area parallel applications is the difference in communication costs inside and between clu ..."
Abstract
-
Cited by 24 (3 self)
- Add to MetaCart
Metacomputing infrastructures couple multiple clusters (or MPPs) via wide-area networks and thus allow parallel programs to run on geographically distributed resources. A major problem in programming such wide-area parallel applications is the difference in communication costs inside and between clusters. Latency and bandwidth of WANs often are orders of magnitude worse than those of local networks. Our MagPIe library eases wide-area parallel programming by providing an efficient implementation of MPI's collective communication operations. MagPIe exploits the hierarchical structure of clustered wide-area systems and minimizes the communication overhead over the WAN links. In this paper, we present improved algorithms for collective communication that achieve shorter completion times by simultaneously using the aggregate bandwidth of the available wide-area links. Our new algorithms split messages into multiple segments that are sent in parallel over different WAN links, thus resulting ...
Assessing the impact and limits of steady-state scheduling for mixed task and data parallelism on heterogeneous platforms
, 2004
"... ..."
Complexity Results for Throughput and Latency Optimization of Replicated and Data-parallel Workflow
- ALGORITHMICA
, 2007
"... Mapping applications onto parallel platforms is a challenging problem, even for simple application patterns such as pipeline or fork graphs. Several antagonist criteria should be optimized for workflow applications, such as throughput and latency (or a combination). In this paper, we consider a si ..."
Abstract
-
Cited by 15 (12 self)
- Add to MetaCart
Mapping applications onto parallel platforms is a challenging problem, even for simple application patterns such as pipeline or fork graphs. Several antagonist criteria should be optimized for workflow applications, such as throughput and latency (or a combination). In this paper, we consider a simplified model with no communication cost, and we provide an exhaustive list of complexity results for different problem instances. Pipeline or fork stages can be replicated in order to increase the throughput by sending consecutive data sets onto different processors. In some cases, stages can also be data-parallelized, i.e. the computation of one single data set is shared between several processors. This leads to a decrease of the latency and an increase of the throughput. Some instances of this simple model are shown to be NP-hard, thereby exposing the inherent complexity of the mapping problem. We provide polynomial algorithms for other problem instances. Altogether, we provide solid theoretical foundations for the study of mono-criterion or bi-criteria mapping optimization problems.
Broadcast trees for heterogeneous platforms
- 19th International Parallel and Distributed Processing Symposium (IPDPS’05
, 2005
"... Laboratoire de l'Informatique du Paralle'lisme E'cole Normale Supe'rieure de LyonUnite ' Mixte de Recherche CNRS-INRIA-ENS LYON-UCBL no 5668 ..."
Abstract
-
Cited by 13 (2 self)
- Add to MetaCart
Laboratoire de l'Informatique du Paralle'lisme E'cole Normale Supe'rieure de LyonUnite ' Mixte de Recherche CNRS-INRIA-ENS LYON-UCBL no 5668
Optimizing the steady-state throughput of scatter and reduce operations on heterogeneous platforms
, 2005
"... ..."
On multicast algorithms for heterogeneous networks of workstations
- Journal of Parallel and Distributed Computing
, 2001
"... Networks of workstations (NOWs) provide an economical platform for high performance parallel computing. Such networks may comprise a variety of different types of workstations and network devices. This paper addresses the problem of efficient multicast in a heterogeneous communication model. Althoug ..."
Abstract
-
Cited by 11 (0 self)
- Add to MetaCart
Networks of workstations (NOWs) provide an economical platform for high performance parallel computing. Such networks may comprise a variety of different types of workstations and network devices. This paper addresses the problem of efficient multicast in a heterogeneous communication model. Although the problem offinding optimal multicast schedules is known to be NP-complete in this model, a greedy algorithm has been shown experimentally to find good solutions in practice. In this paper we show that the greedy algorithm finds provably near-optimal schedules in polynomial time and that optimal schedules can be found in polynomial time when the number of distinct types ofworkstations is bounded by a constant. Specifically, this paper presents three results. First, when there are n workstations ofsome constant k distinct types, the greedy algorithm is shown to find schedules that complete at most a constant additive term later than optimal. Second, an algorithm is given that finds optimal schedules in time O(n 2k). Finally, it is shown that for the general problem, the greedy algorithm finds solutions that complete the multicast in at most twice the optimal time. © 2001 Elsevier Science Key Words: networks ofworkstations; heterogeneous networks; multicast communication; approximation algorithms; dynamic programming.

