Results 1 - 10 of 16
Realistic communication model for parallel computing on cluster
 In Proceedings of the International Workshop on Cluster Computing
, 1999
Cited by 13 (5 self)
©1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Towards a Scalable Parallel Object Database - The Bulk Synchronous Parallel Approach
, 1996
Cited by 9 (2 self)
Parallel computers have been successfully deployed in many scientific and numerical application areas, although their use in non-numerical and database applications has been scarce. In this report, we first survey the architectural advancements beginning to make general-purpose parallel computing cost-effective, the requirements for non-numerical (or symbolic) applications, and the previous attempts to develop parallel databases. The central theme of the Bulk Synchronous Parallel model is to provide a high-level abstraction of parallel computing hardware whilst providing a realisation of a parallel programming model that enables architecture-independent programs to deliver scalable performance on diverse hardware platforms. Therefore, the primary objective of this report is to investigate the feasibility of developing a portable, scalable, parallel object database, based on the Bulk Synchronous Parallel model of computation. In particular, we devise a way of providing high-level abstra...
Performance Studies of High-Speed Communication on Commodity Cluster
, 2001
Cited by 6 (0 self)
Parallel computing is becoming increasingly accessible through advancement in microprocessors and networking technologies. It is found that the performances of the clusters do not match with their promises, although they are built on the most advanced components. Much effort has been devoted to address the software overhead problem in the past, which is known as the major hindrance in achieving high performance. This thesis shows that having a low-latency communication system does not guarantee high performance, as there are other communication issues that have not been addressed by the use of low-latency communication, such as contention, communication patterns and scheduling of communication events. The development of an efficient parallel application depends upon a realistic prediction of application behavior and the ability to explain the performance characteristics of an application on a parallel system; this requires in-depth understanding of both the application and the architecture characteristics. This dissertation proposes the use of a realistic communication model to guide the performance understanding and the algorithm design processes, which are the keys to
Truly Efficient Parallel Algorithms: c-Optimal Multisearch for an Extension of the BSP Model (Extended Abstract)
 Proc. of European Symposium on Algorithms
, 1995
Cited by 4 (1 self)
Armin Bäumker, Wolfgang Dittrich and Friedhelm Meyer auf der Heide, Department of Mathematics and Computer Science and Heinz Nixdorf Institute, University of Paderborn, 33095 Paderborn, Germany.
In this paper we design and analyse parallel algorithms with the goal to get exact bounds on their speedups on real machines. For this purpose we define an extension of Valiant's BSP model, BSP*, that rewards blockwise communication, and use Valiant's notion of c-optimality. Intuitively, a c-optimal parallel algorithm for p processors achieves speedup close to p/c. We consider the Multisearch problem: Assume a strip in 2D to be partitioned into m segments. Given n query points in the strip, the task is to locate, for each query, its segment. For m ≤ n we present a deterministic BSP* algorithm that is 1-optimal, if n = Ω(p log² p). For m > n, we present a randomized BSP* algorithm that is (1 + δ)-optimal for arbitrary δ > 0, m ≤ 2^p and n = Ω(p log² p). Both r...
Descriptive Simplicity in Parallel Computing
, 1997
Cited by 1 (0 self)
and there currently exist a wide selection of parallel programming languages and environments. This thesis presents and examines the Hierarchical Skeleton Model (HSM), a model of parallel programming that combines ease of use, portability and flexibility.
Truly Efficient Parallel Algorithms: 1-Optimal Multisearch for an Extension of the BSP Model
, 1998
Cited by 1 (0 self)
In this paper we design and analyse parallel algorithms with the goal to get exact bounds on their speedups on real machines. For this purpose we define an extension of Valiant's BSP model, BSP*, that rewards blockwise communication, and use Valiant's notion of 1-optimality. Intuitively, a 1-optimal parallel algorithm for p processors achieves speedup close to p. We consider the Multisearch Problem: Assume a strip in 2D to be partitioned into m segments. Given n query points in the strip, the task is to locate, for each query, its segment. For m ≤ n p we present a deterministic BSP* algorithm that is 1-optimal, if n ≥ p log² n. For m > n p, we present a randomized BSP* algorithm that is 1-optimal with high probability, if m ≤ 2^p and n ≥ p log³ n. Both results hold for a wide range of BSP* parameters where the range becomes larger with growing input size n. We further report on implementation work. Previous parallel algorithms for Multisearch were far away from being 1-op...
An Analytic Model for ATM Network Performance and its Application to BSP
, 1997
Cited by 1 (0 self)
Bulk Synchronous Parallelism (BSP) as a parallel model enables accurate costs of parallel programs to be predicted from the program structure and two architectural parameters, g, the permeability of the network, and l, the time required for barrier synchronisation. Networks such as ATM already play a role in parallel computers built as networks of workstations, and may become the standard mechanism for interconnecting processors at all scales. We present an analytic model for determining the BSP parameters of such architectures. Although the model is simple, there is substantial agreement with measured results where these are known. This represents the first time that these architectural parameters have been determined other than by benchmarking, and suggests that the approach may be serviceable for other wormhole routed networks. Keywords: parallel computing, interconnection network, performance modelling, total exchange, bulk synchronous parallelism, latency, throughput.
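As a rough illustration of how a BSP cost can be predicted from program structure plus the two parameters g and l, the sketch below uses the standard BSP cost formula, in which a superstep costs w + h*g + l. It is not taken from the paper above: the `Superstep` class, the `bsp_cost` function and all numeric values are hypothetical and chosen only for the example.

```python
from dataclasses import dataclass


@dataclass
class Superstep:
    """One BSP superstep (hypothetical helper type for this sketch)."""
    work: float  # w: maximum local computation time on any processor
    h: int       # h: maximum number of words sent or received by any processor


def bsp_cost(supersteps, g, l):
    """Predicted total cost: the sum of w + h*g + l over all supersteps.

    g is the network permeability (time per word communicated) and
    l is the time required for barrier synchronisation.
    """
    return sum(s.work + s.h * g + l for s in supersteps)


# Made-up program with two supersteps and made-up machine parameters.
program = [Superstep(work=1000, h=50), Superstep(work=400, h=200)]
print(bsp_cost(program, g=4.0, l=100.0))  # prints 2600.0
```

With these assumed values each superstep costs 1300 time units, so the second superstep, although it computes less, is just as expensive because of its heavier communication.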
Communication Efficient Parallel Searching
, 1997
Cited by 1 (0 self)
Searching is one of the most important algorithmic problems, used as a subroutine in many applications. Accordingly, designing search algorithms has been at the center of research on data structures for decades. In this paper we aim to survey recent developments in designing parallel search algorithms where parallel machines are used to answer many search queries in parallel, so-called multisearch algorithms. We briefly describe the current state of multisearch algorithms based on hashing and binary search, as they are developed for abstract parallel models like the PRAM. The main part of the paper describes deterministic and randomized multisearch algorithms that are very communication-efficient. As a computation and cost model we employ Valiant's BSP model and its variant BSP* due to Bäumker et al.
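To make the multisearch problem concrete, here is a minimal sequential sketch that answers a batch of queries by binary search. It is a one-dimensional simplification and not any of the BSP or BSP* algorithms described in the papers listed here: segments are assumed to be given by a sorted list of cut points, and the function name `multisearch` and the sample data are hypothetical.

```python
from bisect import bisect_right


def multisearch(boundaries, queries):
    """Return, for each query, the index of the segment that contains it.

    Assumption for this sketch: `boundaries` is a sorted list of m-1 cut
    points that split the line into m segments indexed 0..m-1, where
    segment i covers [boundaries[i-1], boundaries[i]).
    """
    return [bisect_right(boundaries, q) for q in queries]


boundaries = [10, 20, 30]   # three cut points, hence m = 4 segments
queries = [5, 10, 25, 99]   # n = 4 query points
print(multisearch(boundaries, queries))  # prints [0, 1, 2, 3]
```

Each query costs O(log m) here. A parallel version would distribute the queries over the processors, and that distribution step is where the communication costs discussed in the abstract above arise.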