Results 11 -
19 of
19
Document for a Standard Message-Passing Interface
, 1993
"... this document, along with a brief description of each. ..."
Process Topologies
"... Introduction This chapter discusses the MPI topology mechanism. A topology is an extra, optional attribute that one can give to an intra-communicator; topologies cannot be added to intercommunicators. A topology can provide a convenient naming mechanism for the processes of a group (within a commun ..."
Abstract
- Add to MetaCart
Introduction This chapter discusses the MPI topology mechanism. A topology is an extra, optional attribute that one can give to an intra-communicator; topologies cannot be added to intercommunicators. A topology can provide a convenient naming mechanism for the processes of a group (within a communicator), and additionally, may assist the runtime system in mapping the processes onto hardware. As stated in chapter ??, a process group in MPI is a collection of n processes. Each process in the group is assigned a rank between 0 and n-1. In many parallel applications a linear ranking of processes does not adequately reflect the logical communication pattern of the processes (which is usually determined by the underlying problem geometry and the numerical algorithm used). Often the processes are arranged in topological patterns such as two- or three-dimensional grids. More generally, th
Mapping Strategies for Switch-Based Cluster Systems of Irregular Topology
, 2001
"... Mapping virtual process topology to physical processor topology is one of the most important issues in parallel computing. The mapping problem for switch-based cluster systems of irregular topology is very complicated due to the connection irregularity and routing complexity. This paper proposes two ..."
Abstract
- Add to MetaCart
Mapping virtual process topology to physical processor topology is one of the most important issues in parallel computing. The mapping problem for switch-based cluster systems of irregular topology is very complicated due to the connection irregularity and routing complexity. This paper proposes two mapping schemes for irregular cluster systems, which try to map the nearest neighbors in the process topology to physically adjacent processors. In addition, an application-oriented performance metric, weightedcardinality, is introduced to represent the quality of mapping. Simulation study shows that, for a virtual topology of a 16 \Theta 16 mesh, the proposed mapping schemes result in better mapping quality and about 15 20% shorter communication latency compared to random mapping. The proposed algorithms should also be beneficial when they are appliedtometacomputing and cluster of cluster systems, wherethecommunication costs areanorder of magnitude different depending on the relative position of the processor nodes.
Scientific and Engineering Computation
"... this document, the rationale for design choices made in the interface specification is set off in this format. Some readers may wish to skip these sections, while readers interested in interface design may want to read them carefully. (End of rationale.) Advice to users. Throughout this document, m ..."
Abstract
- Add to MetaCart
this document, the rationale for design choices made in the interface specification is set off in this format. Some readers may wish to skip these sections, while readers interested in interface design may want to read them carefully. (End of rationale.) Advice to users. Throughout this document, material that speaks to users and illustrates usage is set off in this format. Some readers may wish to skip these sections, while readers interested in programming in MPI may want to read them carefully. (End of advice to users.) Advice to implementors. Throughout this document, material that is primarily commentary to implementors is set off in this format. Some readers may wish to skip these sections, while readers interested in MPI implementations may want to read them carefully. (End of advice to implementors.) 1.7.2 Procedure Specification MPI procedures are specified using a language independent notation. The arguments of procedure calls are marked as IN, OUT or INOUT. The meanings of these are: ffl the call uses but does not update an argument marked IN, ffl the call may update an argument marked OUT, ffl the call both uses and updates an argument marked INOUT
2009 18th International Conference on Parallel Architectures and Compilation Techniques Core-Selectability in Chip Multiprocessors
"... Abstract—The centralized structures necessary for the extraction of instruction-level parallelism (ILP) are consuming progressively smaller portions of the total die area of chip multiprocessors (CMP). The reason for this is that scaling these structures does not enhance general performance as much ..."
Abstract
- Add to MetaCart
Abstract—The centralized structures necessary for the extraction of instruction-level parallelism (ILP) are consuming progressively smaller portions of the total die area of chip multiprocessors (CMP). The reason for this is that scaling these structures does not enhance general performance as much as scaling the cache and interconnect. However, the fact that these structures now consume less proportional die area opens an avenue to enhancing their performance through truly overcoming the one-size-fits-all approach to their design. This paper proposes core-selectability – incorporating differently-designed cores that can be toggled into active employment. This enables differently customized ILP-extracting structures to be at hand in the system while not dramatically adding to the interconnect complexity. The design verification effort is minimized by separating the complexity of different core designs. Moreover, contrary to alternative approaches, the performance and power efficiency of the core designs are not compromised. Evaluation results are presented that show that, even when limiting the diversity between core designs to only the sizing of microarchitectural structures, core-selectability has the potential to provide notable performance enhancement (with an average of 10%) to scalable multithreaded applications, without increased concurrency. In addition, it can provide significantly greater throughput to multiprogrammed workloads by providing the potential for the system to transform into a heterogeneous design.
SUMMARY
"... The Message-Passing Interface (MPI) standard provides basic means for adaptations of the mapping of MPI process ranks to processing elements to better match the communication characteristics of applications to the capabilities of the underlying systems. The MPI process topology mechanism enables the ..."
Abstract
- Add to MetaCart
The Message-Passing Interface (MPI) standard provides basic means for adaptations of the mapping of MPI process ranks to processing elements to better match the communication characteristics of applications to the capabilities of the underlying systems. The MPI process topology mechanism enables the MPI implementation to rerank processes by creating a new communicator that reflects user-supplied information about the application communication pattern. With the newly released MPI 2.2 version of the MPI standard, the process topology mechanism has been enhanced with new interfaces
Workstations
"... (MPI) is a portable message-passing standard that facilitates development of parallel applications and libraries. MPI defines the syntax and semantics of a core of library routines useful to a wide range of users writing portable message-passing programs in Fortran 77 or C. The standard also forms a ..."
Abstract
- Add to MetaCart
(MPI) is a portable message-passing standard that facilitates development of parallel applications and libraries. MPI defines the syntax and semantics of a core of library routines useful to a wide range of users writing portable message-passing programs in Fortran 77 or C. The standard also forms a possible target for such language
Etudes et evaluation de reseaux . . . .
, 1997
"... The Asynchronous Transfer Mode (ATM) is an emerging communication protocol which is actually proposed as standard for Broadband-Integrated Services Digital Network (B-ISDN). It is based ..."
Abstract
- Add to MetaCart
The Asynchronous Transfer Mode (ATM) is an emerging communication protocol which is actually proposed as standard for Broadband-Integrated Services Digital Network (B-ISDN). It is based

