|
893
|
Using MPI – Portable Parallel Programming with message-passing Interface, Second edition
– Gropp William, Ewing Lusk, Anthony skjellum
- 1999
|
|
35
|
The Implementation of MPI-2 One-Sided Communication for the NEC SX-5
– Jesper Larsson Träff, Hubert Ritzdorf, Rolf Hempel
- 2000
|
|
98
|
ARMCI: A Portable Remote Memory Copy Library for Distributed Array Libraries and Compiler Run-Time Systems
– Jarek Nieplocha, Bryan Carpenter
- 1999
|
|
67
|
GASNet Specification, v1.1
– Dan Bonachea
- 2002
|
|
6
|
NIC-Based Atomic Remote Memory Operations
– D Buntinas, D K Panda, W Gropp
- 2002
|
|
7
|
Portable and Effcient Parallel Computing Using the BSP Model
– M Goudreau, K Lang, S B Rao, T Suel, T Tsantilas
- 1999
|
|
7
|
RDMA Protocol Verbs Specification (Version 1.0
– J Hilland, P Culley, J Pinkerton, R Recio
- 2003
|
|
651
|
A high-performance, portable implementation of the MPI message passing interface standard
– Ewing Lusk, Nathan Doss, Anthony Skjellum
- 1996
|
|
255
|
The design and implementation of FFTW3
– Matteo Frigo, Steven, G. Johnson
- 2005
|
|
8
|
Distributed Queue Based Locking Using Advanced Network Features
– A Devulapalli, P Wyckoff
- 2005
|
|
7
|
High Performance Distributed Lock Management Services using Networkbased Remote Atomic Operations
– S. Narravula, A. Mamidala, A. Vishnu, K. Vaidyanathan, D. K. Panda
- 2007
|
|
5
|
Analysis of Implementation Options for MPI-2 One-Sided
– Brian W. Barrett, Galen M. Shipman, Andrew Lumsdaine
- 2007
|
|
3
|
E.Apra. An evaluation of two implementation strategies for optimizing one-sided atomic reduction
– Jarek Nieplocha, Edoardo Apra
|
|
3
|
Optimizing the HPCC randomaccess benchmark on blue Gene/L Supercomputer
– Rahul Garg, Yogish Sabharwal
- 2006
|
|
3
|
Designing Passive Synchronization for MPI-2 One-Sided Communication to Maximize Overlap ∗
– Gopal Santhanaraman, Sundeep Narravula, Dhabaleswar K. Panda
|
|
3
|
Unified Parallel C (UPC) Project. http://upc.lbl.gov
– Berkeley
|
|
3
|
Hydra-mpi: An adaptive particle-particle, particle-mesh code for conducting cosmological simulations on mpp architectures. High Performance Computing Systems and Applications
– G Pringle R J Thacker, H M P Couchman, S Booth
- 2003
|
|
3
|
Minimizing Synchronization Overhead
– R Thakur, W Gropp, B Toonen
- 2004
|
|
2
|
Overview of the HPC Challenge Benchmark Suite
– Jack J. Dongarra, I. High, Productivity Computing Systems
|