Results 11 -
19 of
19
Compiler Techniques for Software Prefetching on Cache-Coherent Shared-Memory Multiprocessors
"... This document describes a set of new techniques for improving the efficiency of compiler-directed software prefetching for parallel Fortran programs running on cache-coherent DSM (distributed shared memory) multiprocessors. The key component used in this scheme is a data flow framework that exploits ..."
Abstract
- Add to MetaCart
This document describes a set of new techniques for improving the efficiency of compiler-directed software prefetching for parallel Fortran programs running on cache-coherent DSM (distributed shared memory) multiprocessors. The key component used in this scheme is a data flow framework
Region-Based Prefetch Techniques for Software Distributed Shared Memory Systems
"... Abstract—Although shared memory programming models show good programmability compared to message passing programming models, their implementation by page-based software distributed shared memory systems usually suffers from high memory consistency costs. The major part of these costs is internode da ..."
Abstract
- Add to MetaCart
of an incorrectly prefetched page to be equivalent to that of a miss, these techniques have an effective page miss rate of 63 % and 71 % respectively. Additionally, these two techniques are compared with two well-known software distributed shared memory (sDSM) prefetch techniques, Adaptive++ and TODFCM. TRe
OpenMP for a Java DSM
"... Although OpenMP is a widely agreed-upon standard for the C/C++ and Fortran programming languages for semi-automatic parallelization of programs for shared memory machines, not much has been done on the binding of OpenMP to Java that targets clusters with distributed memory. This paper presents three ..."
Abstract
- Add to MetaCart
present our prototype implementation of JaMP in the research compiler Jackal, a software-based distributed shared memory implementation for Java. We evaluate the performance of JaMP with a set of micro-benchmarks and with OpenMP versions of the parallel Java Grande Forum (JGF) benchmarks. The micro
Tolerating Latency in Software Distributed Shared Memory Systems Through Non-Binding Prefetching
, 1998
"... A key obstacle to achieving high performance on software distributed shared memory (DSM) systems is their high memory latencies. Software-controlled prefetching tolerates memory latency by overlapping computation with communication. This thesis proposes and evaluates an implementation of software-co ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
A key obstacle to achieving high performance on software distributed shared memory (DSM) systems is their high memory latencies. Software-controlled prefetching tolerates memory latency by overlapping computation with communication. This thesis proposes and evaluates an implementation of software
Automatic Prefetching with Binary Code Rewriting in Object-based DSMs
"... Abstract. Dynamic optimizers modify the binary code of programs at runtime by profiling and optimizing certain aspects of the execution. We present a completely software-based framework that dynamically optimizes programs for object-based Distributed Shared Memory (DSM) systems. In DSM systems, redu ..."
Abstract
- Add to MetaCart
Abstract. Dynamic optimizers modify the binary code of programs at runtime by profiling and optimizing certain aspects of the execution. We present a completely software-based framework that dynamically optimizes programs for object-based Distributed Shared Memory (DSM) systems. In DSM systems
Compiler and Software Distributed Shared Memory Support for Irregular Applications
, 1997
"... We investigate the use of a software distributed shared memory (DSM) layer to support irregular computations on distributed memory machines. Software DSM supports irregular computation through demand fetching of data in response to memory access faults. With the addition of a very limited form of co ..."
Abstract
-
Cited by 47 (3 self)
- Add to MetaCart
We investigate the use of a software distributed shared memory (DSM) layer to support irregular computations on distributed memory machines. Software DSM supports irregular computation through demand fetching of data in response to memory access faults. With the addition of a very limited form
Comparative Evaluation of Latency Tolerance Techniques for Software Distributed Shared Memory
- In Proceedings of the 4th IEEE Symposium on High-Performance Computer Architecture
, 1998
"... A key challenge in achieving high performance on software DSMs is overcoming their relatively large communication latencies. In this paper, we consider two techniques which address this problem: prefetching and multithreading. While previous studies have examined each of these techniques in isolatio ..."
Abstract
-
Cited by 26 (2 self)
- Add to MetaCart
in isolation, this paper is the first to evaluate both techniques using a consistent hardware platform and set of applications, thereby allowing direct comparisons. In addition, this is the first study to consider combining prefetching and multithreading in a software DSM. We performed our experiments on real
Software Transactional Distributed Shared Memory
"... We have developed a transaction-based approach to distributed shared memory(DSM) that supports object caching and gener-ates path expression prefetches. A path expression specifies a path through the heap that traverses the objects to be prefetched. To our knowledge, this is the first prefetching ap ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
We have developed a transaction-based approach to distributed shared memory(DSM) that supports object caching and gener-ates path expression prefetches. A path expression specifies a path through the heap that traverses the objects to be prefetched. To our knowledge, this is the first prefetching
Distributed Shared Memory Management for Java
- In Proc. sixth annual conference of the Advanced School for Computing and Imaging (ASCI 2000
, 1999
"... Jackal is a fine-grained distributed shared memory system that can run multithreaded Java programs on distributedmemory systems. The Jackal compiler generates an access check for every use of an object field or array element. The overhead of the access checks is reduced using compiler optimizations. ..."
Abstract
-
Cited by 14 (3 self)
- Add to MetaCart
collectors, which also deal with replicated objects and partitioned arrays. The paper discusses the various distributed memory management techniques used in the system and presents performance results on a cluster of workstations. Keywords: Java, fine-grained software DSM, parallel garbage collection. 1