Results 1 -
5 of
5
A New Approach to I/O Performance Evaluation - Self-Scaling I/O Benchmarks, Predicted I/O Performance
, 1993
"... . Current I/O benchmarks suffer from several chronic problems: they quickly become obsolete, they do not stress the I/O system, and they do not help in understanding I/O system performance. We propose a new approach to I/O performance analysis. First, we propose a self-scaling benchmark that dynamic ..."
Abstract
-
Cited by 34 (2 self)
- Add to MetaCart
. Current I/O benchmarks suffer from several chronic problems: they quickly become obsolete, they do not stress the I/O system, and they do not help in understanding I/O system performance. We propose a new approach to I/O performance analysis. First, we propose a self-scaling benchmark that dynamically adjusts aspects of its workload according to the performance characteristic of the system being measured. By doing so, the benchmark automatically scales across current and future systems. The evaluation aids in understanding system performance by reporting how performance varies according to each of five workload parameters. Second, we propose predicted performance, a technique for using the results from the self-scaling evaluation to quickly estimate the performance for workloads that have not been measured. We show that this technique yields reasonably accurate performance estimates and argue that this method gives a far more accurate comparative performance evaluation than tradition...
Cache Coherence for Shared Memory Multiprocessors Based on Virtual Memory Support
, 1992
"... This paper presents a software cache coherence scheme that uses virtual memory (VM) support to maintain cache coherency for shared memory multiprocessors and requires no special hardware to do so. Traditional VM translation hardware in each processor is used to detect memory access attempts that wou ..."
Abstract
-
Cited by 24 (3 self)
- Add to MetaCart
This paper presents a software cache coherence scheme that uses virtual memory (VM) support to maintain cache coherency for shared memory multiprocessors and requires no special hardware to do so. Traditional VM translation hardware in each processor is used to detect memory access attempts that would violate cache coherence and system software is used to enforce coherence. The implementation of this class of coherence schemes is extremely economical: it requires neither special multiprocessor hardware nor compiler support, and easily incorporates different consistency models. We evaluated two consistency models for the VM-based approach: sequential consistency and lazy release consistency. The VM-based schemes are compared with a bus based snoopy caching architecture, and our trace-driven simulation results show that the VM-based cache coherence schemes are practical for small-scale, shared memory multiprocessors. Keywords: shared memory, multiprocessors, cache coherence, memory manag...
A Comparative Evaluation of Cache Coherence Schemes Based on Virtual Memory Support
, 1992
"... This paper presents an evaluation of a new class of software cache coherence schemes that use virtual memory (VM) support to maintain multiprocessor cache coherence. Traditional VM translation hardware in each processor detects memory access attempts that would violate cache coherence and system sof ..."
Abstract
- Add to MetaCart
This paper presents an evaluation of a new class of software cache coherence schemes that use virtual memory (VM) support to maintain multiprocessor cache coherence. Traditional VM translation hardware in each processor detects memory access attempts that would violate cache coherence and system software is used to enforce coherence. The implementation of this class of coherence schemes is extremely economical: it requires neither special multiprocessor hardware nor compiler support, and easily incorporates different consistency models. We have evaluated four consistency models for the VM-based approach: sequential consistency, singlewriter release consistency, release consistency, and lazy release consistency. Our trace-driven simulation results show that the VM-based cache coherence schemes are practical for small-scale multiprocessors and that the performance of lazy release consistency for multi-threaded parallel programs is close to the snoopy-cache invalidation-based coherence ap...
Evaluation of a "Stall" Cache: An Efficient Restricted On-chip Instruction Cache
"... In this report we compare the cost and performance of a new kind of restricted instruction cache architecture --- the stall cache --- against several other conventional cache architectures. The stall cache minimizes the size of an on-chip instruction cache by caching only those instructions whose in ..."
Abstract
- Add to MetaCart
In this report we compare the cost and performance of a new kind of restricted instruction cache architecture --- the stall cache --- against several other conventional cache architectures. The stall cache minimizes the size of an on-chip instruction cache by caching only those instructions whose instruction fetch phase collides with the memory access phase of a preceding load or store instruction. Many existing machines provide a single cycle external cache memory [6, 17, 2]. Our results show that, under this assumption, the stall cache always outperforms an equivalent sized on-chip instruction cache, reducing external memory access stalls by approximately 10%. In addition we present results for a system using an onchip data cache, and for one with a double width data bus and short instruction prefetch buffer. 1 Introduction RISC instruction sets are designed to facilitate pipelining, with many architectures aiming to issue at least one instruction per clock cycle [19, 20, 8, 18]. T...
1993 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems. A New Approach to I/O Performance Evaluation— Self-Scaling I/O Benchmarks, Predicted I/O Performance
"... Abstract. Current I/O benchmarks suffer from several chronic problems: they quickly become obsolete, they do not stress the I/O system, and they do not help in understanding I/O system performance. We propose a new approach to I/O performance analysis. First, we propose a self-scaling benchmark that ..."
Abstract
- Add to MetaCart
Abstract. Current I/O benchmarks suffer from several chronic problems: they quickly become obsolete, they do not stress the I/O system, and they do not help in understanding I/O system performance. We propose a new approach to I/O performance analysis. First, we propose a self-scaling benchmark that dynamically adjusts aspects of its workload according to the performance characteristic of the system being measured. By doing so, the benchmark automatically scales across current and future systems. The evaluation aids in understanding system performance by reporting how performance varies according to each of five workload parameters. Second, we propose predicted performance, a technique for using the results from the self-scaling evaluation to quickly estimate the performance for workloads that have not been measured. We show that this technique yields reasonably accurate performance estimates and argue that this method gives a far more accurate comparative performance evaluation than traditional single point benchmarks. We apply our new evaluation technique by measuring a SPARCstation 1+ with one SCSI disk, an HP 730 with one SCSI-II disk, a Sprite LFS DECstation 5000/200 with a three-disk disk array, a Convex C240 minisupercomputer with a four-disk disk array, and a Solbourne 5E/905 fileserver with a two-disk disk array. 1.

