Results 1 - 10
of
74
Clock Rate versus IPC: The End of the Road for Conventional Microarchitectures
, 2000
"... The doubling of microprocessor performance every three years has been the result of two factors: more transistors per chip and superlinear scaling of the processor clock with technology generation. Our results show that, due to both diminishing improvements in clock rates and poor wire scaling as se ..."
Abstract
-
Cited by 324 (23 self)
- Add to MetaCart
performance---estimating both clock rate and IPC--- of an aggressive out-of-order microarchitecture as it is scaled from a 250nm technology to a 35nm technology. We perform this analysis for three clock scaling targets and two microarchitecture scaling strategies: pipeline scaling and capacity scaling. We
Dynamic IPC/Clock Rate Optimization
- PROCEEDINGS OF THE 25TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE
, 1998
"... Current microprocessor designs set the functionality and clock rate of the chip at design time based on the configuration that achieves the best overall performance over a range of target applications. The result may be poor performance when running applications whose requirements are not well-match ..."
Abstract
-
Cited by 86 (19 self)
- Add to MetaCart
-matched to the particular hardware organization chosen. We present a new approach called Complexity-Adaptive Processors (CAPs) in which the IPC/clock rate tradeoff can be altered at runtime to dynamically match the changing requirements of the instruction stream. By exploiting repeater methodologies used increasingly
Improving IPC by kernel design
- IN 14TH ACM SYMPOSIUM ON OPERATING SYSTEM PRINCIPLES (SOSP
, 1993
"... Inter-process communication (ipc) has to be fast and e ective, otherwise programmers will not use remote procedure calls (RPC), multithreading and multitasking adequately. Thus ipc performance is vital for modern operating systems, especially µ-kernel based ones. Surprisingly, most µ-kernels exhibit ..."
Abstract
-
Cited by 194 (20 self)
- Add to MetaCart
-kernels exhibit poor ipc performance, typically requiring 100 µs for a short message transfer on a modern processor, running with 50 MHz clock rate. In contrast, we achieve 5 µs; a twentyfold improvement. This paper describes the methods and principles used, starting from the architectural design and going down
Improving IPC by Kernel Design
"... gmd.de Inter-process communication (ipc) haa to be fast and effective, otherwise programmers will not use remote procedure calls (RPC), multithreading and multitasking adequately. Thus ipc performance is vital for modern operating systems, especially p-kernel based ones. Surprisingly, most,u-kernels ..."
Abstract
- Add to MetaCart
,u-kernels exhibit poor ipc performance, typically requiring 100,US for a short message transfer on a modern processor, running with 50 MHz clock rate. In contrast, we achieve 5,US; a twentyfold improvement. This paper describes the methods and principles used, starting from the architectural design and going down
Genomic clocks and evolutionary timescales. Trends Genet
, 2003
"... For decades, molecular clocks have helped to illuminate the evolutionary timescale of life, but now genomic data pose a challenge for time estimation methods. It is unclear how to integrate data from many genes, each potentially evolving under a different model of substitution and at a different rat ..."
Abstract
-
Cited by 76 (8 self)
- Add to MetaCart
rate. Current methods can be grouped by the way the data are handled (genes considered separately or combined into a ‘supergene’) and the way gene-specific rate models are applied (global versus local clock). There are advantages and disadvantages to each of these approaches, and the optimal method has
Multiple-banked register file architectures
- In International Symposium on Computer Architecture(ISCA-27
, 2000
"... Abstract The register file access time is one of the critical delays in current superscalar processors. Its impact on processor performance is likely to increase in future processor generations, as they are expected to increase the issue width (which implies more register ports) and the size of the ..."
Abstract
-
Cited by 146 (12 self)
- Add to MetaCart
of the instruction window (which implies more registers), and to use some kind of multithreading. Under this scenario, the register file access time could be a dominant delay and a pipelined implementation would be desirable to allow for high clock rates. However, a multi-stage register file has severe implications
Increasing the Instruction Fetch Rate via Multiple Branch Prediction and a Branch Address Cache
, 1993
"... High performance computer implementation today is increasingly directed toward parallelism in the hardware. Superscalar machines, where the hardware can issue more than one instruction each cycle, are being adopted by more implementations. As the trend toward wider issue rates continues, so too must ..."
Abstract
-
Cited by 109 (5 self)
- Add to MetaCart
clock for a machine front-end. For one, two, and three basic blocks, the IPC f of integer benchmark...
A Design Space Evaluation of Grid Processor Architectures
, 2001
"... In this paper, we survey the design space of a new class of architec-tures called Grid Processor Architectures (GPAs). These architectures are designed to scale with technology, allowing faster clock rates than conventional architectures while providing superior instruction-level parallelism on trad ..."
Abstract
-
Cited by 118 (33 self)
- Add to MetaCart
In this paper, we survey the design space of a new class of architec-tures called Grid Processor Architectures (GPAs). These architectures are designed to scale with technology, allowing faster clock rates than conventional architectures while providing superior instruction-level parallelism
ILP is Dead, Long Live IPC! (A position paper.)
, 2005
"... We will discuss the current state of microprocessor architecture, where it is currently headed, and where it should be headed. Specifically, until recently processors consisted of one copy of a CPU, the latter exploiting as much Instruction-Level Parallelism as possible. This improved performance. U ..."
Abstract
- Add to MetaCart
. Unfortunately, two trends collided and caused a rapid shift in processor architecture: 1) The architecture community's ability to extract ILP from typical code asymptotically approached zero, so processor companies kept increasing the CPU clock rate to compensate and improve performance in a brute force
Thread Criticality Predictors for Dynamic Performance, Power, and Resource Management in Chip Multiprocessors
"... With the shift towards chip multiprocessors (CMPs), exploiting and managing parallelism has become a central problem in computer systems. Many issues of parallelism management boil down to discerning which running threads or processes are critical, or slowest, versus which are non-critical. If one c ..."
Abstract
-
Cited by 63 (0 self)
- Add to MetaCart
can accurately predict critical threads in a parallel program, then one can respond in a variety of ways. Possibilities include running the critical thread at a faster clock rate, performing load balancing techniques to offload work onto currently non-critical threads, or giving the critical thread
Results 1 - 10
of
74