• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 74
Next 10 →

Clock Rate versus IPC: The End of the Road for Conventional Microarchitectures

by Vikas Agarwal, M.S. Hrishikesh, Stephen W. Keckler, Doug Burger , 2000
"... The doubling of microprocessor performance every three years has been the result of two factors: more transistors per chip and superlinear scaling of the processor clock with technology generation. Our results show that, due to both diminishing improvements in clock rates and poor wire scaling as se ..."
Abstract - Cited by 324 (23 self) - Add to MetaCart
performance---estimating both clock rate and IPC--- of an aggressive out-of-order microarchitecture as it is scaled from a 250nm technology to a 35nm technology. We perform this analysis for three clock scaling targets and two microarchitecture scaling strategies: pipeline scaling and capacity scaling. We

Dynamic IPC/Clock Rate Optimization

by David H. Albonesi - PROCEEDINGS OF THE 25TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE , 1998
"... Current microprocessor designs set the functionality and clock rate of the chip at design time based on the configuration that achieves the best overall performance over a range of target applications. The result may be poor performance when running applications whose requirements are not well-match ..."
Abstract - Cited by 86 (19 self) - Add to MetaCart
-matched to the particular hardware organization chosen. We present a new approach called Complexity-Adaptive Processors (CAPs) in which the IPC/clock rate tradeoff can be altered at runtime to dynamically match the changing requirements of the instruction stream. By exploiting repeater methodologies used increasingly

Improving IPC by kernel design

by Jochen Liedtke - IN 14TH ACM SYMPOSIUM ON OPERATING SYSTEM PRINCIPLES (SOSP , 1993
"... Inter-process communication (ipc) has to be fast and e ective, otherwise programmers will not use remote procedure calls (RPC), multithreading and multitasking adequately. Thus ipc performance is vital for modern operating systems, especially µ-kernel based ones. Surprisingly, most µ-kernels exhibit ..."
Abstract - Cited by 194 (20 self) - Add to MetaCart
-kernels exhibit poor ipc performance, typically requiring 100 µs for a short message transfer on a modern processor, running with 50 MHz clock rate. In contrast, we achieve 5 µs; a twentyfold improvement. This paper describes the methods and principles used, starting from the architectural design and going down

Improving IPC by Kernel Design

by unknown authors
"... gmd.de Inter-process communication (ipc) haa to be fast and effective, otherwise programmers will not use remote procedure calls (RPC), multithreading and multitasking adequately. Thus ipc performance is vital for modern operating systems, especially p-kernel based ones. Surprisingly, most,u-kernels ..."
Abstract - Add to MetaCart
,u-kernels exhibit poor ipc performance, typically requiring 100,US for a short message transfer on a modern processor, running with 50 MHz clock rate. In contrast, we achieve 5,US; a twentyfold improvement. This paper describes the methods and principles used, starting from the architectural design and going down

Genomic clocks and evolutionary timescales. Trends Genet

by S. Blair Hedges, Sudhir Kumar , 2003
"... For decades, molecular clocks have helped to illuminate the evolutionary timescale of life, but now genomic data pose a challenge for time estimation methods. It is unclear how to integrate data from many genes, each potentially evolving under a different model of substitution and at a different rat ..."
Abstract - Cited by 76 (8 self) - Add to MetaCart
rate. Current methods can be grouped by the way the data are handled (genes considered separately or combined into a ‘supergene’) and the way gene-specific rate models are applied (global versus local clock). There are advantages and disadvantages to each of these approaches, and the optimal method has

Multiple-banked register file architectures

by José-Lorenzo Cruz , Antonio González , Mateo Valero , Nigel P Topham - In International Symposium on Computer Architecture(ISCA-27 , 2000
"... Abstract The register file access time is one of the critical delays in current superscalar processors. Its impact on processor performance is likely to increase in future processor generations, as they are expected to increase the issue width (which implies more register ports) and the size of the ..."
Abstract - Cited by 146 (12 self) - Add to MetaCart
of the instruction window (which implies more registers), and to use some kind of multithreading. Under this scenario, the register file access time could be a dominant delay and a pipelined implementation would be desirable to allow for high clock rates. However, a multi-stage register file has severe implications

Increasing the Instruction Fetch Rate via Multiple Branch Prediction and a Branch Address Cache

by Tse-yu Yeh, Deborah T. Marr, Yale N. Patt , 1993
"... High performance computer implementation today is increasingly directed toward parallelism in the hardware. Superscalar machines, where the hardware can issue more than one instruction each cycle, are being adopted by more implementations. As the trend toward wider issue rates continues, so too must ..."
Abstract - Cited by 109 (5 self) - Add to MetaCart
clock for a machine front-end. For one, two, and three basic blocks, the IPC f of integer benchmark...

A Design Space Evaluation of Grid Processor Architectures

by Ramadass Nagarajan, Karthikeyan Sankaralingam, Doug Burger, Stephen W. Keckler , 2001
"... In this paper, we survey the design space of a new class of architec-tures called Grid Processor Architectures (GPAs). These architectures are designed to scale with technology, allowing faster clock rates than conventional architectures while providing superior instruction-level parallelism on trad ..."
Abstract - Cited by 118 (33 self) - Add to MetaCart
In this paper, we survey the design space of a new class of architec-tures called Grid Processor Architectures (GPAs). These architectures are designed to scale with technology, allowing faster clock rates than conventional architectures while providing superior instruction-level parallelism

ILP is Dead, Long Live IPC! (A position paper.)

by Augustus K. Uht , 2005
"... We will discuss the current state of microprocessor architecture, where it is currently headed, and where it should be headed. Specifically, until recently processors consisted of one copy of a CPU, the latter exploiting as much Instruction-Level Parallelism as possible. This improved performance. U ..."
Abstract - Add to MetaCart
. Unfortunately, two trends collided and caused a rapid shift in processor architecture: 1) The architecture community's ability to extract ILP from typical code asymptotically approached zero, so processor companies kept increasing the CPU clock rate to compensate and improve performance in a brute force

Thread Criticality Predictors for Dynamic Performance, Power, and Resource Management in Chip Multiprocessors

by Abhishek Bhattacharjee, Margaret Martonosi
"... With the shift towards chip multiprocessors (CMPs), exploiting and managing parallelism has become a central problem in computer systems. Many issues of parallelism management boil down to discerning which running threads or processes are critical, or slowest, versus which are non-critical. If one c ..."
Abstract - Cited by 63 (0 self) - Add to MetaCart
can accurately predict critical threads in a parallel program, then one can respond in a variety of ways. Possibilities include running the critical thread at a faster clock rate, performing load balancing techniques to offload work onto currently non-critical threads, or giving the critical thread
Next 10 →
Results 1 - 10 of 74
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University