Results 1 - 10
of
55
Reducing Power in High-performance Microprocessors
- In Design Automation Conference
"... Power consumption has become one of the biggest challenges in high-performance microprocessor design. The rapid increase in the complexity and speed of each new CPU generation is outstripping the benefits of voltage reduction and feature size scaling. Designers are thus continuously challenged to co ..."
Abstract
-
Cited by 66 (0 self)
- Add to MetaCart
Power consumption has become one of the biggest challenges in high-performance microprocessor design. The rapid increase in the complexity and speed of each new CPU generation is outstripping the benefits of voltage reduction and feature size scaling. Designers are thus continuously challenged to come up with innovative ways to reduce power, while trying to meet all the other constraints imposed on the design. This paper presents an overview of the issues related to power consumption in the context of Intel CPUs. The main trends that are driving the increased focus on design for low power are described. System and benchmarking issues, and sources of power consumption in a high-performance CPU are briefly described. Techniques that have been tried on real designs in the past are described. The role of CAD tools and their limitations in this domain will also be discussed. In addition, areas that need increased research focus in the future are also pointed out. 1. INTRODUCTION The drive...
A coding framework for low-power address and data busses
- IEEE Transactions on VLSI Systems
, 1999
"... Abstract—This paper presents a source-coding framework for the design of coding schemes to reduce transition activity. These schemes are suited for high-capacitance busses where the extra power dissipation due to the encoder and decoder circuitry is offset by the power savings at the bus. In this fr ..."
Abstract
-
Cited by 38 (0 self)
- Add to MetaCart
Abstract—This paper presents a source-coding framework for the design of coding schemes to reduce transition activity. These schemes are suited for high-capacitance busses where the extra power dissipation due to the encoder and decoder circuitry is offset by the power savings at the bus. In this framework, a data source (characterized in a probabilistic manner) is first passed through a decorrelating function �I. Next, a variant of entropy coding function �P is employed, which reduces the transition activity. The framework is then employed to derive novel encoding schemes whereby practical forms for �I and �P are proposed. Simulation results with an encoding scheme for data busses indicate an average reduction in transition activity of 36%. This translates into a reduction in total power dissipation for bus capacitances greater than 14 pF/b in 1.2-"m CMOS technology. For a typical value for bus capacitance of 50 pF/b, there is a 36 % reduction in power dissipation and eight times more power savings compared to existing schemes. Simulation results with an encoding scheme for instruction address busses indicate an average reduction in transition activity by a factor of 1.5 times over known coding schemes. Index Terms — CMOS VLSI, coding, high-capacitance busses, low-power design, switching activity.
Soft Digital Signal Processing
- IEEE Transactions on Very Large Scale Integration (VLSI
, 2001
"... In this paper, we propose a framework for low-energy digital signal processing (DSP) where the supply voltage is scaled beyond the critical voltage imposed by the requirement to match the critical path delay to the throughput. This deliberate introduction of input-dependent errors leads to degradati ..."
Abstract
-
Cited by 24 (4 self)
- Add to MetaCart
In this paper, we propose a framework for low-energy digital signal processing (DSP) where the supply voltage is scaled beyond the critical voltage imposed by the requirement to match the critical path delay to the throughput. This deliberate introduction of input-dependent errors leads to degradation in the algorithmic performance, which is compensated for via algorithmic noise-tolerance (ANT) schemes. The resulting setup that comprises of the DSP architecture operating at sub-critical voltage and the error control scheme is referred to as soft DSP. The effectiveness of the proposed scheme is enhanced when arithmetic units with a higher "delay-imbalance" are employed. A prediction based error-control scheme is proposed to enhance the performance of the filtering algorithm in presence of errors due to soft computations. For a frequency selective filter, it is shown that the proposed scheme pro- vides 60% - 81% reduction in energy dissipation for filter bandwidths up to 0.5 (where 2 corresponds to the sampling frequency f) over that achieved via conven- tional architecture and voltage scaling, with a maximum of 0.5dB degradation in the output signal-to-noise ratio ($NRo). It is also shown that the proposed algorithmic noise-tolerance schemes can also be used to improve the performance of DSP algorithms in presence of bit-error rates of upto 10-3 due to deep submicron (DSM) noise. I.
A Mathematical Basis For Power-Reduction In Digital VLSI Systems
- IEEE Trans. Circuits Syst. II
, 1997
"... Presented in this paper is a mathematical basis for power-reduction in VLSI systems. This basis is employed to 1.) derive lower bounds on the power dissipation in digital systems and 2.) unify existing power-reduction techniques under a common framework. The proposed basis is derived from informatio ..."
Abstract
-
Cited by 23 (15 self)
- Add to MetaCart
Presented in this paper is a mathematical basis for power-reduction in VLSI systems. This basis is employed to 1.) derive lower bounds on the power dissipation in digital systems and 2.) unify existing power-reduction techniques under a common framework. The proposed basis is derived from information-theoretic arguments. In particular, a digital signal processing algorithm is viewed as a process of information transfer with an inherent information transfer rate requirement of R bits/sec. Architectures implementing a given algorithm are equivalent to communication networks each with a certain capacity C (also in bits/sec). The absolute lower bound on the power dissipation for any given architecture is then obtained by minimizing the signal power such that its channel capacity C is equal to the desired information transfer rate R. By including various implementation constraints, increasingly realistic lower bounds are calculated. The usefulness of the proposed theory is demonstrated via...
Digital Circuit Optimization via Geometric Programming
- Operations Research
, 2005
"... informs ® doi 10.1287/opre.1050.0254 © 2005 INFORMS This paper concerns a method for digital circuit optimization based on formulating the problem as a geometric program (GP) or generalized geometric program (GGP), which can be transformed to a convex optimization problem and then very efficiently s ..."
Abstract
-
Cited by 19 (6 self)
- Add to MetaCart
informs ® doi 10.1287/opre.1050.0254 © 2005 INFORMS This paper concerns a method for digital circuit optimization based on formulating the problem as a geometric program (GP) or generalized geometric program (GGP), which can be transformed to a convex optimization problem and then very efficiently solved. We start with a basic gate scaling problem, with delay modeled as a simple resistor-capacitor (RC) time constant, and then add various layers of complexity and modeling accuracy, such as accounting for differing signal fall and rise times, and the effects of signal transition times. We then consider more complex formulations such as robust design over corners, multimode design, statistical design, and problems in which threshold and power supply voltage are also variables to be chosen. Finally, we look at the detailed design of gates and interconnect wires, again using a formulation that is compatible with GP or GGP.
Energy-Efficient Signal Processing via Algorithmic Noise-Tolerance
, 1999
"... In this paper, we propose a framework for low-energy digital signal processing (DSP) where the supply voltage is scaled beyond the critical voltage required to match the critical path delay to the throughput. This deliberate introduction of input-dependent errors leads to degradation in the algorith ..."
Abstract
-
Cited by 18 (2 self)
- Add to MetaCart
In this paper, we propose a framework for low-energy digital signal processing (DSP) where the supply voltage is scaled beyond the critical voltage required to match the critical path delay to the throughput. This deliberate introduction of input-dependent errors leads to degradation in the algorithmic performance, which is compensated for via algorithmic noise-tolerance (ANT) schemes. The resulting setup that comprises of the DSP architecture operating at sub-critical voltage and the error control scheme is referred to as soft DSP. It is shown that technology scaling renders the proposed scheme more effective as the delay penalty suffered due to voltage scaling reduces due to short channel effects. The effectiveness of the proposed scheme is also enhanced when arithmetic units with a higher "delay-imbalance" are employed. A prediction based error-control scheme is proposed to enhance the performance of the filtering algorithm in presence of errors due to soft computations. For a frequ...
Low-Power Scheduling with Resources Operating at Multiple Voltages
- IEEE Trans. on Circuits and Systems-II : Analog and Digital Signal Processing
, 2000
"... Abstract—This paper presents a resource-constrained scheduling scheme and a latency-constrained scheduling scheme that minimize power consumption for the case when the resources operate at multiple voltages. The resource-constrained scheduling reduces the power consumption by maximally utilizing res ..."
Abstract
-
Cited by 14 (0 self)
- Add to MetaCart
Abstract—This paper presents a resource-constrained scheduling scheme and a latency-constrained scheduling scheme that minimize power consumption for the case when the resources operate at multiple voltages. The resource-constrained scheduling reduces the power consumption by maximally utilizing resources operating at reduced voltages and, at the same time, reducing the latency. The latency-constrained scheduling scheme reduces the power consumption by assigning as many nodes (of the data flow graph) as possible to the resources operating at reduced voltages. Both schemes consider the effect of switching activity on the power consumption of the functional units. In addition, both schemes use heuristics to reduce the power consumed by the level shifters. Experiments with HLS benchmark examples show that the proposed schemes achieve significant power reduction when the operating voltages are 5 and 3.3 V or 5, 3.3, and 2.4 V. Index Terms—Behavioral synthesis, latency-constrained scheduling, low-power design, multiple voltage scheduling, resource-constrained scheduling. I.
Analytical Estimation of Signal Transition Activity from Word-Level Statistics
- IEEE Trans. on CAD
, 1997
"... Presented in this paper is a novel methodology to determine the average number of transitions in a signal from its word-level statistical description. The proposed methodology employs: 1.) high-level signal statistics, 2.) a statistical signal generation model, and 3.) the signal encoding (or num ..."
Abstract
-
Cited by 14 (2 self)
- Add to MetaCart
Presented in this paper is a novel methodology to determine the average number of transitions in a signal from its word-level statistical description. The proposed methodology employs: 1.) high-level signal statistics, 2.) a statistical signal generation model, and 3.) the signal encoding (or number representation) to estimate the transition activity for that signal. In particular, the signal statistics employed are mean (t), variance (2), and autocorrelation (p). The signal generation models considered are auto-regressive moving-average (ARMA) models. The signal encoding includes unsigned, one's complement, two's complement, and sign-magnitude representations.
Low Power Address Encoding using Self-Organizing Lists
- in ISLPED
, 2001
"... Off-chip bus transitions are a major source of power dissipation for embedded systems. In this paper, new adaptive encoding schemes are proposed that significantly reduce transition activity on data and multiplexed address buses, that do not add redundancy in space or time and which have minimal del ..."
Abstract
-
Cited by 14 (0 self)
- Add to MetaCart
Off-chip bus transitions are a major source of power dissipation for embedded systems. In this paper, new adaptive encoding schemes are proposed that significantly reduce transition activity on data and multiplexed address buses, that do not add redundancy in space or time and which have minimal delay overhead. These adaptive techniques are based on self-organising lists to achieve reduction in transition activity by exploiting the spatial and temporal locality of the addresses. Unlike previous approaches that focus on instruction address buses, experiments demonstrate significant reduction in transition activity of up to 54% in data address buses and up to 59% in multiplexed address buses. The average reductions are twice those obtained using current schemes on a data address bus and more than twice those obtained on a multiplexed address bus.
A Simple Packet Transmission Scheme for Wireless Data over Fading Channels
- IEEE TRANS. COMMUN
, 2004
"... In this paper, we present a simplified scheduling scheme for packet transmission over a fading channel which is modeled as a finite state block channel. We first derive the optimal minimum power transmission policy with constraints on both average delay and packet loss. This problem is seen to be th ..."
Abstract
-
Cited by 11 (0 self)
- Add to MetaCart
In this paper, we present a simplified scheduling scheme for packet transmission over a fading channel which is modeled as a finite state block channel. We first derive the optimal minimum power transmission policy with constraints on both average delay and packet loss. This problem is seen to be the dual problem of the work by Rajan et. al. [1] where the packet loss rate is minimized under constraints on average delay and power. The optimal policy requires a sophisticated tablelook -up for implementation. In order to alleviate this problem, we design a simplified transmission policy that is based on checking for three control parameters: a transmission rate threshold, a channel state threshold and the transmission buffer size. Our results show that the minimum average power with the simplified scheme is very close to that achieved by the optimal policy. By relaxing the packet loss constraint, the simplified policy is also found to allow reduced buffer sizes, thereby simplifying system implementation. With the simplified scheduling policy, the transmitter can be modeled as a bulk service queue and an upper bound for the average delay is derived. Further, the packet loss rate and the average transmit power are estimated using an imbedded Markov chain technique.

