Results 1 -
7 of
7
Digit-Set Conversions: Generalizations and Applications
- IEEE Transactions on Computers
, 1995
"... The problem of digit set conversion for fixed radix is investigated for the case of converting into a non-redundant, as well as into a redundant digit set. Conversion may be from very general digit sets, and covers as special cases multiplier recodings, additions and certain multiplications. We gene ..."
Abstract
-
Cited by 20 (5 self)
- Add to MetaCart
The problem of digit set conversion for fixed radix is investigated for the case of converting into a non-redundant, as well as into a redundant digit set. Conversion may be from very general digit sets, and covers as special cases multiplier recodings, additions and certain multiplications. We generalize known algorithms for conversions into non-redundant digit sets, as well as apply conversion to generalize the O(log n) time algorithm for conditional sum addition using parallel prefix computation, and a comparison is made with standard carry-lookahead techniques. Examples on multi-operand addition are used to illustrate the generality of this approach. O(1) time algorithms for converting into redundant digit sets are generalized based on a very simple lemma, which provides a framework for all conversions into redundant digit sets. Applications in multiplier recoding and partial product accumulation are used here as exemplifications. Keywords: Computer arithmetic, digit set conversio...
Reduced Power Dissipation Through Truncated Multiplication
- in IEEE Alessandro Volta Memorial Workshop on Low Power Design
, 1999
"... Reducing the power dissipation of parallel multipliers is important in the design of digital signal processing systems. In many of these systems, the products of parallel multipliers are rounded to avoid growth in word size. The power dissipation and area of rounded parallel multipliers can be signi ..."
Abstract
-
Cited by 15 (5 self)
- Add to MetaCart
Reducing the power dissipation of parallel multipliers is important in the design of digital signal processing systems. In many of these systems, the products of parallel multipliers are rounded to avoid growth in word size. The power dissipation and area of rounded parallel multipliers can be significantly reduced by a technique known as truncated multiplication. With this technique, the least significant columns of the multiplication matrix are not used. Instead, the carries generated by these columns are estimated. This estimate is added with the most significant columns to produce the rounded product. This paper presents the design and implementation of parallel truncated multipliers. Simulations indicate that truncated parallel multipliers dissipate between 29 and 40 percent less power than standard parallel multipliers for operand sizes of 16 and 32 bits. 1: Introduction High-speed parallel multipliers are fundamental building blocks in digital signal processing systems [1]. In...
Parallel Saturating Fractional Arithmetic Units
- IN 9TH GREAT LAKES SYMPOSIUM ON VLSI
, 1999
"... This paper describes the designs of a saturating adder, multiplier, single MAC unit, and dual MAC unit with one cycle latencies. The dual MAC unit can perform two saturating MAC operations in parallel and accumulate the results with saturation. Specialized saturation logic ensures that the output of ..."
Abstract
-
Cited by 11 (6 self)
- Add to MetaCart
This paper describes the designs of a saturating adder, multiplier, single MAC unit, and dual MAC unit with one cycle latencies. The dual MAC unit can perform two saturating MAC operations in parallel and accumulate the results with saturation. Specialized saturation logic ensures that the output of the dual MAC unit is identical to the result of the operations performed serially with saturation after each multiplication and each addition 1
Integer Multiplication with Overflow Detection or Saturation
- IEEE Transactions on Computers
, 2000
"... High-speed multiplication is frequently used in general-purpose and application-specific computer systems. These systems often support integer multiplication, where two n-bit integers are multiplied to produce a 2n-bit product. To prevent growth in word length, processors typically return the n leas ..."
Abstract
-
Cited by 6 (2 self)
- Add to MetaCart
High-speed multiplication is frequently used in general-purpose and application-specific computer systems. These systems often support integer multiplication, where two n-bit integers are multiplied to produce a 2n-bit product. To prevent growth in word length, processors typically return the n least significant bits of the product and a flag that indicates whether or not overflow has occurred. Alternatively, some processors saturate results that overflow to the most positive or most negative representable number. This paper presents efficient methods for performing unsigned or two's complement integer multiplication with overflow detection or saturation. These methods have significantly less area and delay than conventional methods for integer multiplication with overflow detection or saturation. Keywords--- Overflow, saturation, two's complement, unsigned, integer, array multipliers, tree multipliers, computer arithmetic. I. Introduction Most modern computers directly support multi...
A Fast Parallel Squarer Based on Divide-and-Conquer
- IEEE Journal of Solid-State Circuits
, 1995
"... Fast and small squarers are needed in many applications such as image compression. A new family of high performance parallel squarers based on the divide-and-conquer method is reported. Our main result was realizing the basis cases of the divide-and-conquer recursion by using optimized n-bit primiti ..."
Abstract
-
Cited by 6 (0 self)
- Add to MetaCart
Fast and small squarers are needed in many applications such as image compression. A new family of high performance parallel squarers based on the divide-and-conquer method is reported. Our main result was realizing the basis cases of the divide-and-conquer recursion by using optimized n-bit primitive squarers, where n is in the range of 2 to 6. This method reduced the gate count and provided shorter critical paths. A chip implementing an 8-bit squarer was designed, fabricated and successfully tested, resulting in 24 MOPS using a 2-¯ CMOS fabrication technology. This squarer had two additional features: increased number of squaring operations per unit circuit area, and the potential for reduced power consumption per squaring operation. 1 Introduction The need to square numbers arises in a large number of image processing algorithms. For example, in many subband vector quantization systems (e.g. [1]), the L 2 -norm calculations in the vector quantizer can involve the order of 288 mil...
Combined Unsigned and Two's Complement Saturating Multipliers
, 2000
"... In many digital signal processing and multimedia applications, results that overflow are saturated to the most positive or most negative representable number. This paper presents efficient techniques for performing saturating n-bit integer multiplication on unsigned and two's complement numbers. Un ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
In many digital signal processing and multimedia applications, results that overflow are saturated to the most positive or most negative representable number. This paper presents efficient techniques for performing saturating n-bit integer multiplication on unsigned and two's complement numbers. Unlike conventional techniques for saturating multiplication, which compute a 2n-bit product and then examine the n most significant product bits to determine if overflow has occurred, the techniques presented in this paper compute only the (n + 1) least significant bits of the product. Specialized overflow detection units, which operate in parallel with the multiplier, determine if overflow has occurred and the product should be saturated. These techniques are applied to designs for saturating array multipliers that perform either unsigned or two's complement saturating integer multiplication, based on an input control signal. Compared to array multipliers that use conventional methods for sa...
Arithmetic, pp. 168--174, IEEE Computer Society, 1997. [41] M. J. Schulte and E. E. Swartzlander, "Hardware Designs for Exactly Rounded Elementary Functions,"
"... -9, IEEE Computer Society, 1993. 5 [27] J. Fandrianto, "Algorithm for High Speed Shared Radix 4 Division and Radix 4 SquareRoot, " in Proc. 8th IEEE Symposium on Computer Arithmetic, pp. 73--79, IEEE Computer Society, 1987. [28] C. V. Ramamoorthy, J. R. Goodman, and K. H. Kim, "Some Properties of ..."
Abstract
- Add to MetaCart
-9, IEEE Computer Society, 1993. 5 [27] J. Fandrianto, "Algorithm for High Speed Shared Radix 4 Division and Radix 4 SquareRoot, " in Proc. 8th IEEE Symposium on Computer Arithmetic, pp. 73--79, IEEE Computer Society, 1987. [28] C. V. Ramamoorthy, J. R. Goodman, and K. H. Kim, "Some Properties of Iterative Square-Rooting Methods Using High-Speed Multiplication," IEEE Transactions on Computers, vol. C-21, pp. 837--847, 1972. [29] M. J. Flynn, "On Division by Functional Iteration," IEEE Transactions on Computers, vol. C-19, pp. 702--706, 1970. [30] S. Oberman and M. Flynn, "Division Algorithms and Implementations," ieeetc, vol. C46, pp. 833--854, August 1997. [31] P. Soderquist and M. Leeser, "An Area/performance Comparison of Subtractive and Multiplicative Divide/Square Root Implementations," in Proc. 12th IEEE Symposium on Computer Arithmetic (S. Knowles and W. H. McAllister, eds.), IEEE Computer Society,

