Floating-point arithmetic is considered as esoteric subject by many people. This is rather surprising, because floating-point is ubiquitous in computer systems: Almost every language has a floating-point datatype; computers from PCs to supercomputers have floating-point accelerators; most compilers will be called upon to compile floating-point algorithms from time to time; and virtually every operating system must respond to floating-point exceptions such as overflow. This paper presents a tutorial on the aspects of floating-point that have a direct impact on designers of computer systems. It begins with background on floating-point representation and rounding error, continues with a discussion of the IEEE floating point standard, and concludes with examples of how computer system builders can better support floating point.
|
1516
|
The Art of Computer Programming
– Knuth
- 1968
|
|
1052
|
The C Programming Language
– Kerighan, Ritchie
- 1978
|
|
135
|
The Art of Computer
– Knuth
- 1973
|
|
62
|
A floating-point technique for extending the available precision
– Dekker
- 1971
|
|
40
|
The Sign/Logarithm Number System
– Alexopoulos
- 1975
|
|
37
|
Underflow and the Reliability of Numerical Software
– Demmel
- 1984
|
|
29
|
A Simple but Realistic Model of Floating-Point Computation
– Brown
- 1981
|
|
29
|
Contributions to a Proposed Standard for Binary Floating-Point Arithmetic
– Coonen
- 1984
|
|
29
|
Floating-point computation
– Sterbenz
- 1974
|
|
26
|
A proposed radix and word length independent standard for floating-point arithmetic
– Cody
- 1985
|
|
26
|
The Arithmetic of the Digital Computer: A New Approach
– Kulisch, Miranker
- 1986
|
|
25
|
BBranch Cuts for Complex Elementary Functions, or Much Ado About Nothing_s Sign Bit
– Kahan
- 1987
|
|
22
|
IEEE Standard 754-1985 for Binary Floating-point Arithmetic
– IEEE
- 1987
|
|
21
|
A Survey Of Error Analysis
– Kahan
- 1972
|
|
15
|
27 Bits Are Not Enough for 8–Digit Accuracy
– Goldberg
- 1967
|
|
15
|
Finite Precision Rational Arithmetic: Slash Number Systems
– Matula, Kornerup
- 1985
|
|
14
|
American National Standard Programming Language FORTRAN
– ANSI
- 1978
|
|
14
|
Compiler Support for Floating-point Computation
– Farnum
- 1988
|
|
11
|
Computer Arithmetic, in “Computer Architecture: A Quantitative Approach”, by David
– Goldberg
- 1990
|
|
11
|
The Near Orthogonality of Syntax
– Kahan, Coonen
- 1982
|
|
10
|
TheCProgramming Language. Prentice{Hall, Englewood Cli s
– Kernighan, Ritchie
- 1978
|
|
9
|
A Portable Floating-Point Environment, unpublished manuscript
– Barnett
- 1987
|
|
9
|
Floating-Point Standards — Theory and Practice, in “Reliability in Computing: the role of interval methods in scientific computing
– Cody
- 1988
|
|
8
|
Unpublished lectures given at Sun Microsystems
– Kahan
- 1988
|
|
8
|
Arithmetic for Vector
– Kirchner, Kulisch
- 1987
|
|
8
|
Evading the Drift
– Reiser, Knuth
- 1975
|
|
6
|
Arithmetic for Vector Processors
– Kirchner, Kulisch
- 1988
|
|
4
|
Calculating Area and Angle of a Needle-like Triangle, unpublished manuscript
– Kahan
- 1986
|
|
4
|
The Arithmetic of the Digital Computer: A New Approach
– Kulish, Miranker
- 1986
|
|
4
|
A Proposed Radix- and Word-length-independent Standard for Floating-point Arithmetic
– al
- 1984
|
|
4
|
Numerical Computation Guide
– unknown authors
- 1986
|
|
3
|
Anomalies in the IBM ACRITH package
– Kahan, LeBlanc
- 1985
|
|
2
|
Computer Arithmetic, in "Computer Architecture: A Quantitative Approach", by David
– Goldberg
- 1990
|
|
1
|
A portable floating-point environment. Unpubhshed manuscript
– 13ARNETT
- 1987
|
|
1
|
Floating-point standards–Theory and practice
– CODY
- 1988
|
|
1
|
Computer Solut~on of Linear Algebraic Systems
– FORSYTHE, MOLER
- 1967
|
|
1
|
Unpublished lectures given at
– KAHAN
- 1988
|
|
1
|
The near orthogonality of syntax, semantics, and diagnostics in numerical programming environments
– KAHAN, COONEN
- 1982
|
|
1
|
Evading the drift in floating-point addition Inf. Process
– REISER, KNUTH
- 1975
|
|
1
|
27 Bits Ar e Not Enough for 8-Digit Accuracy
– Goldberg
- 1967
|
|
1
|
1982.HP-15C Advanced Functions Handbook
– Packard
|
|
1
|
1972.A Survey Of Err or Analysis, in Information Pr ocessing 71
– Kahan
|
|
1
|
Unpublished lectur es given at Sun Micr osystems
– Kahan
- 1988
|
|
1
|
ome T. 1982.The Near Orthogonality of Syntax
– Kahan, Coonen
|
|
1
|
Arithmetic for V ector
– Kirchner, Kulisch
- 1987
|