Three Measures of Precision in Floating Point Arithmetic
Nick Higham April 13, 1991
This note is about three quantities that relate to the precision of floating point arithmetic. For t-digit, rounded base b arithmetic the quantities are

1. machine epsilon ε_M, defined as the distance from 1.0 to the smallest floating point number bigger than 1.0 (and given by ε_M = b^(1-t), which is the spacing of the floating point numbers between 1.0 and b),

2. ε = the smallest floating point number x such that fl(1 + x) > 1, and

3. unit roundoff u = (1/2) b^(1-t) (which is a bound for the relative error in rounding a real number to floating point form).
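For IEEE double precision (b = 2, t = 53) the three quantities can be examined directly in Python, whose floats are IEEE 754 doubles. This is an illustrative sketch; `math.nextafter` (Python 3.9+) gives the neighbouring floating point number:

```python
import math
import sys

b, t = 2, 53                       # IEEE 754 double precision
eps_M = b ** (1 - t)               # machine epsilon: gap from 1.0 to the next float
u = 0.5 * b ** (1 - t)             # unit roundoff

# The geometric definition agrees with the language's built-in constant:
assert eps_M == sys.float_info.epsilon
assert math.nextafter(1.0, 2.0) - 1.0 == eps_M

# With round to even, 1 + u is a tie and rounds back down to 1.0,
# so u itself does not satisfy fl(1 + x) > 1 ...
print(1.0 + u > 1.0)               # False
# ... but the next float above u does:
eps = math.nextafter(u, 1.0)       # u * (1 + eps_M)
print(1.0 + eps > 1.0)             # True
```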
The terminology I have used is not an accepted standard; for example, the name machine epsilon is sometimes given to the quantity in (2). My definition of unit roundoff is as in Golub and Van Loan's book Matrix Computations [1] and is widely used. I chose the notation ε_M in (1) because it conforms with MATLAB, in which the permanent variable eps is the machine epsilon. [Ed. note: Well, not quite. See my comments below. Cleve]

The purpose of this note is to point out that it is not necessarily the case that ε = ε_M, or that ε = u, as is sometimes claimed in the literature, and that, moreover, the precise value of ε is difficult to predict.

It is helpful to consider binary arithmetic with t = 3. Using binary notation we have 1 + u = 1.00 + .001 = 1.001, which is exactly half way between the adjacent floating point numbers 1.00 and 1.01. Thus fl(1 + u) = 1.01 if we round away from zero when there is a tie, while fl(1 + u) = 1.00 if we round to an even last digit on a tie. It follows that ε <= u with round away from zero (and it is easy to see that ε = u), whereas ε > u for round to even. I believe that round away from zero used to be the more common choice in computer arithmetic, and this may explain why some authors define or characterize u as in (2). However, the widely used IEEE standard 754 binary arithmetic uses round to even.
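The t = 3 tie behaviour can be simulated with exact rationals. The `fl` below is a hypothetical toy rounder written for this note, not MATLAB's arithmetic; its `tie` parameter selects between the two tie-breaking rules discussed above:

```python
from fractions import Fraction

def fl(x, t=3, tie='even'):
    """Round positive x to t significant base-2 digits (a toy model of fl)."""
    x = Fraction(x)
    e = 0
    while x >= 2:                  # normalize x into [1, 2)
        x /= 2; e += 1
    while x < 1:
        x *= 2; e -= 1
    scaled = x * 2 ** (t - 1)      # integer part now holds the t digits
    n, frac = divmod(scaled, 1)
    half = Fraction(1, 2)
    if frac > half or (frac == half and (n % 2 == 1 if tie == 'even' else True)):
        n += 1
    return Fraction(n, 2 ** (t - 1)) * 2 ** e

one_plus_u = Fraction(9, 8)                  # 1.001 in binary, the halfway case
print(fl(one_plus_u, tie='even'))            # 1    (rounds down to 1.00)
print(fl(one_plus_u, tie='away'))            # 5/4  (rounds up to 1.01)
```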
So far, then, it is clear that the way in which ties are resolved in rounding affects the value of ε. Let us now try to determine the value of ε with round to even. A little thought may lead one to suspect that ε = u(1 + ε_M). For in the b = 2, t = 3 case we have

x = u(1 + ε_M) = .001 x (1 + .01) = .00101

and

fl(1 + x) = fl(1.00101) = 1.01,
assuming perfect rounding. I reasoned this way, and decided to check this putative value of ε in 386-MATLAB on my PC. MATLAB uses IEEE standard 754 binary arithmetic, which has t = 53 (taking into account the implicit leading bit of 1). Here is what I found:
>> format compact; format hex
>> x = 2^(-53)*(1+2^(-52)); y = [1+x 1 x]
y =
   3ff0000000000000   3ff0000000000000   3ca0000000000001
>> x = 2^(-53)*(1+2^(-11)); y = [1+x 1 x]
y =
   3ff0000000000000   3ff0000000000000   3ca0020000000000
>> x = 2^(-53)*(1+2^(-10)); y = [1+x 1 x]
y =
   3ff0000000000001   3ff0000000000000   3ca0040000000000
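The same experiment can be reproduced in Python, whose floats are IEEE 754 doubles; `hexbits` is a small helper, written for this sketch, that plays the role of MATLAB's `format hex`. Note that a modern machine that rounds once directly to double precision may give different results for 1 + x than the 386 transcript above, which is precisely the point of this note:

```python
import struct

def hexbits(x):
    """Raw IEEE 754 double bits in hex, like MATLAB's format hex."""
    return struct.pack('>d', x).hex()

for i in (52, 11, 10):
    x = 2.0**-53 * (1 + 2.0**-i)
    # On SSE2 hardware all three comparisons print True;
    # under x87 double rounding only i = 10 does.
    print(i, hexbits(1 + x), hexbits(x), 1 + x > 1)
```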
Thus the guess is wrong, and it appears that ε = u(1 + 2^42 ε_M) in this environment! What is the explanation?
The answer is that we are seeing the effect of double-rounding, a phenomenon that I learned about from an article by Cleve Moler [2]. The Intel floating-point chips used on PCs implement internally the optional extended precision arithmetic described in the IEEE standard, with 64 bits in the mantissa [3]. What appears to be happening in the example above is that 1 + x is first rounded to 64 bits; if x = u(1 + 2^-i) and i > 10 then the least significant bit is lost in this rounding. The extended precision number is now rounded to 53 bit precision; but when i > 10 there is a rounding tie (since we have lost the original least significant bit), which is resolved to 1.0, which has an even last bit.

The interesting fact, then, is that the value of ε can vary even between machines that implement IEEE standard arithmetic.

Finally, I'd like to stress an important point that I learned from the work of Vel Kahan: the relative error in addition and subtraction is not necessarily bounded by u. Indeed on machines such as Crays that lack a guard digit this relative error can be as large as 1. For example, if b = 2 and t = 3, then subtracting from 1.0 the next smaller floating point number we have
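The double-rounding mechanism just described can be simulated exactly with rationals. The `round_bits` helper below is illustrative (ties to even, arguments assumed in [1, 2)); the two-stage call mimics rounding to 64 bits of mantissa and then to 53:

```python
from fractions import Fraction

def round_bits(x, t):
    """Round positive x in [1, 2) to t significant bits, ties to even."""
    scaled = x * 2 ** (t - 1)
    n, frac = divmod(scaled, 1)
    half = Fraction(1, 2)
    if frac > half or (frac == half and n % 2 == 1):
        n += 1
    return Fraction(n, 2 ** (t - 1))

u = Fraction(1, 2**53)
for i in (52, 11, 10):
    x = u * (1 + Fraction(1, 2**i))
    once  = round_bits(1 + x, 53)                  # single rounding (Sparc, SSE2)
    twice = round_bits(round_bits(1 + x, 64), 53)  # extended then double (x87 PC)
    print(i, once > 1, twice > 1)
# prints: 52 True False / 11 True False / 10 True True,
# reproducing the 386-MATLAB results above.
```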
Exactly:

      1.00
    - 0.111
    -------
      0.001

Computed, without a guard digit (the least significant bit of 0.111 is dropped):

      1.00
    - 0.11
    -------
      0.01
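The arithmetic above can be checked with exact rationals; this sketch models the missing guard digit by truncating the subtrahend to the three digit positions of 1.00 before subtracting:

```python
from fractions import Fraction

a = Fraction(1)           # 1.00 in binary, t = 3
b = Fraction(7, 8)        # 0.111 in binary, the next smaller float

exact = a - b             # 0.001 = 1/8

# Without a guard digit, b is truncated to a's digit positions
# (2^0, 2^-1, 2^-2), losing its last bit:
b_trunc = Fraction(int(b * 4), 4)   # 0.11 = 3/4
computed = a - b_trunc              # 0.01 = 1/4

rel_err = abs(computed - exact) / exact
print(computed, exact, rel_err)     # 1/4 1/8 1
```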
The computed answer is too big by a factor 2 and so has relative error 1! According to Vel Kahan, the example I have given mimics what happens on a Cray X-MP or Y-MP, but the Cray 2 behaves differently and produces the answer zero. Although the relative error in addition/subtraction is not bounded by the unit roundoff u for machines without a guard digit, it is nevertheless true that

fl(a + b) = a(1 + e) + b(1 + f),

where e and f are bounded in magnitude by u.

[1] G. H. Golub and C. F. Van Loan, Matrix Computations, Second Edition, Johns Hopkins University Press, Baltimore, 1989.
[2] C. B. Moler, Technical note: Double-rounding and implications for numeric computations, The MathWorks Newsletter, Vol. 4, No. 1 (1990), p. 6.
[3] R. Startz, 8087/80287/80387 for the IBM PC & Compatibles, Third Edition, Brady, New York, 1988.
Editor's addendum: [Cleve Moler]
I agree with everything Nick has to say, and have a few more comments. MATLAB on a PC has IEEE floating point with extended precision implemented in an Intel chip. The C compiler generates code with double rounding. MATLAB on a Sun Sparc also has IEEE floating point with extended precision, but it is implemented in a Sparc chip. The C compiler generates code which avoids double rounding. On both the PC and the Sparc

ε_M = 2^-52 = 3cb0000000000000 = 2.220446049250313e-16
However, on the PC

ε = 2^-53 (1 + 2^-10) = 3ca0040000000000 = 1.111307226797642e-16

while on the Sparc

ε = 2^-53 (1 + 2^-52) = 3ca0000000000001 = 1.110223024625157e-16

Note that ε is not 2 raised to a negative integer power.

MATLAB on a VAX usually uses D floating point (there is also a G version under VMS). Compared to IEEE floating point, the D format has 3 more bits in the fraction and 3 fewer bits in the exponent. So ε_M should be 2^-55, but MATLAB says ε_M is 2^-56. It is actually using the 1 + x > 1 trick to compute what we're now calling ε. There is no extended precision or double rounding, and ties between two floating point values are chopped, so we can find ε by just trying powers of 2. On the VAX with D float

ε_M = 2^-55 = 2.775557561562891e-17
ε   = 2^-56 = 1.387778780781446e-17

The definition of ε_M as the distance from 1.0 to the next floating point number is a purely geometric quantity depending only on the structure of the floating point numbers. The point Nick is making is that the more common definition, of what we here call ε, involves a comparison between 1.0 + x and 1.0 and subtle rounding properties of floating point addition. I now much prefer the simple geometric definition, even though I've been as responsible as anybody for the popularity of the definition involving addition.

Cleve
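The IEEE hex constants quoted above can be checked with a short Python sketch (Python floats are IEEE 754 doubles; `struct` exposes the raw bits, and `hexbits` is a helper written for this note):

```python
import struct

def hexbits(x):
    """Raw IEEE 754 double bits in hex, like MATLAB's format hex."""
    return struct.pack('>d', x).hex()

eps_pc    = 2.0**-53 * (1 + 2.0**-10)   # PC (double-rounded) value of epsilon
eps_sparc = 2.0**-53 * (1 + 2.0**-52)   # Sparc (single-rounded) value

print(hexbits(eps_pc))       # 3ca0040000000000
print(hexbits(eps_sparc))    # 3ca0000000000001
print(hexbits(2.0**-52))     # 3cb0000000000000  (eps_M)
```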
This is a LaTeXed version of the original post to NA Digest, 13 April 1991. Derek O'Connor, www.derekroconnor.net
D EREK OC ONNOR , J ULY 29, 2011