0% found this document useful (0 votes)

137 views

Chapter 3 Arithmetic For Computers (Revised)

This document discusses arithmetic operations for computers. It begins by introducing integer operations like addition, subtraction, multiplication, and division. It then discusses floating-point number representation and operations. The document goes on to explain different number representations like sign-magnitude, one's complement, and two's complement. It also discusses how MIPS uses two's complement representation. Various arithmetic operations like addition, subtraction, and detecting overflow are described in the context of two's complement numbers. Finally, the document discusses implementing an ALU and optimizations like carry-lookahead addition to improve performance.

Uploaded by

yrikki

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

137 views

Chapter 3 Arithmetic For Computers (Revised)

Uploaded by

yrikki

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 64

Chapter 3

Arithmetic for Computers

3.1 Introduction

Arithmetic for Computers

Operations on integers

Addition and subtraction Multiplication and division Dealing with overflow

Floating-point real numbers

Representation and operations

Chapter 3 Arithmetic for Computers 2

Arithmetic
Where we've been: Performance (seconds, cycles, instructions) Abstractions: Instruction Set Architecture Assembly Language and Machine Language What's up ahead: Implementing the Architecture
operation

a
32

ALU
result
32

b
32

Numbers

Bits are just bits (no inherent meaning) conventions define relationship between bits and numbers Binary numbers (base 2) 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001... decimal: 0...2n-1 Of course it gets more complicated: numbers are finite (overflow) fractions and real numbers negative numbers e.g., no MIPS subi instruction; addi can add a negative number) How do we represent negative numbers? i.e., which bit patterns will represent which numbers?

Possible Representations
Sign Magnitude: One's Complement Two's Complement

000 = +0 001 = +1 010 = +2 011 = +3 100 = -0 101 = -1 110 = -2 111 = -3

000 = +0 001 = +1 010 = +2 011 = +3 100 = -3 101 = -2 110 = -1 111 = -0

000 = +0 001 = +1 010 = +2 011 = +3 100 = -4 101 = -3 110 = -2 111 = -1

Issues: balance, number of zeros, ease of operations Which one is best? Why?

MIPS
32 bit signed numbers:
0000 0000 0000 ... 0111 0111 1000 1000 1000 ... 1111 1111 1111 0000 0000 0000 0000 0000 0000 0000two = 0ten 0000 0000 0000 0000 0000 0000 0001two = + 1ten 0000 0000 0000 0000 0000 0000 0010two = + 2ten

1111 1111 0000 0000 0000

1110two 1111two 0000two 0001two 0010two

= = = = =

+ +

2,147,483,646ten 2,147,483,647ten 2,147,483,648ten 2,147,483,647ten 2,147,483,646ten

maxint
minint

1111 1111 1111 1111 1111 1111 1101two = 3ten 1111 1111 1111 1111 1111 1111 1110two = 2ten 1111 1111 1111 1111 1111 1111 1111two = 1ten

Two's Complement Operations

Negating a two's complement number: invert all bits and add 1

remember: negate (+/-) and invert (1/0) are quite different! Converting n bit numbers into numbers with more than n bits: MIPS 16 bit immediate gets converted to 32 bits for arithmetic copy the most significant bit (the sign bit) into the other bits 0010 1010 -> 0000 0010 -> 1111 1010

"sign extension" (lbu vs. lb)

Addition & Subtraction

Just like in grade school (carry/borrow 1s) 0111 0111 0110 + 0110 - 0110 - 0101
Two's complement operations easy subtraction using addition of negative numbers 0111 + 1010

Overflow (result too large for finite computer word):

e.g., adding two n-bit numbers does not yield an n-bit number 0111 + 0001 note that overflow term is somewhat misleading, 1000 it does not mean a carry overflowed

Detecting Overflow
No overflow when adding a positive and a negative number No overflow when signs are the same for subtraction Overflow occurs when the value affects the sign: overflow when adding two positives yields a negative or, adding two negatives gives a positive or, subtract a negative from a positive and get a negative or, subtract a positive from a negative and get a positive Consider the operations A + B, and A B Can overflow occur if B is 0 ? Can overflow occur if A is 0 ?

Effects of Overflow
An exception (interrupt) occurs Control jumps to predefined address for exception Interrupted address is saved for possible resumption Details based on software system / language example: flight control vs. homework assignment Don't always want to detect overflow new MIPS instructions: addu, addiu, subu
note: addiu still sign-extends! note: sltu, sltiu for unsigned comparisons Roll over: circular buffers Saturation: pixel lightness control

Review: Boolean Algebra & Gates

Problem: Consider a logic function with three inputs: A, B, and C.
Output D is true if at least one input is true Output E is true if exactly two inputs are true Output F is true only if all three inputs are true Show the truth table for these three functions. Show the Boolean equations for these three functions.

Show an implementation consisting of inverters, AND, and OR gates.

An ALU (arithmetic logic unit)

Let's build an ALU to support the andi and ori instructions
we'll just build a 1 bit ALU, and use 32 of them operation a op a b res

result

Possible Implementation (sum-of-products):

Appendix: C.5

Review: The Multiplexor

Selects one of the inputs to be the output, based on a control input
S note: we call this a 2-input mux even though it has 3 inputs!

A B

0 1

Lets build our ALU using a MUX:

Different Implementations
Not easy to decide the best way to build something
Don't want too many inputs to a single gate Dont want to have to go through too many gates for our purposes, ease of comprehension is important Let's look at a 1-bit ALU for addition:
CarryIn

a Sum b

cout = a b + a cin + b cin sum = a xor b xor cin

CarryOut

How could we build a 1-bit ALU for add, and, and or? How could we build a 32-bit ALU?

Building a 32 bit ALU

Case (Op) { 0: R = a ^ b;
a0 CarryIn ALU0 CarryOut
Operation CarryIn

CarryIn

Operation

1: R = a V b; 2: R = a + b}
a

Result0

a1
0

CarryIn ALU1 CarryOut Result1

Result

a2
2 b

CarryIn ALU2 CarryOut Result2

R0 = a ^ b; R1 = a V b; R2 = a + b; Case (Op) { 0: R = R0; 1: R = R1; 2: R = R2 }

CarryOut

a31 b31

CarryIn ALU31 Result31

What about subtraction (a b) ?

Two's complement approach: just negate b and add. How do we negate?

A very clever solution: Result = A + (~B) + 1

Binvert Operation CarryIn

a 0

Result

0 1

CarryOut

Tailoring the ALU to the MIPS

Need to support the set-on-less-than instruction (slt)
remember: slt is an arithmetic instruction produces a 1 if rs < rt and 0 otherwise use subtraction: (a-b) < 0 implies a < b Need to support test for equality (beq $t5, $t6, $t7) use subtraction: (a-b) = 0 implies a = b

Supporting slt
a

Binvert

Operation CarryIn

Can we figure out the idea?

b 0 1 Less

1 Result 2

CarryOut

Binvert

Operation CarryIn

0 1 Result

0 1

Less

3
Set

Overflow detection
b.

Overflow

Binvert

CarryIn

Operation

a0 b0

CarryIn ALU0 Less CarryOut

Result0

a1 b1 0

CarryIn ALU1 Less CarryOut

Result1

a2 b2 0

CarryIn ALU2 Less CarryOut

Result2

CarryIn

a31 b31 0

CarryIn ALU31 Less

Result31 Set Overflow

Test for equality

Notice control lines:
000 001 010 110 111 = = = = = and or add subtract slt
Bnegate Operation

a0 b0

CarryIn ALU0 Less CarryOut

Result0

a1 b1 0

CarryIn ALU1 Less CarryOut

Result1 Zero

Note: zero is a 1 when the result is zero!

Binvert Operation CarryIn

a2 b2 0

CarryIn ALU2 Less CarryOut

Result2

a 0 1 Result b 0 1 Less 3 2
a31 b31 0 CarryIn ALU31 Less Result31 Set Overflow

CarryOut

Conclusion
We can build an ALU to support the MIPS instruction set
key idea: use multiplexor to select the output we want we can efficiently perform subtraction using twos complement we can replicate a 1-bit ALU to produce a 32-bit ALU

Important points about hardware

all of the gates are always working the speed of a gate is affected by the number of inputs to the gate the speed of a circuit is affected by the number of gates in series (on the critical path or the deepest level of logic)

Our primary focus: comprehension, however, Clever changes to organization can improve performance (similar to using better algorithms in software) well look at two examples for addition and multiplication

Problem: ripple carry adder is slow

Is a 32-bit ALU as fast as a 1-bit ALU? Is there more than one way to do addition? two extremes: ripple carry and sum-of-products

Can you see the ripple? How could you get rid of it? c1 = b0c0 + a0c0 + a0b0 c3 = b2c2 + a2c2 + a2b2 c2 = b1c1 + a1c1 + a1b1 c4 = b3c3 + a3c3 + a3b3

Expanding the carry chains c2 = a1a0b0+a1a0c0+a1b0c0+b1a0b0+b1a0c0+b1b0c0+a1b1 c3 = ??? c4 = ???

Not feasible! Why? Cn has 2P terms (P=0n)

Appendix: C.6

Carry-lookahead adder
An approach in-between our two extremes Motivation: If we didn't know the value of carry-in, what could we do? When would we always generate a carry? gi = ai bi When would we propagate the carry? pi = ai + bi Did we get rid of the ripple? c1 = g0 + p0c0 c3 = g2 + p2c2 Expanding the carry chains c2 = g1+p1g0+p1p0c0 c2 = g1 + p1c1 c4 = g3 + p3c3

Feasible! Why? The carry chain does not disappear, but is much smaller: Cn has n+1 terms

Use principle to build bigger adders

CarryIn

a0 b0 a1 b1 a2 b2 a3 b3 CarryIn Result0--3 ALU0 P0 G0 C1 a4 b4 a5 b5 a6 b6 a7 b7 CarryIn Result4--7 ALU1 P1 G1 C2 CarryIn Result8--11 ALU2 P2 G2 C3 CarryIn Result12--15 ALU3 P3 G3 C4 CarryOut pi + 3 gi + 3 ci + 4 pi + 2 gi + 2 ci + 3 pi + 1 gi + 1 ci + 2 pi gi

Carry-lookahead unit

How about constructing a 16-bit adder in CLA way? Cant build a 16 bit adder this way... (too big) Could use ripple carry of 4-bit CLA adders Better: use the CLA principle again! P0 = p3 p2 p1 p0 P1 = p7 p6 p5 p4 P2 =p11 p10 p9 p8 P3 = p15 p14 p13 p12 G0 = g3+p3g2+p3p2g1+p3p2p1g0 G1 = g7+p7g6+p7p6g5+p7p6p5g4 G2 = g11+p11g10+p11p10g9+p11p10p9g8 G3 = g15+p15g14+p15p14g13+p15p14p13g12

ci + 1

a8 b8 a9 b9 a10 b10 a11 b11

a12 b12 a13 b13 a14 b14 a15 b15

C1 = G0 +(P0 c0) C2 = G1+(P1 G0)+(P1P0c0) C3 = G2+(P2 G1)+(P2P1G0)+(P2P1P0c0) C4 = G3+(P3 G2)+(P3P2G1)+(P3P2P1G0) +(P3P2P1P0c0)

3.3 Multiplication

Multiplication

Start with long-multiplication approach

1000 1001 1000 0000 0000 1000 1001000

multiplicand

multiplier

product

Length of product is the sum of operand lengths

Chapter 3 Arithmetic for Computers 31

Multiplication Hardware

Initially 0

Chapter 3 Arithmetic for Computers 32

Optimized Multiplier

Perform steps in parallel: add/shift

One cycle per partial-product addition

Thats ok, if frequency of multiplications is low

Chapter 3 Arithmetic for Computers 33

Faster Multiplier

Uses multiple adders

Cost/performance tradeoff

Can be pipelined

Several multiplication performed in parallel

Chapter 3 Arithmetic for Computers 34

MIPS Multiplication

Two 32-bit registers for product

HI: most-significant 32 bits LO: least-significant 32-bits mult rs, rt

Instructions

multu rs, rt

64-bit product in HI/LO

mfhi rd

mflo rd

Move from HI/LO to rd Can test HI value to see if product overflows 32 bits

mul rd, rs, rt

Least-significant 32 bits of product > rd

Chapter 3 Arithmetic for Computers 35

3.4 Division

Division

Check for 0 divisor Long division approach

quotient dividend

If divisor dividend bits

1 bit in quotient, subtract 0 bit in quotient, bring down next dividend bit

10010 1000 10010100 -1000 divisor 10 101 1010 -1000 100 remainder
n-bit operands yield (n+1)-bit quotient and n-bit remainder

Otherwise

Restoring division

Do the subtract, and if remainder goes < 0, add divisor back Divide using absolute values Adjust sign of quotient and remainder as required

Signed division

Chapter 3 Arithmetic for Computers 36

Division Hardware
Initially divisor in left half

Initially dividend

Chapter 3 Arithmetic for Computers 37

Optimized Divider

One cycle per partial-remainder subtraction Looks a lot like a multiplier!

Same hardware can be used for both

Chapter 3 Arithmetic for Computers 38

Faster Division

Cant use parallel hardware as in multiplier

Subtraction is conditional on sign of remainder

Faster dividers (e.g. SRT devision) generate multiple quotient bits per step

Still require multiple steps

Chapter 3 Arithmetic for Computers 39

MIPS Division

Use HI/LO registers for result

HI: 32-bit remainder LO: 32-bit quotient div rs, rt / divu rs, rt No overflow or divide-by-0 checking

Instructions

Software must perform checks if required

Use mfhi, mflo to access result

Chapter 3 Arithmetic for Computers 40

3.5 Floating Point

Floating Point

Representation for non-integral numbers

Including very small and very large numbers 2.34 1056 +0.002 104 +987.02 109 1.xxxxxxx2 2yyyy
normalized

Like scientific notation

not normalized

In binary

Types float and double in C

Chapter 3 Arithmetic for Computers 41

Floating Point Standard

Defined by IEEE Std 754-1985 Developed in response to divergence of representations

Portability issues for scientific code

Now almost universally adopted Two representations

Single precision (32-bit) Double precision (64-bit)

Chapter 3 Arithmetic for Computers 42

IEEE Floating-Point Format

single: 8 bits double: 11 bits single: 23 bits double: 52 bits

S Exponent
S

Fraction
(Exponent Bias)

x ( 1) (1 Fraction) 2

S: sign bit (0 non-negative, 1 negative) Normalize significand: 1.0 |significand| < 2.0

Always has a leading pre-binary-point 1 bit, so no need to represent it explicitly (hidden bit) Significand is Fraction with the 1. restored Ensures exponent is unsigned Single: Bias = 127; Double: Bias = 1203
Chapter 3 Arithmetic for Computers 43

Exponent: excess representation: actual exponent + Bias

Single-Precision Range

Exponents 00000000 and 11111111 reserved Smallest value

Exponent: 00000001 actual exponent = 1 127 = 126 Fraction: 00000 significand = 1.0 1.0 2126 1.2 1038 exponent: 11111110 actual exponent = 254 127 = +127 Fraction: 11111 significand 2.0 2.0 2+127 3.4 10+38
Chapter 3 Arithmetic for Computers 44

Largest value

Double-Precision Range

Exponents 000000 and 111111 reserved Smallest value

Exponent: 00000000001 actual exponent = 1 1023 = 1022 Fraction: 00000 significand = 1.0 1.0 21022 2.2 10308 Exponent: 11111111110 actual exponent = 2046 1023 = +1023 Fraction: 11111 significand 2.0 2.0 2+1023 1.8 10+308
Chapter 3 Arithmetic for Computers 45

Largest value

Floating-Point Precision

Relative precision

all fraction bits are significant Single: approx 223

Equivalent to 23 log102 23 0.3 6 decimal digits of precision

Double: approx 252

Equivalent to 52 log102 52 0.3 16 decimal digits of precision

Chapter 3 Arithmetic for Computers 46

Floating-Point Example

Represent 0.75

0.75 = (1)1 1.12 21 S=1 Fraction = 1000002 Exponent = 1 + Bias

Single: 1 + 127 = 126 = 011111102 Double: 1 + 1023 = 1022 = 011111111102

Single: 101111110100000 Double: 101111111110100000

Chapter 3 Arithmetic for Computers 47

Floating-Point Example

What number is represented by the singleprecision float 1100000010100000

S=1 Fraction = 01000002 Fxponent = 100000012 = 129 = (1) 1.25 22 = 5.0

x = (1)1 (1 + 012) 2(129 127)

Chapter 3 Arithmetic for Computers 48

IEEE 754 encoding of floating-point numbers

Single precision
Exponent 0 0 1-254 255 255 Fraction 0 Nonzero Anything 0 Nonzero

Double precision
Exponent 0 0 1-2046 2047 2047 Fraction 0 Nonzero Anything 0 Nonzero

Object represented

+ Denomalized number +

Floating-point number

+ Infinity NaN (Not a Number)

Floating-Point Addition

Consider a 4-digit decimal example

9.999 101 + 1.610 101

1. Align decimal points

Shift number with smaller exponent 9.999 101 + 0.016 101

9.999 101 + 0.016 101 = 10.015 101 1.0015 102

2. Add significands

3. Normalize result & check for over/underflow

4. Round and renormalize if necessary

1.002 102
Chapter 3 Arithmetic for Computers 52

Floating-Point Addition

Now consider a 4-digit binary example

1.0002 21 + 1.1102 22 (0.5 + 0.4375)

1. Align binary points

Shift number with smaller exponent 1.0002 21 + 0.1112 21

1.0002 21 + 0.1112 21 = 0.0012 21 1.0002 24, with no over/underflow

2. Add significands

3. Normalize result & check for over/underflow

4. Round and renormalize if necessary

1.0002 24 (no change) = 0.0625

Chapter 3 Arithmetic for Computers 53

FP Adder Hardware

Much more complex than integer adder Doing it in one clock cycle would take too long

Much longer than integer operations Slower clock would penalize all instructions Can be pipelined

FP adder usually takes several cycles

Chapter 3 Arithmetic for Computers 54

FP Adder Hardware

Step 1

Step 2

Step 3

Step 4

Chapter 3 Arithmetic for Computers 55

FP Arithmetic Hardware

FP multiplier is of similar complexity to FP adder

But uses a multiplier for significands instead of an adder Addition, subtraction, multiplication, division, reciprocal, square-root FP integer conversion

FP arithmetic hardware usually does

Operations usually takes several cycles

Can be pipelined
Chapter 3 Arithmetic for Computers 58

FP Instructions in MIPS

FP hardware is coprocessor 1

Adjunct processor that extends the ISA

Separate FP registers

32 single-precision: $f0, $f1, $f31 Paired for double-precision: $f0/$f1, $f2/$f3,

Release 2 of MIPs ISA supports 32 64-bit FP regs

FP instructions operate only on FP registers

Programs generally dont do integer ops on FP data, or vice versa More registers with minimal code-size impact lwc1, ldc1, swc1, sdc1

FP load and store instructions

e.g., ldc1 $f8, 32($sp)

Chapter 3 Arithmetic for Computers 59

FP Instructions in MIPS

Single-precision arithmetic

add.s, sub.s, mul.s, div.s

e.g., add.s $f0, $f1, $f6

Double-precision arithmetic

add.d, sub.d, mul.d, div.d

e.g., mul.d $f4, $f4, $f6

Single- and double-precision comparison

c.xx.s, c.xx.d (xx is eq, lt, le, ) Sets or clears FP condition-code bit

e.g. c.lt.s $f3, $f4

Branch on FP condition code true or false

bc1t, bc1f

e.g., bc1t TargetLabel

Chapter 3 Arithmetic for Computers 60

FP Example: F to C

C code:
float f2c (float fahr) { return ((5.0/9.0)*(fahr - 32.0)); } fahr in $f12, result in $f0, literals in global memory space

Compiled MIPS code:

f2c: lwc1 lwc2 div.s lwc1 sub.s mul.s jr $f16, $f18, $f16, $f18, $f18, $f0, $ra const5($gp) const9($gp) $f16, $f18 const32($gp) $f12, $f18 $f16, $f18
Chapter 3 Arithmetic for Computers 61

FP Example: Array Multiplication

X=X+YZ

All 32 32 matrices, 64-bit double-precision elements

C code:
void mm (double x[][], double y[][], double z[][]) { int i, j, k; for (i = 0; i! = 32; i = i + 1) for (j = 0; j! = 32; j = j + 1) for (k = 0; k! = 32; k = k + 1) x[i][j] = x[i][j] + y[i][k] * z[k][j]; } Addresses of x, y, z in $a0, $a1, $a2, and i, j, k in $s0, $s1, $s2
Chapter 3 Arithmetic for Computers 62

FP Example: Array Multiplication

MIPS code:
$t1, 32 $s0, 0 $s1, 0 $s2, 0 $t2, $s0, 5 $t2, $t2, $s1 $t2, $t2, 3 $t2, $a0, $t2 $f4, 0($t2) $t0, $s2, 5 $t0, $t0, $s1 $t0, $t0, 3 $t0, $a2, $t0 $f16, 0($t0) # # # # # # # # # # # # # # $t1 = 32 (row size/loop end) i = 0; initialize 1st for loop j = 0; restart 2nd for loop k = 0; restart 3rd for loop $t2 = i * 32 (size of row of x) $t2 = i * size(row) + j $t2 = byte offset of [i][j] $t2 = byte address of x[i][j] $f4 = 8 bytes of x[i][j] $t0 = k * 32 (size of row of z) $t0 = k * size(row) + j $t0 = byte offset of [k][j] $t0 = byte address of z[k][j] $f16 = 8 bytes of z[k][j]

li li L1: li L2: li sll addu sll addu l.d L3: sll addu sll addu l.d

Chapter 3 Arithmetic for Computers 63

FP Example: Array Multiplication

sll $t0, $s0, 5 addu $t0, $t0, $s2 sll $t0, $t0, 3 addu $t0, $a1, $t0 l.d $f18, 0($t0) mul.d $f16, $f18, $f16 add.d $f4, $f4, $f16 addiu $s2, $s2, 1 bne $s2, $t1, L3 s.d $f4, 0($t2) addiu $s1, $s1, 1 bne $s1, $t1, L2 addiu $s0, $s0, 1 bne $s0, $t1, L1 # # # # # # # # # # # # # # $t0 = i*32 (size of row of y) $t0 = i*size(row) + k $t0 = byte offset of [i][k] $t0 = byte address of y[i][k] $f18 = 8 bytes of y[i][k] $f16 = y[i][k] * z[k][j] f4=x[i][j] + y[i][k]*z[k][j] $k k + 1 if (k != 32) go to L3 x[i][j] = $f4 $j = j + 1 if (j != 32) go to L2 $i = i + 1 if (i != 32) go to L1

Chapter 3 Arithmetic for Computers 64

Interpretation of Data
The BIG Picture

Bits have no inherent meaning

Interpretation depends on the instructions applied

Finite range and precision Need to account for this in programs

Computer representations of numbers

Chapter 3 Arithmetic for Computers 66

3.6 Parallelism and Computer Arithmetic: Associativity

Associativity

Parallel programs may interleave operations in unexpected orders

Assumptions of associativity may fail

(x+y)+z x -1.50E+38 y 1.50E+38 0.00E+00 z 1.0 1.0 1.50E+38 1.00E+00 0.00E+00 x+(y+z) -1.50E+38

Need to validate parallel programs under varying degrees of parallelism

Chapter 3 Arithmetic for Computers 67

3.7 Real Stuff: Floating Point in the x86

x86 FP Architecture

Originally based on 8087 FP coprocessor

8 80-bit extended-precision registers Used as a push-down stack Registers indexed from TOS: ST(0), ST(1), Converted on load/store of memory operand Integer operands can also be converted on load/store Result: poor FP performance

FP values are 32-bit or 64 in memory

Very difficult to generate and optimize code

Chapter 3 Arithmetic for Computers 68

x86 FP Instructions
Data transfer
FILD mem/ST(i) FISTP mem/ST(i) FLDPI FLD1 FLDZ

Arithmetic
FIADDP FISUBRP FIMULP FIDIVRP FSQRT FABS FRNDINT mem/ST(i) mem/ST(i) mem/ST(i) mem/ST(i)

Compare
FICOMP FIUCOMP FSTSW AX/mem

Transcendental
FPATAN F2XMI FCOS FPTAN FPREM FPSIN FYL2X

Optional variations

I: integer operand P: pop operand from stack R: reverse operand order But not all combinations allowed
Chapter 3 Arithmetic for Computers 69

Streaming SIMD Extension 2 (SSE2)

Adds 4 128-bit registers

Extended to 8 registers in AMD64/EM64T 2 64-bit double precision 4 32-bit double precision Instructions operate on them simultaneously

Can be used for multiple FP operands

Single-Instruction Multiple-Data

Chapter 3 Arithmetic for Computers 70

3.8 Fallacies and Pitfalls

Right Shift and Division

Left shift by i places multiplies an integer by 2i Right shift divides by 2i?

Only for unsigned integers Arithmetic right shift: replicate the sign bit e.g., 5 / 4

For signed integers

111110112 >> 2 = 111111102 = 2 Rounds toward

c.f. 111110112 >>> 2 = 001111102 = +62

Chapter 3 Arithmetic for Computers 71

Who Cares About FP Accuracy?

Important for scientific code

But for everyday consumer use?

My bank balance is out by 0.0002!

The Intel Pentium FDIV bug

The market expects accuracy See Colwell, The Pentium Chronicles

Chapter 3 Arithmetic for Computers 72

3.9 Concluding Remarks

Concluding Remarks

ISAs support arithmetic

Signed and unsigned integers Floating-point approximation to reals Operations can overflow and underflow Core instructions: 54 most frequently used

Bounded range and precision

MIPS ISA

100% of SPECINT, 97% of SPECFP

Other instructions: less frequent

Chapter 3 Arithmetic for Computers 73

CS M151B / EE M116C: Computer Systems Architecture
No ratings yet
CS M151B / EE M116C: Computer Systems Architecture
33 pages
CPSC 161: Prof. L.N. Bhuyan .HTML
No ratings yet
CPSC 161: Prof. L.N. Bhuyan .HTML
28 pages
COD Ch. 3 Arithmetic For Computers
No ratings yet
COD Ch. 3 Arithmetic For Computers
72 pages
CODch 4 Slides
No ratings yet
CODch 4 Slides
71 pages
Chapter 3 OnlyFor Q39 and ProblemNo 9
No ratings yet
Chapter 3 OnlyFor Q39 and ProblemNo 9
32 pages
L3e4
No ratings yet
L3e4
112 pages
Csci 136 Computer Architecture II
No ratings yet
Csci 136 Computer Architecture II
28 pages
07 CA (Computer+Arithmetic)
No ratings yet
07 CA (Computer+Arithmetic)
19 pages
PPT#04
No ratings yet
PPT#04
43 pages
Processor Design 5Z032: Arithmetic For Computers
No ratings yet
Processor Design 5Z032: Arithmetic For Computers
57 pages
Lecture 7 COMP2611 Arithmetic Part1
No ratings yet
Lecture 7 COMP2611 Arithmetic Part1
33 pages
Mips Alu
No ratings yet
Mips Alu
27 pages
Arithmetic: - Performance (Seconds, Cycles, Instructions) - Abstractions
No ratings yet
Arithmetic: - Performance (Seconds, Cycles, Instructions) - Abstractions
7 pages
Lec06-ALU
No ratings yet
Lec06-ALU
59 pages
Cse675.02.F.aludesign Part1
No ratings yet
Cse675.02.F.aludesign Part1
7 pages
Computer Arithmetic: Electrical and Computer Engineering Department
No ratings yet
Computer Arithmetic: Electrical and Computer Engineering Department
72 pages
Signed Binary Addition
No ratings yet
Signed Binary Addition
57 pages
Arithmetic-Logic Units: CPSC 321 Computer Architecture Andreas Klappenecker
No ratings yet
Arithmetic-Logic Units: CPSC 321 Computer Architecture Andreas Klappenecker
18 pages
Computer Organization & Assembly Language: CS/COE0447
No ratings yet
Computer Organization & Assembly Language: CS/COE0447
30 pages
ALU Design
No ratings yet
ALU Design
7 pages
Module 2 - Number System Arithmetic
No ratings yet
Module 2 - Number System Arithmetic
82 pages
Unit - Ii Arithmetic For Computers
No ratings yet
Unit - Ii Arithmetic For Computers
28 pages
Week9 2
No ratings yet
Week9 2
48 pages
Computer Architecture
No ratings yet
Computer Architecture
17 pages
Computer and Structure
No ratings yet
Computer and Structure
54 pages
Unit - 2 Arithmetic Unit
No ratings yet
Unit - 2 Arithmetic Unit
71 pages
3 Integer Arithmetic
No ratings yet
3 Integer Arithmetic
40 pages
Arithmetic Logic Unit: CSE 429 Digital System Design
No ratings yet
Arithmetic Logic Unit: CSE 429 Digital System Design
42 pages
DigitalLogic ComputerOrganization L13 Arithmetic Handout
No ratings yet
DigitalLogic ComputerOrganization L13 Arithmetic Handout
37 pages
CSE341 Lecture Notes Fall 2009 Arithmetic For Computers: Ex: Write - 38 in 32 Bits
No ratings yet
CSE341 Lecture Notes Fall 2009 Arithmetic For Computers: Ex: Write - 38 in 32 Bits
30 pages
CPE 232 Computer Organization MIPS Arithmetic - Part I
No ratings yet
CPE 232 Computer Organization MIPS Arithmetic - Part I
18 pages
Ca Unit 2 Prabu
No ratings yet
Ca Unit 2 Prabu
30 pages
Arithmetic
No ratings yet
Arithmetic
39 pages
Week 6: Arithmetic Functions and Circuits: Adding Two Bits
No ratings yet
Week 6: Arithmetic Functions and Circuits: Adding Two Bits
12 pages
9 Computer Arithmetics
No ratings yet
9 Computer Arithmetics
47 pages
Building An ALU L
No ratings yet
Building An ALU L
13 pages
21CS401-CA-UNIT-II_230223_190425
No ratings yet
21CS401-CA-UNIT-II_230223_190425
26 pages
1 - Bit ALU
No ratings yet
1 - Bit ALU
13 pages
Computer Arithmetic: Electrical and Computer Engineering Department
No ratings yet
Computer Arithmetic: Electrical and Computer Engineering Department
72 pages
Arith 3
No ratings yet
Arith 3
18 pages
Unit 2 - Arithmetic Unit
No ratings yet
Unit 2 - Arithmetic Unit
32 pages
Unit - Ii Arithmetic For Computers
No ratings yet
Unit - Ii Arithmetic For Computers
28 pages
Lec 14
No ratings yet
Lec 14
45 pages
ARITHMETIC and LOGIC UNIT - in This Lecture, We Will Examine How
No ratings yet
ARITHMETIC and LOGIC UNIT - in This Lecture, We Will Examine How
12 pages
Number System
No ratings yet
Number System
18 pages
MIPS Architecture - BITS Pilani
No ratings yet
MIPS Architecture - BITS Pilani
58 pages
Chapter IV Computer Arithmetic
No ratings yet
Chapter IV Computer Arithmetic
133 pages
Ch#3 Part 1 2 3
No ratings yet
Ch#3 Part 1 2 3
66 pages
Chapter 03-Yo Ver8
No ratings yet
Chapter 03-Yo Ver8
132 pages
Chapter_4_pdf
No ratings yet
Chapter_4_pdf
49 pages
Week 6 - Lecture 6 - Arithmetic Processing Unit Implementation
No ratings yet
Week 6 - Lecture 6 - Arithmetic Processing Unit Implementation
32 pages
ch1 COA New1
No ratings yet
ch1 COA New1
87 pages
Computer Architecture and Organization: The Central Processing Unit
100% (1)
Computer Architecture and Organization: The Central Processing Unit
126 pages
Chapter 3 Arithmetic For Computers
No ratings yet
Chapter 3 Arithmetic For Computers
82 pages
14 Arithmetic Circuits
No ratings yet
14 Arithmetic Circuits
52 pages
198:211 Computer Architecture: Topics
No ratings yet
198:211 Computer Architecture: Topics
35 pages
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)
Right Way to Build Half and Full Adders with Logic Gates
From Everand
Right Way to Build Half and Full Adders with Logic Gates
GURUPRASAD N H
No ratings yet
Exploring BeagleBone: Tools and Techniques for Building with Embedded Linux
From Everand
Exploring BeagleBone: Tools and Techniques for Building with Embedded Linux
Derek Molloy
4/5 (2)
C++ Learn in 24 Hours
From Everand
C++ Learn in 24 Hours
Alex Nordeen
No ratings yet
Negotiable Instruments Act PDF
No ratings yet
Negotiable Instruments Act PDF
30 pages
Answersheet For Section Wise Set of Professional Knowledge For It Officer
No ratings yet
Answersheet For Section Wise Set of Professional Knowledge For It Officer
1 page
Ques (1-5) Directions: Study The Following Information Carefully and Answer The Questions Given Below
No ratings yet
Ques (1-5) Directions: Study The Following Information Carefully and Answer The Questions Given Below
2 pages
Punit Narayan CV
No ratings yet
Punit Narayan CV
2 pages
Directions (Q. 1-5) : Study The Following Information Carefully and Answer The Questions Given Below
No ratings yet
Directions (Q. 1-5) : Study The Following Information Carefully and Answer The Questions Given Below
2 pages
Voice Recognition System Report
No ratings yet
Voice Recognition System Report
17 pages
Marketing Is The Process of Communicating The Value of A Product or Service To Customers, For The
No ratings yet
Marketing Is The Process of Communicating The Value of A Product or Service To Customers, For The
4 pages
Hardware Inv by Ti Best
No ratings yet
Hardware Inv by Ti Best
18 pages
Synopsis Automatic Phase Exchanger
No ratings yet
Synopsis Automatic Phase Exchanger
4 pages
Vehicle Speed Mearurement System Final2
No ratings yet
Vehicle Speed Mearurement System Final2
71 pages
The Stack, Subroutines, Interrupts and Resets
No ratings yet
The Stack, Subroutines, Interrupts and Resets
20 pages
Avr+lcd Report
No ratings yet
Avr+lcd Report
95 pages
Synopsis On Electromagnetic Car PDF
No ratings yet
Synopsis On Electromagnetic Car PDF
3 pages
Election Commission of India: A State-of-the-Art, User Friendly and Tamper Proof
No ratings yet
Election Commission of India: A State-of-the-Art, User Friendly and Tamper Proof
29 pages
Synopsis On Car Parking System
No ratings yet
Synopsis On Car Parking System
5 pages
Implementation of Binary To Floating Point Converter Using HDL
No ratings yet
Implementation of Binary To Floating Point Converter Using HDL
41 pages
Unit 1 and 2. Dfa
0% (1)
Unit 1 and 2. Dfa
43 pages
Problem Set 1: Collaboration
No ratings yet
Problem Set 1: Collaboration
6 pages
C Good
No ratings yet
C Good
197 pages
Maxima by Example: Ch.8: Numerical Integration: Edwin L. Woollett November 16, 2012
No ratings yet
Maxima by Example: Ch.8: Numerical Integration: Edwin L. Woollett November 16, 2012
35 pages
LH11 0112 Eng
No ratings yet
LH11 0112 Eng
782 pages
COBOL Layouts
No ratings yet
COBOL Layouts
15 pages
Ac Datatypes Ref
No ratings yet
Ac Datatypes Ref
56 pages
Lec05 Quantization I
No ratings yet
Lec05 Quantization I
70 pages
Rane Fixed Vs Floating Point Note153
No ratings yet
Rane Fixed Vs Floating Point Note153
4 pages
Compact Numerical Methods by John Nash
No ratings yet
Compact Numerical Methods by John Nash
288 pages
SystemC Methodologies and Applications-Müller-Kluwer
No ratings yet
SystemC Methodologies and Applications-Müller-Kluwer
356 pages
LCD5110 Basic
No ratings yet
LCD5110 Basic
5 pages
Eigenmath Manual
No ratings yet
Eigenmath Manual
54 pages
HMI Intouch
100% (1)
HMI Intouch
274 pages
Matlab Prog
No ratings yet
Matlab Prog
1,218 pages
CPP Sci Comp
No ratings yet
CPP Sci Comp
320 pages
Number Formats
No ratings yet
Number Formats
35 pages
STM32F3xx Training V1 - 2x PDF
100% (1)
STM32F3xx Training V1 - 2x PDF
602 pages
Fixed Point
No ratings yet
Fixed Point
3 pages
Change Log
No ratings yet
Change Log
42 pages
Accuracy of The Discrete Fourier Transform and The Fast Fourier Transform
No ratings yet
Accuracy of The Discrete Fourier Transform and The Fast Fourier Transform
18 pages
GFK2259E
No ratings yet
GFK2259E
206 pages
Using The ADSP-2100 Family Volume 1 PDF
No ratings yet
Using The ADSP-2100 Family Volume 1 PDF
606 pages
Aca
No ratings yet
Aca
71 pages
SQR
100% (1)
SQR
81 pages
Basic Cio
No ratings yet
Basic Cio
10 pages
Andor Software Development Kit 3
No ratings yet
Andor Software Development Kit 3
67 pages
Numerical Methods With MATLAB PDF
No ratings yet
Numerical Methods With MATLAB PDF
189 pages
Spectation Communication Manual en
No ratings yet
Spectation Communication Manual en
110 pages