Computer Architecture Unit 1 - Phase1 PDF
Computer Architecture Unit 1 - Phase1 PDF
ORGANIZATION
Course Objective:
To understand how computers are constructed out of a set of functional
units and how the functional units operate, interact and communicate.
To make the students to understand the concept of interfacing memory and
various I/O devices to a computer system using a suitable bus system.
Text Books:
1. C. Hamacher, Z. Vranesic and S. Zaky, "Computer Organization", McGraw- Hill, 2012.
(6 th edition)
2. W. Stallings, "Computer Organization and Architecture - Designing for Performance",
Prentice Hall of India, 2012. (9th Edition)
References:
1. D. A. Patterson and J. L. Hennessy, "Computer Organization and Design The
Hardware/Software Interface", Morgan Kaufmann, 2013. (5 th edition)
2. J .P. Hayes, "Computer Architecture and Organization", McGraw-Hill, (3 rd edition)
3. Behrooz Parhami, Computer Architecture-From Microprocessors to Supercomputers,
Oxford University Press, 2014.
Highlevel
view
Computer archit ecture
Computer organization
Electronic components
Software
Circuit designer
Logic designer
Computer designer
System designer
Application designer
Application domains
Clients taste:
mood, style, . . .
Goals
Interface
Means
Engineering
Arts
Computer Architecture,
Background and Motivation
Architect
Interface
Analog
Digital
Stored-program
Electronic
General-purpose
Number cruncher
Nonelectronic
Special-purpose
Data manipulator
The space of computer systems, with what we normally mean by the word
computer highlighted.
Computer Architecture,
Background and Motivation
Fixed-function
COMPUTER TYPES
Computers are classified based on the
parameters like
Speed of operation
Size
Cost
Computational power
Type of application
Embedded computers are integrated into a larger device or system in
order to automatically monitor and control a physical process or
environment.
They are used for a specific purpose rather than for general processing
tasks.
Typical applications include industrial and home automation, appliances,
telecommunication products, and vehicles.
Personal computers
have achieved widespread use in homes,
educational institutions, and business and
engineering office settings, primarily for dedicated
individual use.
A number of classifications are used for personal
computers.
Desktop computers
serve general needs and fit within a typical
personal workspace.
Advantage: Cost effective, easy to operate, suitable
for general purpose educational or business
application
Workstation computers
offer higher computational capacity and
more powerful graphical display capabilities for
engineering and scientific work.
More computational power than PC.
Costlier
Portable and Notebook computers
provide the basic features of a personal
computer in a smaller lightweight package.
They can operate on batteries to provide
mobility.
Compact form of personal computer
(laptop)
Advantage is portability
Enterprise systems(Mainframes)
are large computers that are meant to be
shared by a potentially large number of users who
access them from some form of personal
computer over a public or private network.
Price/Performance Pyramid
$Millions
Mainfram e
$100s Ks
Server
Workstation
Personal
Embedded
$10s Ks
$1000s
$100s
$10s
Computer Architecture,
Background and Motivation
Super
Data are numbers and characters that are used as operands by the instructions.
Data are also stored in the memory.
Data are the operands upon which instructions operate.
Data could be:
Numbers,
Encoded characters.
Data, in a broad sense means any digital information.
Computers use data that is encoded as a string of binary digits called bits.
The instructions and data handled by a computer must be encoded in a suitable
format.
Input unit
Binary information must be presented to a computer in a specific format.
This task is performed by the input unit:
- Interfaces with input devices.
- Accepts binary information from the input devices.
- Presents this binary information in a format expected by the computer.
- Transfers this information to the memory or processor.
Real world
Keyboard
Joysticks
Track balls
Mouse
Microphones
(Audio input)
Cameras
(video input)
Internet
Computer
Memory
Input Unit
Processor
Memory unit
Memory unit stores instructions and data.
Recall, data is represented as a series of bits.
To store data, memory unit thus stores bits.
Processor reads instructions and reads/writes data from/to the memory during
the execution of a program.
In theory, instructions and data could be fetched one bit at a time.
In practice, a group of bits is fetched at a time.
Group of bits stored or retrieved at a time is termed as word
Number of bits in a word is termed as the word length of a computer.
In order to read/write to and from memory, a processor should know where to
look:
Address is associated with each word location.
Part No:
Data bus
width
Memory size
Operating
frequency
INTEL
8085
3 MHz
8086
16
5 MHz
80186
16
80286
16
16MB (224 )
8 MHz
80386DX
32
4GB (232 )
33 MHz
80486DX
32
50 MHz
Pentium
64
60 MHz &
66 MHz
Pentium Pro
64
Pentium II
64
233 MHz to
450 MHz
Part No:
Data bus
width
Memory size
Operating
frequency
Pentium II
Xeon
64
450 MHz
Pentium III
Pentium 4
64
1 GHz
1.3 GHz .4GHz
Motorola
6800
64 K
2 MHz
68000
16
16M
8-20 MHz
68020
32
4G
12 MHz to
33 MHz.
68040
32
4G +8K Cache
40 MHz
68060
64
4G +16K Cache
66 MHz and
75 MHz.
POWER PC
64
4G +32K Cache
90 to 120 MHz
address
CPU
data
cache
address
data
main
memory
Multi-Level Caches
Processor
Regs
L1
d-cache
L1
i-cache
size:
speed:
$/Mbyte:
line size:
200 B
3 ns
8-64 KB
3 ns
Unified
L2
Cache
Memory
disk
80486 DX
CPU1
CPU
CPU2
Co pros
Co processor
16 K L1 cache
8K L1 cache
CPU1 CPU2
CPU3
Co
pros
CPU2
CPU3
32 K L1 cache
16 K L1 cache
256 K L2 cache
512 K L2 cache
Co
Pros
Output unit
Computers represent information in a specific binary form.
Output units:
- Interface with output devices.
- Accept processed results provided by the computer in specific binary form.
- Convert the information in binary form to a form understood by an
output device.
Computer
Memory
Output Unit
Processor
Printer
Graphics display
Speakers
Control unit
Operation of a computer can be summarized as:
Accepts information from the input units (Input unit).
Stores the information (Memory).
Processes the information (ALU).
Provides processed results through the output units (Output unit).
Operations of Input unit, Memory, ALU and Output unit are coordinated by
Control unit.
Control unit generates timing signals which determines when a particular
operation takes place.
Output
Memory
Processor
32
Input
Bus
Generations of Progress
The 5 generations of digital computers, and their ancestors.
Processor
technology
Memory
innovations
I/O devices
introduced
Dominant
look & fell
0 (1600s)
(Electro-)
mechanical
Wheel, card
Lever, dial,
punched card
Factory
equipment
1 (1950s)
Vacuum tube
Magnetic drum
Paper tape,
magnetic tape
Hall-size
cabinet
2 (1960s)
Transistor
Magnetic core
Drum, printer,
text terminal
Room-size
mainframe
3 (1970s)
SSI/MSI
RAM/ROM
chip
Disk, keyboard,
video monitor
Desk-size mini
4 (1980s)
LSI/VLSI
SRAM/DRAM
Network, CD,
mouse,sound
Desktop/
laptop micro
5 (1990s)
ULSI/GSI/
WSI,
SOC,SOP
SDRAM, flash
Sensor/actuator,
point/click
Invisible,
embedded
Computer Architecture,
Background and Motivation
Generation
(begun)
Silicon
crystal
ingot
Slicer
15-30
cm
x x
x x x
x
x
x
x
x x
Patterned wafer
Processing:
20-30 steps
0.2 cm
Dicer
Die
~1 cm
Die
tester
Good
die
Microchip
or other part
Mounting
~1 cm
Part
tester
Usable
part
to ship
Computer Architecture,
Background and Motivation
30-60 cm
Moores Law
TIPS
Tb
Memory
GIPS
80486
R10000
Pentium II
Pentium
256Mb
68040
64Mb
Gb
1Gb
16Mb
80386
68000
MIPS
80286
4Mb
1Mb
Mb
4 / 3 yrs
256kb
64kb
kIPS
1980
1990
2000
kb
2010
Calendar year
Short time challenge: putting processor and memory on the same device.
Computer Architecture,
Background and Motivation
Processor performance
1.6 / yr
2 / 18 mos
10 / 5 yrs
Processor
Various packaging schemes are used for connecting the processor, memory
and other blocks, depending on the computer type and cost/performance
targets.
Memory chips are mounted on small printed circuit boards known as
daughter cards.
Die
PC board
Bus
CPU
Connector
Memory
(a) 2D or 2.5D packaging now common
Stacked layers
glued together
Computer Architecture,
Background and Motivation
Backplane
Interlayer connections
deposited on the
outside of the stack
Floppy
disk
..
(a) Cutaway view of a hard disk drive
CD-ROM
..
.
Magnetic
tape
cartridge
Computer Architecture,
Background and Motivation
Typically
2-9 cm
Communication Technologies
10 12
Geographically distributed
I/O
network
System-area
network
(SAN)
Local-area
network
(LAN)
10 9
Metro-area
network
(MAN)
10 6
Same geographic location
10 3
10 9
(ns)
10 6
(s)
10 3
(ms)
Computer Architecture,
Background and Motivation
Bandwidth (b/s)
Processor
bus
Wide-area
network
(WAN)
(min)
10 3
Latency (s)
(h)
One task =
many statements
add
add
add
lw
lw
sw
sw
jr
One statement =
several instructions
$2,$5,$5
$2,$2,$2
$2,$4,$2
$15,0($2)
$16,4($2)
$16,0($2)
$15,4($2)
$31
Machine
language
instructions,
binary (hex)
00a51020
00421020
00821020
8c620000
8cf20004
acf20000
ac620004
03e00008
Mostly one-to-one
Computer Architecture,
Background and Motivation
temp=v[i]
v[i]=v[i+1]
v[i+1]=temp
Assembly
language
instructions,
mnemonic
Compiler
Swap v[i]
and v[i+1]
High-level
language
statements
Interpreter
Very
high-level
language
objectives
or tasks
Assembler
word processor,
spreadsheet,
circuit simulator,
.. .
System
Operating system
Translator:
Manager:
Enabler:
Coordinator:
MIPS assembler,
C compiler,
.. .
virtual memory,
security,
file system,
.. .
disk driver,
display driver,
printing,
.. .
scheduling,
load balancing,
diagnostics,
.. .
Computer Architecture,
Background and Motivation
Application:
Power in IC
Power also provides challenges as devices are scaled.
Dynamic power (watts, W)in CMOS chip: the traditional dominant
energy consumption has been in switching transistors.
Powerdynamic
1
Capacitive load Voltage 2 Frequency switched
2
0
.
85
0.61
2
Powerold
Voltage Frequency switch
2
Static power: an important issue because leakage current flows even when
a transistor is off:
No.of transistor , leakage current , power ;
Feature size , leakage current , power (why? You can find out in
VLSI area).
As a result, very low power systems are even gating the voltage to
inactive modules to control loss due to leakage.
Cost of die Cost of testing die Cost of packaging and final test
Final test yield
Wafer radius
Wafer diameter
# Dies per wafer
Die area
2 Die area
2
Find the number of dies per 300 mm (30 cm) wafer for a die that is 1.5 cm
on a side.
The total die area is 2.25 cm2. Thus
Wafer radius
Wafer diameter
# Dies per wafer
Die area
2 Die area
2
30 / 22
2.25
30
2 2.25
706.5 94.2
270
2.25 2.12
Execution time Y
n
Execution time X
Performance Measuring
Execution is the reciprocal of performance,
Performance X
1
Execution time X
1
Execution Time Y Performance Y Performance X
n
1
Execution Time X
Performance Y
Performance X