
COMPUTER ORGANIZATION AND DESIGN
5th Edition
The Hardware/Software Interface

Chapter 5
Large and Fast: Exploiting Memory Hierarchy
§5.1 Introduction

Principle of Locality
• Programs access a small proportion of their address space at any time
• Temporal locality
  - Items accessed recently are likely to be accessed again soon
  - e.g., instructions in a loop, induction variables
• Spatial locality
  - Items near those accessed recently are likely to be accessed soon
  - e.g., sequential instruction access, array data
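To make this concrete, here is a minimal C example (illustrative, not from the slides) in which a single loop exhibits both kinds of locality:

#include <stdio.h>

#define N 1024

int main(void) {
    static int a[N];   /* contiguous array */
    int sum = 0;
    /* Temporal locality: the loop's instructions and the variables i and
       sum are reused on every iteration.  Spatial locality: a[0], a[1], ...
       are adjacent in memory, so one fetched cache block serves several
       consecutive accesses. */
    for (int i = 0; i < N; i++)
        sum += a[i];
    printf("%d\n", sum);
    return 0;
}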
Taking Advantage of Locality
• Memory hierarchy
• Store everything on disk
• Copy recently accessed (and nearby) items from disk to smaller DRAM memory
  - Main memory
• Copy more recently accessed (and nearby) items from DRAM to smaller SRAM memory
  - Cache memory attached to CPU


Memory Hierarchy Levels
• Block (aka line): unit of copying
  - May be multiple words
• If accessed data is present in upper level
  - Hit: access satisfied by upper level
    - Hit ratio: hits/accesses
• If accessed data is absent
  - Miss: block copied from lower level
    - Time taken: miss penalty
    - Miss ratio: misses/accesses = 1 – hit ratio
  - Then accessed data supplied from upper level


§5.2 Memory Technologies

Memory Technology
• Static RAM (SRAM)
  - 0.5ns – 2.5ns, $2000 – $5000 per GB
• Dynamic RAM (DRAM)
  - 50ns – 70ns, $20 – $75 per GB
• Magnetic disk
  - 5ms – 20ms, $0.20 – $2 per GB
• Ideal memory
  - Access time of SRAM
  - Capacity and cost/GB of disk


§5.3 The Basics of Caches

Cache Memory
• Cache memory
  - The level of the memory hierarchy closest to the CPU
• Given accesses X1, …, Xn–1, Xn
  - How do we know if the data is present?
  - Where do we look?


Direct Mapped Cache
• Location determined by address
• Direct mapped: only one choice
  - (Block address) modulo (#Blocks in cache)
• #Blocks is a power of 2
  - Use low-order address bits
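A minimal C sketch of this index calculation (the 8-block, 1-word-per-block geometry matches the example on the following slides; all names are ours):

#include <stdio.h>

#define N_BLOCKS 8u   /* matches the example: 8 blocks, 1 word/block */

int main(void) {
    unsigned word_addr = 22;               /* word address from the example */
    /* #Blocks is a power of 2, so "modulo #Blocks" is just the
       low-order bits of the block address. */
    unsigned index = word_addr % N_BLOCKS; /* == word_addr & (N_BLOCKS - 1) */
    unsigned tag   = word_addr / N_BLOCKS; /* remaining high-order bits */
    printf("word %u -> index %u, tag %u\n", word_addr, index, tag);
    /* Prints: word 22 -> index 6 (110 binary), tag 2 (10 binary) */
    return 0;
}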


Tags and Valid Bits
• How do we know which particular block is stored in a cache location?
  - Store block address as well as the data
  - Actually, only need the high-order bits
  - Called the tag
• What if there is no data in a location?
  - Valid bit: 1 = present, 0 = not present
  - Initially 0
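A minimal C sketch of a direct-mapped lookup combining the tag and valid bit (the structure and names are illustrative assumptions, not the book's):

#include <stdbool.h>
#include <stdio.h>

#define N_BLOCKS 8u

/* One cache entry: valid bit, tag, and the cached word. */
struct line {
    bool     valid;
    unsigned tag;
    unsigned data;
};

static struct line cache[N_BLOCKS];  /* statics are zero-initialized,
                                        so all valid bits start at 0 */

/* Returns true on a hit and places the word in *out. */
bool lookup(unsigned block_addr, unsigned *out) {
    unsigned index = block_addr % N_BLOCKS;
    unsigned tag   = block_addr / N_BLOCKS; /* bits above the index */
    if (cache[index].valid && cache[index].tag == tag) {
        *out = cache[index].data;           /* hit: upper level supplies data */
        return true;
    }
    return false;                           /* miss: fetch from lower level */
}

int main(void) {
    unsigned word;
    printf("%s\n", lookup(22, &word) ? "hit" : "miss"); /* miss: cache empty */
    return 0;
}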


Cache Example
• 8 blocks, 1 word/block, direct mapped
• Initial state

Index  V  Tag  Data
000    N
001    N
010    N
011    N
100    N
101    N
110    N
111    N


Cache Example

Word addr  Binary addr  Hit/miss  Cache block
22         10 110       Miss      110

Index  V  Tag  Data
000    N
001    N
010    N
011    N
100    N
101    N
110    Y  10   Mem[10110]
111    N


Cache Example

Word addr  Binary addr  Hit/miss  Cache block
26         11 010       Miss      010

Index  V  Tag  Data
000    N
001    N
010    Y  11   Mem[11010]
011    N
100    N
101    N
110    Y  10   Mem[10110]
111    N


Cache Example

Word addr  Binary addr  Hit/miss  Cache block
22         10 110       Hit       110
26         11 010       Hit       010

Index  V  Tag  Data
000    N
001    N
010    Y  11   Mem[11010]
011    N
100    N
101    N
110    Y  10   Mem[10110]
111    N


Cache Example

Word addr  Binary addr  Hit/miss  Cache block
16         10 000       Miss      000
3          00 011       Miss      011
16         10 000       Hit       000

Index  V  Tag  Data
000    Y  10   Mem[10000]
001    N
010    Y  11   Mem[11010]
011    Y  00   Mem[00011]
100    N
101    N
110    Y  10   Mem[10110]
111    N


Cache Example

Word addr  Binary addr  Hit/miss  Cache block
18         10 010       Miss      010

Index  V  Tag  Data
000    Y  10   Mem[10000]
001    N
010    Y  10   Mem[10010]
011    Y  00   Mem[00011]
100    N
101    N
110    Y  10   Mem[10110]
111    N
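Putting the last few slides together, a self-contained C simulation (illustrative) that replays the example's word-address sequence and reproduces the hit/miss outcomes above:

#include <stdbool.h>
#include <stdio.h>

#define N_BLOCKS 8u

struct line { bool valid; unsigned tag; };

int main(void) {
    struct line cache[N_BLOCKS] = {0};           /* all valid bits start at 0 */
    unsigned trace[] = {22, 26, 22, 26, 16, 3, 16, 18};
    for (unsigned i = 0; i < sizeof trace / sizeof trace[0]; i++) {
        unsigned addr  = trace[i];
        unsigned index = addr % N_BLOCKS;
        unsigned tag   = addr / N_BLOCKS;
        bool hit = cache[index].valid && cache[index].tag == tag;
        if (!hit) {                              /* miss: fill/replace block */
            cache[index].valid = true;
            cache[index].tag   = tag;
        }
        printf("addr %2u -> index %u: %s\n", addr, index, hit ? "hit" : "miss");
    }
    /* Prints: miss, miss, hit, hit, miss, miss, hit, miss -- as in the tables. */
    return 0;
}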


Address Subdivision
[Figure in the original slides: how a memory address divides into tag, cache index, and byte-offset fields]


Example: Larger Block Size
• 64 blocks, 16 bytes/block
• To what block number does byte address 1200 map?
  - Block address = 1200/16 = 75
  - Block number = 75 modulo 64 = 11
• Address fields: bits 31–10 = Tag (22 bits), bits 9–4 = Index (6 bits), bits 3–0 = Offset (4 bits)
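A quick check of this arithmetic in C, using the field widths above (variable names are ours):

#include <stdio.h>

int main(void) {
    unsigned addr   = 1200;               /* byte address from the example */
    unsigned offset = addr & 0xF;         /* bits 3..0  (16 bytes/block)   */
    unsigned index  = (addr >> 4) & 0x3F; /* bits 9..4  (64 blocks)        */
    unsigned tag    = addr >> 10;         /* bits 31..10                   */
    printf("offset=%u index=%u tag=%u\n", offset, index, tag);
    /* Prints offset=0 index=11 tag=1: block address 75 maps to block 11. */
    return 0;
}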


Block Size Considerations
• Larger blocks should reduce miss rate
  - Due to spatial locality
• But in a fixed-sized cache
  - Larger blocks → fewer of them
    - More competition → increased miss rate
  - Larger blocks → pollution
• Larger miss penalty
  - Can override benefit of reduced miss rate
  - Early restart and critical-word-first can help


Cache Misses
• On cache hit, CPU proceeds normally
• On cache miss
  - Stall the CPU pipeline
  - Fetch block from next level of hierarchy
  - Instruction cache miss
    - Restart instruction fetch
  - Data cache miss
    - Complete data access


Write-Through
• On data-write hit, could just update the block in cache
  - But then cache and memory would be inconsistent
• Write through: also update memory
• But this makes writes take longer
  - e.g., if base CPI = 1, 10% of instructions are stores, and a write to memory takes 100 cycles:
    - Effective CPI = 1 + 0.1 × 100 = 11
• Solution: write buffer
  - Holds data waiting to be written to memory
  - CPU continues immediately
    - Only stalls on a write if the write buffer is already full
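A sketch of the write-buffer idea as a small software ring buffer (the depth and names are assumptions; a real write buffer is a hardware FIFO):

#include <stdbool.h>
#include <stdio.h>

#define WB_SLOTS 4u   /* assumed buffer depth */

struct wbuf {
    unsigned addr[WB_SLOTS], data[WB_SLOTS];
    unsigned head, tail, count;
};

/* CPU side: enqueue a store; returns false (CPU must stall) only when full. */
bool wbuf_put(struct wbuf *b, unsigned addr, unsigned data) {
    if (b->count == WB_SLOTS) return false;   /* buffer full: stall */
    b->addr[b->tail] = addr;
    b->data[b->tail] = data;
    b->tail = (b->tail + 1) % WB_SLOTS;
    b->count++;
    return true;                              /* CPU continues immediately */
}

/* Memory side: drain one pending write whenever memory is free. */
bool wbuf_drain(struct wbuf *b, unsigned *addr, unsigned *data) {
    if (b->count == 0) return false;
    *addr = b->addr[b->head];
    *data = b->data[b->head];
    b->head = (b->head + 1) % WB_SLOTS;
    b->count--;
    return true;
}

int main(void) {
    struct wbuf b = {0};
    printf("store accepted: %d\n", wbuf_put(&b, 0x100, 42)); /* 1: no stall */
    return 0;
}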


Write-Back
• Alternative: on data-write hit, just update the block in cache
  - Keep track of whether each block is dirty
• When a dirty block is replaced
  - Write it back to memory
  - Can use a write buffer to allow the replacing block to be read first
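A minimal C sketch of write-back behavior with a dirty bit (the memory array stands in for the next level of the hierarchy; all names are assumptions):

#include <stdbool.h>
#include <stdio.h>

#define MEM_BLOCKS 256u
static unsigned memory[MEM_BLOCKS];   /* stands in for the next level */

struct line { bool valid, dirty; unsigned tag, data; };

/* Write hit: update only the cached copy and mark the block dirty. */
void write_hit(struct line *l, unsigned data) {
    l->data  = data;
    l->dirty = true;                   /* memory is now stale for this block */
}

/* Replacement: write the old block back only if it is dirty. */
void replace(struct line *l, unsigned old_block, unsigned new_block,
             unsigned new_tag) {
    if (l->valid && l->dirty)
        memory[old_block] = l->data;   /* write-back on eviction */
    l->tag   = new_tag;
    l->data  = memory[new_block];      /* fetch the new block */
    l->valid = true;
    l->dirty = false;
}

int main(void) {
    struct line l = {0};
    replace(&l, 0, 5, 1);              /* fill with block 5 */
    write_hit(&l, 42);                 /* dirty the cached copy */
    replace(&l, 5, 9, 2);              /* eviction writes 42 back to block 5 */
    printf("memory[5] = %u\n", memory[5]);   /* prints 42 */
    return 0;
}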


Write Allocation
• What should happen on a write miss?
• Alternatives for write-through
  - Allocate on miss: fetch the block
  - Write around: don't fetch the block
    - Since programs often write a whole block before reading it (e.g., initialization)
• For write-back
  - Usually fetch the block


§5.4 Measuring and Improving Cache Performance

Measuring Cache Performance
• Components of CPU time
  - Program execution cycles
    - Includes cache hit time
  - Memory stall cycles
    - Mainly from cache misses
• With simplifying assumptions:

  Memory stall cycles
    = (Memory accesses / Program) × Miss rate × Miss penalty
    = (Instructions / Program) × (Misses / Instruction) × Miss penalty
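A hypothetical worked example of the formula above (every number below is an assumption, chosen only to exercise the calculation):

#include <stdio.h>

int main(void) {
    double instructions    = 1e9;     /* instruction count           */
    double misses_per_inst = 0.02;    /* misses per instruction      */
    double miss_penalty    = 100.0;   /* cycles per miss             */
    double base_cpi        = 1.0;     /* CPI with a perfect cache    */

    double stall_cycles = instructions * misses_per_inst * miss_penalty;
    double cpi          = base_cpi + misses_per_inst * miss_penalty;
    printf("memory stall cycles = %.0f\n", stall_cycles); /* 2e9 cycles   */
    printf("effective CPI = %.2f\n", cpi);                /* 1 + 2 = 3.00 */
    return 0;
}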
Performance Summary
• When CPU performance increases
  - Miss penalty becomes more significant
• Decreasing base CPI
  - Greater proportion of time spent on memory stalls
• Increasing clock rate
  - Memory stalls account for more CPU cycles
• Can't neglect cache behavior when evaluating system performance


Associative Caches
• Fully associative
  - Allow a given block to go in any cache entry
  - Requires all entries to be searched at once
  - Comparator per entry (expensive)
• n-way set associative
  - Each set contains n entries
  - Block number determines which set
    - (Block number) modulo (#Sets in cache)
  - Search all entries in a given set at once
  - n comparators (less expensive)
Associative Cache Example
[Figure in the original slides: where one block may be placed in direct-mapped, set-associative, and fully associative caches]


Spectrum of Associativity
• For a cache with 8 entries
[Figure in the original slides: the same 8 entries organized as 1-way (direct mapped), 2-way, 4-way, and 8-way (fully associative)]


Associativity Example
• Compare 4-block caches
  - Direct mapped, 2-way set associative, fully associative
• Block access sequence: 0, 8, 0, 6, 8
• Direct mapped

Block addr  Cache index  Hit/miss  Cache content after access
0           0            miss      index 0: Mem[0]
8           0            miss      index 0: Mem[8]
0           0            miss      index 0: Mem[0]
6           2            miss      index 0: Mem[0], index 2: Mem[6]
8           0            miss      index 0: Mem[8], index 2: Mem[6]


Associativity Example
• 2-way set associative

Block addr  Set index  Hit/miss  Cache content after access
0           0          miss      set 0: Mem[0]
8           0          miss      set 0: Mem[0], Mem[8]
0           0          hit       set 0: Mem[0], Mem[8]
6           0          miss      set 0: Mem[0], Mem[6]
8           0          miss      set 0: Mem[8], Mem[6]

• Fully associative

Block addr  Hit/miss  Cache content after access
0           miss      Mem[0]
8           miss      Mem[0], Mem[8]
0           hit       Mem[0], Mem[8]
6           miss      Mem[0], Mem[8], Mem[6]
8           hit       Mem[0], Mem[8], Mem[6]
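For reference, a compact C simulation of the 2-way case with LRU replacement (illustrative); it reproduces the hit/miss column of the 2-way table above:

#include <stdbool.h>
#include <stdio.h>

#define N_SETS 2   /* 4 blocks, 2-way set associative */
#define WAYS   2

struct way { bool valid; unsigned tag; };
struct set { struct way w[WAYS]; int lru; };   /* lru = way to evict next */

int main(void) {
    struct set cache[N_SETS] = {0};
    unsigned trace[] = {0, 8, 0, 6, 8};        /* block address sequence */
    for (int i = 0; i < 5; i++) {
        unsigned blk = trace[i];
        struct set *s = &cache[blk % N_SETS];  /* (block number) mod (#sets) */
        unsigned tag  = blk / N_SETS;
        bool hit = false;
        int way = -1;
        for (int w = 0; w < WAYS; w++)
            if (s->w[w].valid && s->w[w].tag == tag) { hit = true; way = w; }
        if (!hit) {                            /* miss: fill the LRU way */
            way = s->lru;
            s->w[way].valid = true;
            s->w[way].tag   = tag;
        }
        s->lru = 1 - way;   /* 2-way LRU: the other way is now older */
        printf("block %u: %s\n", blk, hit ? "hit" : "miss");
    }
    /* Prints: miss, miss, hit, miss, miss -- as in the table above. */
    return 0;
}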


How Much Associativity
• Increased associativity decreases miss rate
  - But with diminishing returns
• Simulation of a system with 64KB D-cache, 16-word blocks, SPEC2000:
  - 1-way: 10.3%
  - 2-way: 8.6%
  - 4-way: 8.3%
  - 8-way: 8.1%


Set Associative Cache Organization
[Figure in the original slides: the indexed set is read out, each way's tag is compared in parallel, and a multiplexor selects the hit way]


Replacement Policy
• Direct mapped: no choice
• Set associative
  - Prefer non-valid entry, if there is one
  - Otherwise, choose among entries in the set
• Least-recently used (LRU)
  - Choose the one unused for the longest time
    - Simple for 2-way, manageable for 4-way, too hard beyond that
• Random
  - Gives approximately the same performance as LRU for high associativity
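A sketch of victim selection following these rules (the per-way timestamp is an illustrative assumption; real hardware approximates LRU with a few status bits per set):

#include <stdbool.h>
#include <stdio.h>

#define WAYS 4

struct way { bool valid; unsigned tag; unsigned last_used; };

/* Choose a victim: prefer a non-valid entry; otherwise evict the way
   with the oldest last_used stamp. */
int choose_victim(const struct way set[WAYS]) {
    int victim = 0;
    for (int w = 0; w < WAYS; w++) {
        if (!set[w].valid) return w;           /* empty entry wins outright */
        if (set[w].last_used < set[victim].last_used)
            victim = w;                        /* older access = more LRU   */
    }
    return victim;
}

int main(void) {
    struct way set[WAYS] = {
        {true, 1, 30}, {true, 2, 10}, {true, 3, 40}, {true, 4, 20}
    };
    printf("evict way %d\n", choose_victim(set)); /* way 1 (oldest stamp) */
    return 0;
}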


Multilevel Caches
• Primary cache attached to CPU
  - Small, but fast
• Level-2 cache services misses from primary cache
  - Larger, slower, but still faster than main memory
• Main memory services L2 cache misses
• Some high-end systems include an L3 cache
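To see why an L2 helps, a hypothetical CPI calculation in the style of the earlier write-through example (all rates and latencies below are assumptions):

#include <stdio.h>

int main(void) {
    double base_cpi       = 1.0;
    double l1_miss_rate   = 0.02;    /* L1 misses per instruction         */
    double global_l2_miss = 0.005;   /* misses that also miss in L2       */
    double l2_latency     = 10.0;    /* cycles to reach L2                */
    double mem_latency    = 100.0;   /* cycles to reach main memory       */

    /* Every L1 miss pays the L2 access; only the misses that also miss
       in L2 pay the full trip to main memory. */
    double cpi = base_cpi
               + l1_miss_rate * l2_latency
               + global_l2_miss * mem_latency;
    printf("effective CPI = %.2f\n", cpi);   /* 1 + 0.2 + 0.5 = 1.70 */

    /* Without the L2, every L1 miss would pay mem_latency instead:
       1 + 0.02 * 100 = 3.00. */
    return 0;
}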


§5.16 Concluding Remarks

Concluding Remarks
• Fast memories are small, large memories are slow
  - We really want fast, large memories ☹
  - Caching gives this illusion ☺
• Principle of locality
  - Programs use a small part of their memory space frequently
• Memory system design is critical for multiprocessors
