11 Cache Memory

Chapter 8 of ECEG-4123 discusses cache memory, its operation, design, and various mapping techniques including direct, associative, and set associative mapping. It highlights the importance of cache in the memory hierarchy, the impact of locality of reference, and the trade-offs involved in cache size and organization. Additionally, it covers replacement algorithms and write policies to manage cache effectively.

Uploaded by

surafeltadese315

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

11 Cache Memory

Uploaded by

surafeltadese315

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 40

ECEG-4123 Computer Architecture

and Organization

Chapter 8
Cache Memory
Memory Hierarchy - Diagram
Locality of Reference
• During the course of the execution of a
program, memory references tend to
cluster
• e.g. loops
Cache
• Small amount of fast memory
• Sits between normal main memory and
CPU
• May be located on CPU chip or module
Cache and Main Memory
Cache/Main Memory Structure
Cache operation – overview
• CPU requests contents of memory location
• Check cache for this data
• If present, get from cache (fast)
• If not present, read required block from
main memory to cache
• Then deliver from cache to CPU
• Cache includes tags to identify which
block of main memory is in each cache
slot
Cache Read Operation - Flowchart
Cache Design
• Addressing
• Size
• Mapping Function
• Replacement Algorithm
• Write Policy
• Block Size
• Number of Caches
Cache Addressing
• Where does cache sit?
— Between processor and virtual memory management
unit
— Between MMU and main memory
• Logical cache (virtual cache) stores data using
virtual addresses
— Processor accesses cache directly, not MMU
— Cache access faster, before MMU address translation
• Physical cache stores data using main memory
physical addresses
Size does matter
• Cost
—More cache is expensive
• Speed
—More cache is faster (up to a point)
—Checking cache for data takes time
Typical Cache Organization
Comparison of Cache Sizes
Processor Type Year of Introduction L1 cache L2 cache L3 cache
IBM 360/85 Mainframe 1968 16 to 32 KB — —
PDP-11/70 Minicomputer 1975 1 KB — —
VAX 11/780 Minicomputer 1978 16 KB — —
IBM 3033 Mainframe 1978 64 KB — —
IBM 3090 Mainframe 1985 128 to 256 KB — —
Intel 80486 PC 1989 8 KB — —
Pentium PC 1993 8 KB/8 KB 256 to 512 KB —
PowerPC 601 PC 1993 32 KB — —
PowerPC 620 PC 1996 32 KB/32 KB — —
PowerPC G4 PC/server 1999 32 KB/32 KB 256 KB to 1 MB 2 MB
IBM S/390 G4 Mainframe 1997 32 KB 256 KB 2 MB
IBM S/390 G6 Mainframe 1999 256 KB 8 MB —
Pentium 4 PC/server 2000 8 KB/8 KB 256 KB —
High-end server/
IBM SP supercomputer 2000 64 KB/32 KB 8 MB —
CRAY MTAb Supercomputer 2000 8 KB 2 MB —
Itanium PC/server 2001 16 KB/16 KB 96 KB 4 MB
SGI Origin 2001 High-end server 2001 32 KB/32 KB 4 MB —
Itanium 2 PC/server 2002 32 KB 256 KB 6 MB
IBM POWER5 High-end server 2003 64 KB 1.9 MB 36 MB
CRAY XD-1 Supercomputer 2004 64 KB/64 KB 1MB —
Mapping Function
• Cache of 64kByte
• Cache block of 4 bytes
—i.e. cache is 16k (214) lines of 4 bytes
• 16MBytes main memory
• 24 bit address
—(224=16M)
Direct Mapping
• Each block of main memory maps to only
one cache line
—i.e. if a block is in cache, it must be in one
specific place
• Address is in two parts
• Least Significant w bits identify unique
word
• Most Significant s bits specify one memory
block
• The MSBs are split into a cache line field r
and a tag of s-r (most significant)
Direct Mapping
Address Structure

Tag s-r Line or Slot r Word w

8 14 2

• 24 bit address
• 2 bit word identifier (4 byte block)
• 22 bit block identifier
— 8 bit tag (=22-14)
— 14 bit slot or line
• No two blocks in the same line have the same Tag field
• Check contents of cache by finding line and checking Tag
Direct Mapping from Cache to Main Memory
Direct Mapping
Cache Line Table

Cache line Main Memory blocks held

0 0, m, 2m, 3m…2s-m

1 1,m+1, 2m+1…2s-m+1

…
m-1 m-1, 2m-1,3m-1…2s-1
Direct Mapping Cache Organization
Direct
Mapping
Example
Direct Mapping Summary
• Address length = (s + w) bits
• Number of addressable units = 2s+w words
or bytes
• Block size = line size = 2w words or bytes
• Number of blocks in main memory
= 2s+w/2w = 2s
• Number of lines in cache = m = 2r
• Size of tag = (s – r) bits
Direct Mapping pros & cons
• Simple
• Inexpensive
• Fixed location for given block
—If a program accesses 2 blocks that map to the
same line repeatedly, cache misses are very
high
Victim Cache
• Lower miss penalty
• Remember what was discarded
—Already fetched
—Use again with little penalty
• Fully associative
• 4 to 16 cache lines
• Between direct mapped L1 cache and next
memory level
Associative Mapping
• A main memory block can load into any
line of cache
• Memory address is interpreted as tag and
word
• Tag uniquely identifies block of memory
• Every line’s tag is examined for a match
• Cache searching gets expensive
Associative Mapping from
Cache to Main Memory
Fully Associative Cache Organization
Associative Mapping
Address Structure

Word
Tag 22 bit 2 bit
• 22 bit tag stored with each 32 bit block of data
• Compare tag field with tag entry in cache to
check for hit
• Least significant 2 bits of address identify which
8 bit word is required from 32 bit data block
• e.g.
— Address Tag Data Cache
line
— 3FFFFF 3FFFFF 24682468 3FFF
Associative
Mapping
Example
Associative Mapping Summary
• Address length = (s + w) bits
• Number of addressable units = 2s+w words
or bytes
• Block size = line size = 2w words or bytes
• Number of blocks in main memory
= 2s+ w/2w = 2s
• Number of lines in cache = undetermined
• Size of tag = s bits
Set Associative Mapping
• Cache is divided into a number of sets
• Each set contains a number of lines
• A given block maps to any line in a given
set
—e.g. Block B can be in any line of set i
• e.g. 2 lines per set
—2 way associative mapping
—A given block can be in one of 2 lines in only
one set
Set Associative Mapping
Address Structure

Word
Tag 9 bit Set 13 bit 2 bit

• Use set field to determine cache set to

look in
• Compare tag field to see if we have a hit
• e.g
—Address Tag Data Set
number
—1FF 7FFC 1FF 12345678 1FFF
—001 7FFC 001 11223344 1FFF
Two Way Set Associative Mapping Example
Replacement Algorithms (1)
Direct mapping
• No choice
• Each block only maps to one line
• Replace that line
Replacement Algorithms (2)
Associative & Set Associative
• Hardware implemented algorithm (speed)
• Least Recently used (LRU)
• e.g. in 2 way set associative
—Which of the 2 block is lru?
• First in first out (FIFO)
—replace block that has been in cache longest
• Least frequently used
—replace block which has had fewest hits
• Random
Write Policy
• Must not overwrite a cache block unless
main memory is up to date
• Multiple CPUs may have individual caches
• I/O may address main memory directly
Write through
• All writes go to main memory as well as
cache
• Multiple CPUs can monitor main memory
traffic to keep local (to CPU) cache up to
date
• Lots of traffic
• Slows down writes
Write back
• Updates initially made in cache only
• Update bit for cache slot is set when
update occurs
• If block is to be replaced, write to main
memory only if update bit is set
• Other caches get out of sync
• I/O must access main memory through
cache
Line Size
• Retrieve not only desired word but a number of
adjacent words as well
• Increased block size will increase hit ratio at first
— the principle of locality
• Hit ratio will decreases as block becomes even
bigger
— Probability of using newly fetched information becomes
less than probability of reusing replaced
• Larger blocks
— Reduce number of blocks that fit in cache
— Data overwritten shortly after being fetched
— Each additional word is less local so less likely to be
needed
• No definitive optimum value has been found
• 8 to 64 bytes seems reasonable
• For HPC systems, 64- and 128-byte most
common
Multilevel Caches
• High logic density enables caches on chip
—Faster than bus access
—Frees bus for other transfers
• Common to use both on and off chip
cache
—L1 on chip, L2 off chip in static RAM
—L2 access much faster than DRAM or ROM
—L2 often uses separate data path
—L2 may now be on chip
—Resulting in L3 cache
– Bus access or now on chip…
Unified v Split Caches
• One cache for data and instructions or
two, one for data and one for instructions
• Advantages of unified cache
—Higher hit rate
– Balances load of instruction and data fetch
– Only one cache to design & implement
• Advantages of split cache
—Eliminates cache contention between
instruction fetch/decode unit and execution
unit
– Important in pipelining

SNES Architecture: Architecture of Consoles: A Practical Analysis, #4
From Everand
SNES Architecture: Architecture of Consoles: A Practical Analysis, #4
Rodrigo Copetti
No ratings yet
Surveying Problems and Solutions PDF Wordpresscom - 59c51eef1723dd2b1c9e659b PDF
50% (2)
Surveying Problems and Solutions PDF Wordpresscom - 59c51eef1723dd2b1c9e659b PDF
2 pages
CAO - Lecutre7 Cache Memory
100% (1)
CAO - Lecutre7 Cache Memory
39 pages
04_Cache Memory
No ratings yet
04_Cache Memory
61 pages
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
No ratings yet
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
53 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
57 pages
4.1 Computer Memory System Overview
No ratings yet
4.1 Computer Memory System Overview
12 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
57 pages
04 - Cache Memory
No ratings yet
04 - Cache Memory
47 pages
Chapter 4 Cache - Memory Willim Sailling
No ratings yet
Chapter 4 Cache - Memory Willim Sailling
71 pages
William Stallings Computer Organization and Architecture 8th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 8th Edition Cache Memory
71 pages
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
No ratings yet
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
53 pages
William Stallings Computer Organization and Architecture 6th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 6th Edition Cache Memory
55 pages
Computer Organization and Architecture: Cache Memory
100% (1)
Computer Organization and Architecture: Cache Memory
57 pages
04 - Cache Memory (Compatibility Mode)
No ratings yet
04 - Cache Memory (Compatibility Mode)
12 pages
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
No ratings yet
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
53 pages
William Stallings Computer Organization and Architecture 8th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 8th Edition Cache Memory
71 pages
Cache Memory
No ratings yet
Cache Memory
39 pages
Unit 1 Part 2 (Chapter 4) Cache Memory
No ratings yet
Unit 1 Part 2 (Chapter 4) Cache Memory
53 pages
William Stallings Computer Organization and Architecture 7th Edition
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition
57 pages
William Stallings Computer Organization and Architecture 8th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 8th Edition Cache Memory
71 pages
William Stallings Computer Organization and Architecture 6th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 6th Edition Cache Memory
54 pages
BiD 05
No ratings yet
BiD 05
97 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
67 pages
Chap 6
No ratings yet
Chap 6
48 pages
William Stallings Computer Organization and Architecture 8th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 8th Edition Cache Memory
72 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
51 pages
Cache Memory
No ratings yet
Cache Memory
61 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
57 pages
04 - Cache Memory PDF
No ratings yet
04 - Cache Memory PDF
71 pages
04 Cache Memory
No ratings yet
04 Cache Memory
75 pages
55-Types of Caches, Caches Misses,-04!03!2025
No ratings yet
55-Types of Caches, Caches Misses,-04!03!2025
64 pages
Computer Arch 06
No ratings yet
Computer Arch 06
41 pages
Cache Memory
No ratings yet
Cache Memory
57 pages
Cache Memory
67% (3)
Cache Memory
72 pages
Cache + Associations Ch-4
No ratings yet
Cache + Associations Ch-4
52 pages
Lecture 7 Cache Memory
No ratings yet
Lecture 7 Cache Memory
44 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
66 pages
CH05
No ratings yet
CH05
56 pages
William Stallings Computer Organization and Architecture 6th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 6th Edition Cache Memory
45 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
64 pages
William Stallings Computer Organization and Architecture: Internal Memory
No ratings yet
William Stallings Computer Organization and Architecture: Internal Memory
60 pages
Lecture 04 IS064
No ratings yet
Lecture 04 IS064
41 pages
Cache Memory CAD
No ratings yet
Cache Memory CAD
16 pages
03-Chap4-Cache Memory Mapping
No ratings yet
03-Chap4-Cache Memory Mapping
24 pages
04 - Cache Memory
No ratings yet
04 - Cache Memory
79 pages
5-Cache Memories-14-02-2025
No ratings yet
5-Cache Memories-14-02-2025
42 pages
Cache Memory-Direct Mapping
0% (1)
Cache Memory-Direct Mapping
30 pages
Ch01-part3-Caches
No ratings yet
Ch01-part3-Caches
32 pages
04 Cache Memory
No ratings yet
04 Cache Memory
36 pages
9 - Cache
No ratings yet
9 - Cache
58 pages
Computer Organization & Architecture: Cache Memory
No ratings yet
Computer Organization & Architecture: Cache Memory
71 pages
CH04
No ratings yet
CH04
46 pages
CH 4.ppt Type I
No ratings yet
CH 4.ppt Type I
60 pages
Lec8 - Caches
No ratings yet
Lec8 - Caches
55 pages
William Stallings Computer Organization and Architecture 9 Edition
No ratings yet
William Stallings Computer Organization and Architecture 9 Edition
46 pages
CO Lec.4
No ratings yet
CO Lec.4
36 pages
Nintendo 64 Architecture: Architecture of Consoles: A Practical Analysis, #8
From Everand
Nintendo 64 Architecture: Architecture of Consoles: A Practical Analysis, #8
Rodrigo Copetti
No ratings yet
PC Engine / TurboGrafx-16 Architecture: Architecture of Consoles: A Practical Analysis, #16
From Everand
PC Engine / TurboGrafx-16 Architecture: Architecture of Consoles: A Practical Analysis, #16
Rodrigo Copetti
No ratings yet
PlayStation 2 Architecture: Architecture of Consoles: A Practical Analysis, #12
From Everand
PlayStation 2 Architecture: Architecture of Consoles: A Practical Analysis, #12
Rodrigo Copetti
No ratings yet
Master System Architecture: Architecture of Consoles: A Practical Analysis, #15
From Everand
Master System Architecture: Architecture of Consoles: A Practical Analysis, #15
Rodrigo Copetti
2/5 (1)
CATIA - Syllabus S21 - 3
No ratings yet
CATIA - Syllabus S21 - 3
4 pages
Journal of Experimental Zoology Part A Comparative Experimental Biology - 2004 - Steinberg - Townes and Holtfreter 1955
No ratings yet
Journal of Experimental Zoology Part A Comparative Experimental Biology - 2004 - Steinberg - Townes and Holtfreter 1955
6 pages
Asking and Giving Information
100% (1)
Asking and Giving Information
9 pages
Shortell, T (2019) Social Types (Simmel)
100% (1)
Shortell, T (2019) Social Types (Simmel)
2 pages
Array
No ratings yet
Array
75 pages
Ch-7 Class - 6
No ratings yet
Ch-7 Class - 6
2 pages
CWS Notes
No ratings yet
CWS Notes
55 pages
Class Test 2 - Some Basic Concepts of Macroeconomics
No ratings yet
Class Test 2 - Some Basic Concepts of Macroeconomics
4 pages
Review Test: I: Choose The Word Which Has Different Sound in The Underlined Part in Each Line
No ratings yet
Review Test: I: Choose The Word Which Has Different Sound in The Underlined Part in Each Line
4 pages
Lecture 5 (Change Management)
No ratings yet
Lecture 5 (Change Management)
55 pages
Parfüm Liste
No ratings yet
Parfüm Liste
12 pages
Regulador EQA-722
No ratings yet
Regulador EQA-722
4 pages
Dating Format PDF Romance (Love) Passion (Emotion)
No ratings yet
Dating Format PDF Romance (Love) Passion (Emotion)
1 page
Quectel BG96 TCPIP at Commands Manual V1.1
No ratings yet
Quectel BG96 TCPIP at Commands Manual V1.1
44 pages
Derek and The Dominos
No ratings yet
Derek and The Dominos
8 pages
Which Are The Sub-Modules in SAP HR?
No ratings yet
Which Are The Sub-Modules in SAP HR?
28 pages
5 LIST OF SERVICES v2 - NTC
No ratings yet
5 LIST OF SERVICES v2 - NTC
7 pages
Free Colors Matching Activities For Toddlers Printable PDF
No ratings yet
Free Colors Matching Activities For Toddlers Printable PDF
11 pages
Lec 1 Th. Parasite
No ratings yet
Lec 1 Th. Parasite
6 pages
IFU0163rK - en Makoto Intravascular Imaging System User Guide TVC MC10
100% (1)
IFU0163rK - en Makoto Intravascular Imaging System User Guide TVC MC10
184 pages
Student Registration Form
No ratings yet
Student Registration Form
5 pages
Why Animals Should Not Be Kept in The Zoo Essay
No ratings yet
Why Animals Should Not Be Kept in The Zoo Essay
1 page
Physical-Layer Cell Identity (PCI) Planning Guidelines
No ratings yet
Physical-Layer Cell Identity (PCI) Planning Guidelines
18 pages
Directory 2024 (1)
100% (1)
Directory 2024 (1)
1,237 pages
Comparitive Anatomy Bipedalism
No ratings yet
Comparitive Anatomy Bipedalism
16 pages
h-cdm8056 7568
No ratings yet
h-cdm8056 7568
39 pages
Fire and Smoke Fire - and - Smoke - Tight - Sliding - Doorstight Sliding Doors Mutli Purpose Doors 86038 en
No ratings yet
Fire and Smoke Fire - and - Smoke - Tight - Sliding - Doorstight Sliding Doors Mutli Purpose Doors 86038 en
32 pages
Boyacioglu-BakingTECH 2022-Measuring Dough Characteristics During Fermentation-Proofing - Final
No ratings yet
Boyacioglu-BakingTECH 2022-Measuring Dough Characteristics During Fermentation-Proofing - Final
54 pages
Rheoluxe® 812 Product Data Sheet
No ratings yet
Rheoluxe® 812 Product Data Sheet
1 page

11 Cache Memory

Uploaded by

11 Cache Memory

Uploaded by

ECEG-4123 Computer Architecture

Tag s-r Line or Slot r Word w

Cache line Main Memory blocks held

• Use set field to determine cache set to

You might also like