10 Distributed Shared Memory
How:
- Data moves between main memory and secondary memory (within a node) and between the main memories of different nodes
- Each data object is owned by a node
- The initial owner is the node that created the object
- Ownership can change as the object moves from node to node
- When a process accesses data in the shared address space, the mapping manager maps the shared memory address to physical memory (local or remote)
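As a rough sketch of that mapping step (the names `MappingManager`, `access`, and the page layout are illustrative, not from any real DSM implementation):

```python
# Illustrative sketch of a mapping manager (hypothetical names, not a real DSM API).
PAGE_SIZE = 4096

class MappingManager:
    def __init__(self, node_id, local_pages):
        self.node_id = node_id
        self.local_pages = local_pages   # page number -> bytearray held locally
        self.owner_of = {}               # page number -> owning node, if remote

    def access(self, shared_addr):
        """Map a shared-memory address to local data or a remote request."""
        page, offset = divmod(shared_addr, PAGE_SIZE)
        if page in self.local_pages:             # local hit: read directly
            return ("local", self.local_pages[page][offset])
        owner = self.owner_of.get(page)          # otherwise ask the page's owner
        return ("remote", owner, page, offset)

mm = MappingManager(node_id=0, local_pages={3: bytearray(PAGE_SIZE)})
mm.owner_of[7] = 2
mm.local_pages[3][10] = 42
```

A local access resolves immediately; a non-local access yields the owner to which a remote request would be sent.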
[Figure: DSM architecture — several nodes, each with a local memory and a mapping manager, together presenting a single shared memory]
- Implementation
A timeout is used to resend a request if an acknowledgment fails
Associated sequence numbers can be used to detect duplicate write requests
If an application's request to access shared data fails repeatedly, a failure condition is sent to the application
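The timeout-and-sequence-number mechanism can be sketched as follows (names are illustrative only):

```python
# Sketch of duplicate-write detection via sequence numbers (illustrative names).
class WriteReceiver:
    def __init__(self):
        self.last_seq = {}   # sender id -> highest sequence number applied
        self.applied = []    # log of applied writes

    def handle_write(self, sender, seq, data):
        """Apply a write once; a resent duplicate (same seq) is ignored."""
        if seq <= self.last_seq.get(sender, -1):
            return "duplicate"           # request was resent after a lost ack
        self.last_seq[sender] = seq
        self.applied.append((sender, seq, data))
        return "applied"

r = WriteReceiver()
assert r.handle_write("node1", 0, "x=1") == "applied"
# The ack was lost, so node1's timeout fires and resends the same request:
assert r.handle_write("node1", 0, "x=1") == "duplicate"
assert r.handle_write("node1", 1, "x=2") == "applied"
```

The sender side would simply resend `(sender, seq, data)` unchanged until an acknowledgment arrives, giving up after enough retries with a failure condition.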
- Advantages
Takes advantage of locality of reference
DSM can be integrated with VM at each node
- Make the DSM page a multiple of the VM page size
- A locally held shared-memory page can be mapped into the VM page address space
- If the page is not local, the fault handler migrates the page and removes it from the address space at the remote node
- Issues
Only one node can access a data object at a time
Thrashing can occur: to minimize it, set a minimum time a data object resides at a node
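The minimum-residence-time idea can be sketched as follows (hypothetical names, simplified timing):

```python
# Sketch: a page may migrate again only after a minimum residence time (illustrative).
class Residency:
    def __init__(self, min_residence):
        self.min_residence = min_residence
        self.arrived_at = {}   # page -> time the page arrived at this node

    def page_arrived(self, page, now):
        self.arrived_at[page] = now

    def may_migrate(self, page, now):
        """Honor a remote request only after the page has stayed long enough."""
        return now - self.arrived_at[page] >= self.min_residence

res = Residency(min_residence=100)
res.page_arrived(5, now=0)
assert not res.may_migrate(5, now=50)   # too soon: defer the remote request
assert res.may_migrate(5, now=150)      # the page has resided long enough
```

Deferring (rather than rejecting) the remote request prevents two nodes from bouncing a hot page back and forth on every access.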
Memory coherence
DSM systems are based on
- Replicated shared data objects
- Concurrent access to data objects at many nodes
Coherent memory: the value returned by a read operation is the expected value (e.g., the value of the most recent write)
A mechanism that controls/synchronizes accesses is needed to maintain memory coherence
Sequential consistency: a system is sequentially consistent if
- The result of any execution of the operations of all processors is the same as if they were executed in some sequential order, and
- The operations of each processor appear in this sequence in the order specified by its program
General consistency:
- All copies of a memory location (replicas) eventually contain the same data once all writes issued by every processor have completed
Weak consistency:
- Memory is consistent only (immediately) after a synchronization operation
- A regular data access can be performed only after all previous synchronization accesses have completed
Release consistency:
- A further relaxation of weak consistency
- Synchronization operations must be consistent with each other only within a processor
- Synchronization operations: Acquire (i.e., lock), Release (i.e., unlock)
- Sequence: Acquire, regular accesses, Release
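A toy model of release consistency follows (illustrative only; real systems buffer writes in the memory system, not in application code):

```python
# Sketch of release consistency: regular writes are buffered locally and
# become visible to other nodes only at Release (illustrative model).
class RCNode:
    def __init__(self, shared):
        self.shared = shared     # dict standing in for the coherent shared store
        self.pending = {}        # local writes not yet visible to others

    def acquire(self):
        self.pending.clear()     # lock acquired; start a new critical section

    def write(self, key, value):
        self.pending[key] = value            # regular access: buffer locally

    def release(self):
        self.shared.update(self.pending)     # propagate all writes at unlock
        self.pending.clear()

shared = {}
n = RCNode(shared)
n.acquire()
n.write("x", 1)
assert "x" not in shared        # not yet visible: Release has not happened
n.release()
assert shared["x"] == 1         # visible after the synchronization operation
```

Batching propagation at Release is what lets the protocol send fewer, larger coherence messages than per-write invalidation or update.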
Coherence Protocols
Issues
- How do we ensure that all replicas have the same information?
- How do we ensure that nodes do not access stale data?
1. Write-invalidate protocol
- A write to shared data invalidates all copies except one before the write executes
- Invalidated copies are no longer accessible
- Advantage: good performance for
  - Many updates between reads
  - Per-node locality of reference
- Disadvantage
  - Invalidations are sent to all nodes that have copies
  - Inefficient if many nodes access the same object
- Examples: most DSM systems: IVY, Clouds, Dash, Memnet, Mermaid, and Mirage
2. Write-update protocol
- A write to shared data causes all copies to be updated (the new value is sent, instead of an invalidation)
- More difficult to implement
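To make the contrast concrete, here is a minimal sketch of both protocols over a dictionary of replicas (function and variable names are illustrative, not taken from any of the systems named above):

```python
# Sketch contrasting write-invalidate and write-update (illustrative names).
def write_invalidate(replicas, writer, value):
    """All copies except the writer's are invalidated before the write."""
    for node in list(replicas):
        if node != writer:
            del replicas[node]       # invalidated copies become inaccessible
    replicas[writer] = value

def write_update(replicas, writer, value):
    """The new value is sent to every copy instead of an invalidation."""
    for node in replicas:
        replicas[node] = value

inv = {"A": 1, "B": 1, "C": 1}
write_invalidate(inv, "A", 2)
assert inv == {"A": 2}               # only the writer's copy survives

upd = {"A": 1, "B": 1, "C": 1}
write_update(upd, "A", 2)
assert upd == {"A": 2, "B": 2, "C": 2}   # every replica holds the new value
```

Invalidation pays one message per copy but then lets the writer proceed alone; update keeps all copies readable at the cost of shipping every new value to every replica.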
Design issues
Granularity: size of shared memory unit
- If the DSM page size is a multiple of the local virtual memory (VM) page size (supported by hardware), then DSM can be integrated with VM, i.e., it can use the VM page-handling machinery
- Advantages vs. disadvantages of a large page size:
  - (+) Exploits locality of reference
  - (+) Less overhead in page transport
  - (-) More contention for a page by many processes
- Examples
PLUS: page size 4 Kbytes; unit of memory access is a 32-bit word
Clouds, Munin: an object is the unit of shared data
Node mapping manager: maps between the local memory of its node and the shared virtual memory space
Memory access operation
- On a page fault, block the process
- If the page is local, fetch it from secondary memory
- If not local, issue a remote memory access request and acquire the page
Write sequence
- Processor i has a write fault on page p
- Processor i finds the owner of page p and sends a request
- The owner of p sends the page and its copyset to i and marks its page-table entry for p nil (copyset = list of processors holding a read-only copy of the page)
- Processor i sends invalidation messages to all processors in the copyset
Read sequence
- Processor i has a read fault on page p
- Processor i finds the owner of page p
- The owner of p sends a copy of the page to i and adds i to the copyset of p; processor i has read-only access to p
On a write, the central manager sends invalidation messages to all processors in the copyset
- Performance issues
  - Two messages are required to locate the page owner
  - On writes, invalidation messages are sent to all processors in the copyset
  - The centralized manager can become a bottleneck
Note: In both the centralized and fixed distributed manager schemes, if two or more concurrent accesses to the same page are requested, the requests are serialized by the manager
When a processor has a page fault, it sends a page request to the processor i indicated by its probOwner field
If processor i is the true owner of the page, fault handling proceeds as in the centralized scheme
If i is not the owner, it forwards the request to the processor indicated in its own probOwner field
This continues until the true owner of the page is found
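The probOwner chase can be sketched as follows (a toy model of the dynamic distributed manager scheme; names are hypothetical):

```python
# Sketch of locating the true owner via probOwner hints (illustrative).
def find_owner(prob_owner, start):
    """Follow probOwner hints from `start` until the true owner is reached.

    A processor is the true owner when its own hint points to itself.
    Returns (owner, number of forwarding hops).
    """
    hops = 0
    node = prob_owner[start]
    while prob_owner[node] != node:      # hint is stale: forward the request
        node = prob_owner[node]
        hops += 1
    return node, hops

# Stale hints form a chain: 0 thinks 1 owns the page, 1 thinks 2, 2 owns it.
prob_owner = {0: 1, 1: 2, 2: 2}
assert find_owner(prob_owner, 0) == (2, 1)
```

In the full scheme, each processor on the chain also updates its probOwner hint toward the requester or new owner, which collapses long chains over time.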
Advantages
Benefits from locality of reference
Decreases thrashing
For remote object invocations, the DSM mechanism transfers the required segments to the requesting host
On a segment fault, a location system object is consulted to locate the object
The location system object broadcasts a query for each locate operation
The actual data transfer is done by the distributed shared memory controller (DSMC)