05 - Lecture #5 - 6
05 - Lecture #5 - 6
Computing
LECTURE #5&6
1
Agenda
oPhysical Organization of Parallel Platforms.
oPRAM
o Interconnection Networks
2
Parallel Computing Platform
3
1- PRAM
o The RAM contains a single processor operates under control of a sequential
algorithm. One instruction at a time is issued.
o The processor can load/store data from/to memory and can perform basic
arithmetic and logical operations.
4
Parallel Random Access Machine (PRAM)
o Group of p processors
o Processors share a common clock but may execute different
instructions in each cycle.
oThe PRAM was designed so that the user could design and analyze
parallel algorithms without concern for communication (either
between processors and memory or within sets of processors).
5
❖During the read phase,
o All P processors have the opportunity to read simultaneously a piece of data from
a memory location.
o Each processor places the data item into one of its registers.
6
Memory Access (Resolving Data Access Conflicts)
❖ If two processors are trying to read from the same memory location,
should only one succeed? If so, which?
7
Exclusive Read (ER) Concurrent read (CR)
only one processor is allowed to Multiple processors are allowed to
read from a given memory read from the same memory
location during a cycle. location during a clock cycle.
Sum: Write the sum (or any associative operator) of all data items.
9
PRAM Model… Is it Practical??
11
❖ Interconnection networks can be classified as static or dynamic.
❖ Static networks:
✓ Consist of point-to-point communication links among processing nodes.
❖ Dynamic networks:
✓ Are built using switches and communication links.
13
14
Network Topologies
15
A- Static Interconnection Networks
Evaluating Static Interconnection Networks
❖Diameter: The maximum distance between any two processing nodes in
the network. (number of hops through which a message in transferred on
its way from one point to another )
❖Bisection Width: The minimum number of wires you must cut to divide
the network into two equal parts.
❖Cost: The number of links or switches besides the length of wires, etc., are
factors in to the cost.
16
1. PRAM
2.Interconnection networks
❖ Static Network
▪ Topology
▪ Evaluation of networks
❖ Dynamic Network
▪ Topology
▪ Evaluation of networks
❖Cache Coherence in Multiprocessor Systems
17
A- Static Interconnection Networks
1. Complete network (clique)
2. Star network
3. Linear array
4. Ring
5. Tree
6. 2D & 3D mesh/torus
7. Hypercube
8. Fat tree
18
A- Static Interconnection Networks
1- Completely Connected
❖ Each processor is connected to every other processor.
19
A- Static Interconnection Networks
2-Star
❖ Every node is connected only to a common node at the center.
20
A- Static Interconnection Networks
3- linear
❖ Each node has two neighbors, one to its left and one to its right.
4- Ring (1D)
❖ It is linear but the nodes at either end are connected.
21
A- Static Interconnection Networks
5- 2D & 3D mesh
❖ Has nodes with 4 neighbors, to the north, south, east, and west.
22
A- Static Interconnection Networks
6.Hypercubes
❖A special case of a d-dimensional mesh is a hypercube.
26
Static Interconnection Networks
1. Complete network (clique)
2. Star network none of them is very practical due to some
3. Linear array reasons “cost ..”
4. Ring
5. Tree
6. 2D & 3D mesh/torus
7. Hypercube
8. Fat tree
27
1. PRAM
2.Interconnection networks
❖ Static Network
▪ Topology
▪ Evaluation of networks
❖ Dynamic Network
▪ Topology
▪ Evaluation of networks
❖Cache Coherence in Multiprocessor Systems
28
A- Static Interconnection Networks
Evaluating Static Interconnection Networks
❖Diameter: The maximum distance between any two processing nodes in
the network. (number of hops through which a message in transferred on
its way from one point to another )
❖Bisection Width: The minimum number of wires you must cut to divide
the network into two equal parts.
❖Cost: The number of links or switches besides the length of wires, etc., are
factors in to the cost.
29
Calculate it A- Static Interconnection Networks
30
1. PRAM
2.Interconnection networks
❖ Static Network
▪ Topology
▪ Evaluation of networks
❖ Dynamic Network
▪ Topology
▪ Evaluation of networks
31
B- Dynamic Interconnection Networks
2. Crossbar network
3. Multistage networks
32
B- Dynamic Interconnection Network
1.Bus based networks
❖ Some of the simplest and earliest parallel machines used
buses.
❖ However, the bandwidth of the shared bus is a major (a) with no local caches;
bottleneck
(scalable in terms of cost and non scalable in terms of
performance).
2.Crossbar networks
❖ Simple way to connect P processor to b
memory banks
❖ Total number of switching nodes required
to implement such network is O(pb).
❖ As the number of processing nodes
becomes large, this switch complexity is
difficult to realize at high data rates.
34
B- Dynamic Interconnection Network
3- Multistage networks
❖ Crossbars have excellent performance scalability but poor cost scalability.
❖It is more scalable than the bus in terms of performance and more scalable
than the crossbar in terms of cost.
36
B- Dynamic Interconnection Networks
37
B- Dynamic Interconnection Network
39
1. PRAM
2.Interconnection networks
❖ Static Network
▪ Topology
▪ Evaluation of networks
❖ Dynamic Network
▪ Topology
▪ Evaluation of networks
40
B- Dynamic Interconnection Network
41
1. PRAM
2.Interconnection networks
❖ Static Network
▪ Topology
▪ Evaluation of networks
❖ Dynamic Network
▪ Topology
▪ Evaluation of networks
42