Title: CS575 Parallel Processing
1. CS575 Parallel Processing
- Lecture 3: Interconnection Networks
- Wim Bohm, CSU
Except as otherwise noted, the content of this
presentation is licensed under the Creative
Commons Attribution 2.5 license.
2. Interconnection networks
- Connect
- Processors, memories, I/O devices
- Dynamic interconnection networks
- Connect any to any using switches or busses
- Two types of switches
- On / off: 1 input, 1 output
- Pass through / cross over: 2 inputs, 2 outputs
- Static interconnection networks
- Connect point to point using wires
3. Dynamic Interconnection Network: Crossbar
- Connects e.g. p processors to b memories
- p × b matrix
- p horizontal lines, b vertical lines
- Cross points: on/off switches
- Only one switch on per (row, column) pair
- Non-blocking: Pi to Mj does not block Pl to Mk
- Very costly, does not scale well
- p × b switches, complex timing and checking
4. Dynamic Interconnection Network: Bus
- Connects processors, memories, I/O devices
- A master can issue a request to get the bus
- A slave can respond to a request, once the bus is granted
- If there are multiple masters, we need an arbiter
- Sequential
- Only one communication at a time
- Bottleneck
- But simple and cheap
5. Crossbar vs bus
- Crossbar
- Scalable in performance
- Not scalable in hardware complexity
- Bus
- Not scalable in performance
- Scalable in hardware complexity
- Compromise: multistage network
6. Multi-stage network
- Connects n components to each other
- Usually built from O(n log n) 2x2 switches
- Cheaper than a crossbar
- Faster than a bus
- Many topologies
- e.g. Omega (book fig 2.12), Butterfly, ...
7. Static Interconnection Networks
- Fixed wires (channels) between devices
- Many topologies
- Completely connected
- n(n-1)/2 channels
- Static counterpart of crossbar
- Star
- One central PE for message passing
- Static counterpart of bus
- Multistage network with PE at each switch
8. More topologies
- Necklace or ring
- Mesh / Torus
- 2D, 3D
- Trees
- Fat tree
- Hypercube
- 2^n nodes in an n-D hypercube
- n links per node in an n-D hypercube
- Addressing: 1 bit per dimension
9. Hypercube
- Two connected nodes differ in one bit
- An n-D hypercube can be divided into
- 2 (n-1)-D cubes, in n ways
- 4 (n-2)-D cubes
- 8 (n-3)-D cubes
- To get from node s to node t
- follow the path determined by the differing bits
- E.g. 01100 → 11000: 01100 → 11100 → 11000
- Question: how many (simple) paths from one node to another?
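The bit-flipping rule above can be sketched in a few lines of Python (a minimal illustration, not from the slides; the function name is made up). Each shortest path flips the differing bits of s and t in some order, so the shortest paths correspond to permutations of those bit positions:

```python
from itertools import permutations

def shortest_paths(s, t):
    """Enumerate all shortest paths from node s to node t in a hypercube.

    A shortest path flips each differing bit exactly once, so the paths
    correspond to the orderings (permutations) of the differing positions.
    """
    diff = [b for b in range(max(s, t).bit_length()) if (s ^ t) >> b & 1]
    paths = []
    for order in permutations(diff):
        path, node = [s], s
        for b in order:
            node ^= 1 << b          # flip one differing bit per hop
            path.append(node)
        paths.append(path)
    return paths

# The slide's example: 01100 -> 11000 (bits 2 and 4 differ, so 2! = 2 paths)
for p in shortest_paths(0b01100, 0b11000):
    print([format(n, "05b") for n in p])
```

With d differing bits there are d! shortest paths; the slide's question about all simple paths is left open.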
10. Measures of static networks
- Diameter
- Maximal shortest path between two nodes
- Ring: ⌊p/2⌋, hypercube: log(p)
- 2D wraparound mesh: 2⌊sqrt(p)/2⌋
- Connectivity
- Measure of the multiplicity of paths between nodes
- Arc connectivity
- Minimum number of arcs to be removed to create two disconnected networks
- Ring: 2, hypercube: log(p), mesh: 2, wraparound mesh: 4
11. More measures
- Bisection width
- Minimal number of arcs to be removed to partition the network into two (off by at most one node) equal halves
- Ring: 2, complete binary tree: 1, 2D mesh: sqrt(p)
- Question: bisection width of a hypercube?
- Channel width
- Bits communicated simultaneously over a channel
- Channel rate / bandwidth
- Peak communication rate (bits/second)
- Bisection bandwidth
- Bisection width × channel bandwidth
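The diameter formulas above can be checked by brute force on small instances. A minimal sketch (not from the slides; helper names are made up) builds each topology as a neighbor function and runs BFS from every node:

```python
from collections import deque

def diameter(n, neighbors):
    """Longest shortest path over all node pairs, via BFS from each node."""
    best = 0
    for src in range(n):
        dist = {src: 0}
        q = deque([src])
        while q:
            u = q.popleft()
            for v in neighbors(u):
                if v not in dist:
                    dist[v] = dist[u] + 1
                    q.append(v)
        best = max(best, max(dist.values()))
    return best

p = 16
ring = lambda u: [(u - 1) % p, (u + 1) % p]
cube = lambda u: [u ^ (1 << b) for b in range(4)]   # 4-D hypercube, p = 2^4
side = 4                                            # 4x4 wraparound mesh
torus = lambda u: [((u // side + d) % side) * side + u % side for d in (-1, 1)] + \
                  [(u // side) * side + (u % side + d) % side for d in (-1, 1)]

print(diameter(p, ring))    # floor(p/2) = 8
print(diameter(p, cube))    # log2(p) = 4
print(diameter(p, torus))   # 2*floor(sqrt(p)/2) = 4
```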
12. Summary of measures (p nodes)
The textbook mentions bisection width of a star
as 1, but the only way to split a star into
(almost) equal halves is by cutting half of its
links.
13. Meshes and Hypercubes
- Mesh
- Buildable, scalable, cheaper than hypercubes
- Many (e.g. grid) applications map naturally
- Cut-through routing works well in meshes
- Commercial systems based on it
- Hypercube
- Recursive structure nice for algorithm design
- Often same O complexity as PRAMs
- Often a hypercube algorithm is also good for other topologies, so a good starting point
14. Embedding
- Relationship between two networks
- Studied by mapping one into the other
- Why?
- G(V,E) → G'(V',E')
- graph G, G'; vertices V, V'; edges E, E'
- Map E → E', V → V'
- Congestion of k: k (> 1) edges of E map to one edge of E'
- Dilation of k: 1 edge of E maps to k edges of E'
- Expansion: |V'| / |V|
- Often we want congestion = dilation = expansion = 1
15. Ring into hypercube
- Number the nodes of the ring s.t.
- the Hamming distance between two adjacent nodes is 1
- Gray code provides such a numbering
- Can be built recursively: binary reflected Gray code
- 2 nodes: 0 1, OK
- 2^k nodes:
- take the Gray code for 2^(k-1) nodes
- concatenate it with the reflected Gray code for 2^(k-1) nodes
- put 0 in front of the first batch, 1 in front of the second
- A mesh can be embedded into a hypercube
- (Toroidal) mesh = rings of rings
16. Ring to hypercube, cont.
- Reflected Gray codes for 1, 2, and 3 bits:
- 0   00   000
- 1   01   001
-     11   011
-     10   010
-          110
-          111
-          101
-          100
- Ring node i maps to cube node G(i,dim), with G(0,1) = 0, G(1,1) = 1
- G(i, x+1) = 0·G(i,x)              if i < 2^x
- G(i, x+1) = 1·G(2^(x+1)-i-1, x)   if i ≥ 2^x
- (· is concatenation)
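The recursive definition can be transcribed directly (a minimal sketch, not from the slides; the function name mirrors the slide's G):

```python
def G(i, x):
    """i-th codeword of the x-bit binary reflected Gray code, as a string.

    First half: prefix 0 and recurse; second half: prefix 1 and recurse
    on the reflected index, exactly as in the recursion on the slide.
    """
    if x == 1:
        return "01"[i]
    if i < 1 << (x - 1):
        return "0" + G(i, x - 1)
    return "1" + G(2 ** x - i - 1, x - 1)

codes = [G(i, 3) for i in range(8)]
print(codes)   # ['000', '001', '011', '010', '110', '111', '101', '100']
```

Cyclically adjacent codewords differ in exactly one bit, which is what makes the ring embedding work.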
17. 2D Mesh into hypercube
- Note: in a 2D wraparound mesh
- rows are rings
- columns are rings
- 2^r × 2^s wraparound mesh into an (r+s)-D cube
- Map node (i,j) onto cube node G(i,r)·G(j,s)
- Each row coincides with a sub-cube
- Each column coincides with a sub-cube
- s.t. if adjacent in the mesh then adjacent in the cube
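The mapping can be checked mechanically (a minimal sketch, not from the slides; helper names are made up, and the Gray code uses the standard closed form i XOR i>>1):

```python
def gray(i, bits):
    """i-th binary reflected Gray codeword as a bit string of given width."""
    return format(i ^ (i >> 1), f"0{bits}b")

def embed(i, j, r, s):
    """Map node (i, j) of a 2^r x 2^s wraparound mesh into an (r+s)-cube."""
    return gray(i, r) + gray(j, s)

# Check: every mesh edge maps to a cube edge (labels differ in one bit)
r = s = 2
for i in range(1 << r):
    for j in range(1 << s):
        here = embed(i, j, r, s)
        for ni, nj in [((i + 1) % (1 << r), j), (i, (j + 1) % (1 << s))]:
            there = embed(ni, nj, r, s)
            assert sum(a != b for a, b in zip(here, there)) == 1
print("every mesh edge maps to a cube edge")
```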
18. Complete binary tree into hypercube
- Map the tree root to any cube node
- Left child: same node as parent
- Right child at level j: invert bit j of the parent's node
- Resulting labels, level by level:
- 000
- 000 001
- 000 010 001 011
- 000 100 010 110 001 101 011 111
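The level-by-level labels above can be generated by a short sketch (not from the slides; the function name is made up, and it assumes the right child at depth d flips bit d-1, matching the table):

```python
def tree_levels(depth):
    """Labels of a complete binary tree embedded in a hypercube, per level.

    Root maps to node 0; a left child keeps its parent's label, and the
    right child at depth d flips bit d-1, so each level uses one new bit.
    """
    levels = [[0]]
    for d in range(1, depth + 1):
        nxt = []
        for parent in levels[-1]:
            nxt += [parent, parent ^ (1 << (d - 1))]
        levels.append(nxt)
    return levels

for level in tree_levels(3):
    print([format(n, "03b") for n in level])
```

Note the left-child rule maps two tree nodes onto one cube node, so this embedding trades congestion for dilation 1 on the right edges.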
19. Routing Mechanisms
- Determine all source → destination paths
- Minimal: a shortest path
- Deterministic: one path per (src,dst) pair
- Mesh: dimension-ordered (XY) routing
- Cube: E-cube routing
- Send along the least significant 1 bit in src XOR dst
- Adaptive: many paths per (src,dst) pair
- Minimal: only shortest paths
- Why adaptive? Discuss.
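The deterministic cube rule above fits in a few lines (a minimal sketch, not from the slides; the function name is made up):

```python
def e_route(src, dst):
    """E-cube route in a hypercube: repeatedly flip the least significant
    bit in which the current node still differs from the destination."""
    path, node = [src], src
    while node != dst:
        diff = node ^ dst
        node ^= diff & -diff        # isolate and flip the lowest set bit
        path.append(node)
    return path

print([format(n, "05b") for n in e_route(0b01100, 0b11000)])
# 01100 -> 01000 -> 11000: one fixed shortest path per (src,dst) pair
```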
20. Routing (communication) Costs
- Three factors
- Startup time at source (ts)
- OS, buffers, error correction info, routing algorithm
- Hop time (th)
- The time it takes to get from one PE to the next
- Also called node latency
- Word transfer time (tw)
- Inverse of channel bandwidth
21. Two routing (switching) techniques
- Store and Forward: O(m·l)
- Strict: the whole message travels from PE to PE
- m words, l links
- tcomm = ts + (m·tw + th)·l
- Often th is much less than m·tw, so tcomm ≈ ts + m·l·tw
- Cut-through: O(m+l)
- Non-strict: the message is broken into flits (packets)
- Flits are pipelined through the network
- tcomm = ts + l·th + m·tw
- Circular paths + finite flit buffers can give rise to deadlock
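The two cost formulas can be compared numerically (a minimal sketch, not from the slides; the parameter values are made up for illustration):

```python
def store_and_forward(ts, th, tw, m, l):
    """t_comm = ts + (m*tw + th) * l: each of the l links forwards the
    whole m-word message before the next link starts."""
    return ts + (m * tw + th) * l

def cut_through(ts, th, tw, m, l):
    """t_comm = ts + l*th + m*tw: flits are pipelined, so link count and
    message length contribute additively rather than multiplicatively."""
    return ts + l * th + m * tw

# Illustrative (made-up) parameters: a 100-word message over 10 links
ts, th, tw, m, l = 50.0, 2.0, 1.0, 100, 10
print(store_and_forward(ts, th, tw, m, l))   # 50 + (100*1 + 2)*10 = 1070.0
print(cut_through(ts, th, tw, m, l))         # 50 + 10*2 + 100*1  = 170.0
```

The gap grows with both m and l, which is why cut-through is O(m+l) while store-and-forward is O(m·l).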