Transcript and Presenter's Notes

Title: HPC 01


1
HPC 01
  • Communication Models, Speedup and Scalability
  • Schoenauer, sec. 8.2 and 8.4

2
Message Passing Time
  • To send l bytes:
  • t_comm(l) = t_startup + (h-1)·t_start-hop + (l + l_0)·t_send + t_block
  • t_startup: total time spent setting up the communication
  • t_start-hop: time for switching each hop in wormhole routing
  • h: number of hops; l: number of bytes to transfer
  • l_0: extra header bytes that are also moved
  • t_send: time to actually transfer 1 byte
  • t_block: time spent in blocked messages en route
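As a quick illustration of this cost model (a sketch only; the timing constants below are invented placeholders, not values from the slides):

    /* Evaluate the message-passing cost model t_comm(l). */
    #include <stdio.h>

    double t_comm(double l,           /* bytes to transfer       */
                  double h,           /* number of hops          */
                  double t_startup,   /* setup time              */
                  double t_start_hop, /* per-hop switching time  */
                  double l0,          /* extra header bytes      */
                  double t_send,      /* time per byte           */
                  double t_block)     /* time lost to blocking   */
    {
        return t_startup + (h - 1.0) * t_start_hop
             + (l + l0) * t_send + t_block;
    }

    int main(void)
    {
        /* 1 KB message over 3 hops with illustrative constants */
        double t = t_comm(1024.0, 3.0, 50e-6, 1e-6, 16.0, 5e-9, 0.0);
        printf("t_comm = %g s, effective speed = %g MB/s\n",
               t, 1024.0 / t / 1e6);
        return 0;
    }

Even with these optimistic made-up numbers, the startup term dominates a 1 KB message, which is exactly the "actual speed well below the hardware limit" effect of the next slide.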

3
Communication Model
  • Speed = l / t_comm; the actual speed is far below (<<) the
    advertised theoretical hardware limit
  • Consequences:
  • Send messages in blocks -- avoid many small single messages
    (see the sketch below)
  • Arrange data distributions to get nearest-neighbor
    communication, e.g. use a ring shift with direct neighbors
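A minimal sketch of the first consequence, assuming an MPI setting in C; the message count and function names are illustrative. Packing N values into one send pays t_startup once, while N single-element sends pay it N times:

    /* Many small sends vs. one blocked send (illustration only). */
    #include <mpi.h>

    #define N 1000   /* illustrative message count */

    void send_small(double *data, int dest)
    {
        for (int i = 0; i < N; i++)   /* N messages: N * t_startup overhead */
            MPI_Send(&data[i], 1, MPI_DOUBLE, dest, 0, MPI_COMM_WORLD);
    }

    void send_blocked(double *data, int dest)
    {
        /* one message: t_startup paid once for the same payload */
        MPI_Send(data, N, MPI_DOUBLE, dest, 0, MPI_COMM_WORLD);
    }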

4
Communication Model
  • Program with logical processor numbers rather than physical
    ones, so the mapping onto the hardware can be chosen freely
    (a sketch follows below)
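A minimal sketch of this idea in MPI: a periodic 1-D Cartesian communicator gives each process logical neighbor ranks for a ring shift, and reorder = 1 lets the library map the logical ranks onto nearby physical processors. The communicator name and payload are made up for illustration:

    /* Ring shift with logical processor numbers via a Cartesian topology. */
    #include <mpi.h>

    int main(int argc, char **argv)
    {
        MPI_Init(&argc, &argv);

        int nprocs, dims[1] = {0}, periods[1] = {1};
        MPI_Comm ring;
        MPI_Comm_size(MPI_COMM_WORLD, &nprocs);
        MPI_Dims_create(nprocs, 1, dims);
        MPI_Cart_create(MPI_COMM_WORLD, 1, dims, periods, 1 /* reorder */, &ring);

        /* logical left/right neighbors for a nearest-neighbor ring shift */
        int left, right;
        MPI_Cart_shift(ring, 0, 1, &left, &right);

        double out = 1.0, in;
        MPI_Sendrecv(&out, 1, MPI_DOUBLE, right, 0,
                     &in,  1, MPI_DOUBLE, left,  0, ring, MPI_STATUS_IGNORE);

        MPI_Comm_free(&ring);
        MPI_Finalize();
        return 0;
    }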

5
Communication Model
  • Latency hiding: use asynchronous messaging to overlap
    communication and computation (MPI_ISEND, MPI_IRECV)
  • Domain decomposition for grid problems: compute the boundary
    values first, start communicating them, and work on the rest
    while the messages are in flight (see the sketch below)
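A minimal sketch of latency hiding with non-blocking MPI for a 1-D halo exchange; compute_interior and compute_boundary are hypothetical placeholders, and the interior update must not touch the halo data for the overlap to be valid:

    /* Overlap communication and computation with MPI_Isend / MPI_Irecv. */
    #include <mpi.h>

    static void compute_interior(void)            { /* needs no halo data */ }
    static void compute_boundary(const double *h) { (void)h; /* uses halo */ }

    void exchange_halo_and_compute(double *halo_out, double *halo_in, int n,
                                   int neighbor, MPI_Comm comm)
    {
        MPI_Request req[2];

        /* start the halo exchange asynchronously */
        MPI_Irecv(halo_in,  n, MPI_DOUBLE, neighbor, 0, comm, &req[0]);
        MPI_Isend(halo_out, n, MPI_DOUBLE, neighbor, 0, comm, &req[1]);

        compute_interior();                        /* overlaps with transfer */

        MPI_Waitall(2, req, MPI_STATUSES_IGNORE);  /* halo has now arrived   */
        compute_boundary(halo_in);
    }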

6
Amdahl's Law
  • Consider the execution of a program on p processors, and let
    the fraction q (0 < q < 1) of the operations be
    parallelizable. The maximum speedup is
  • s_p,false = t_1 / t_p = 1 / (q/p + (1 - q))
  • Indicates the rapid loss of speedup as p increases if the
    parallel fraction is not high enough
  • To get 50% efficiency, i.e. a speedup of 256 on 512
    processors, requires q ≈ 0.998 (see the check below)
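A quick numeric check of the last bullet (a sketch, not from the slides): solving 1 / (q/p + 1 - q) = 256 for p = 512 gives q = (1 - 1/256) · 512/511 ≈ 0.998.

    /* Verify that q ~ 0.998 gives 50% efficiency on 512 processors. */
    #include <stdio.h>

    double amdahl(double q, double p) { return 1.0 / (q / p + (1.0 - q)); }

    int main(void)
    {
        double p = 512.0;
        double q = (1.0 - 1.0 / 256.0) * p / (p - 1.0);   /* ~0.998044 */
        printf("q = %.6f  speedup = %.1f  efficiency = %.1f%%\n",
               q, amdahl(q, p), 100.0 * amdahl(q, p) / p);
        return 0;
    }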

7
Amdahl's Law
8
Amdahl's Law
  • Why "false" in the speedup?
  • It assumed that the number of operations is the same for the
    sequential and the parallel version -- usually the algorithms
    and data structures differ
  • It did not account for the cost of parallelization --
    communication and synchronization costs!
  • It assumed that per-processor performance does not change
    between sequential and parallel code (different vector
    lengths, ...)

9
Speedup (honest)
  • s_p,hon = t_1 of the best sequential algorithm / t_p of the
    real parallel algorithm
  • In closed form, s_p,hon = t_1 / (t_1/p + h_bas + h_p)
    (a complex form, difficult to use)
  • h_p: communication time that depends on p
  • As p → ∞, t_1/p vanishes but the overhead h_p keeps growing,
    so s_p,hon → 0

10
Scalability
  • There is an optimal number of processors for each problem
    (see the sketch below)
  • Keeping the problem size fixed while increasing the number of
    processors is a poor use of a parallel machine
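A sketch of this fixed-size behavior, using the honest-speedup form from slide 9 with an assumed overhead h_p = c·p; all constants are invented for illustration. The speedup rises, peaks at a finite p, and then decays:

    /* Honest speedup for a fixed problem size under an assumed overhead model. */
    #include <stdio.h>

    int main(void)
    {
        const double t1    = 100.0;   /* time of the best sequential algorithm */
        const double h_bas = 0.05;    /* p-independent overhead                */
        const double c     = 0.002;   /* per-processor communication overhead  */

        for (int p = 1; p <= 4096; p *= 2) {
            double tp = t1 / p + h_bas + c * p;   /* parallel time model */
            printf("p = %4d  s_hon = %6.1f\n", p, t1 / tp);
        }
        return 0;   /* the speedup peaks at a finite p, then drops toward zero */
    }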

11
Scalability
  • Increasing the problem size along with the number of
    processors leads to better use of the parallel machine

12
Scalability
  • Now let the problem size m → ∞ as p → ∞

13
Scalability
  • Thus scalability, not speedup, is the desired measure of a
    parallel algorithm/code!
  • Scalability is achieved if the quantity
  • h_p · p / m is constant or increases only very slowly as p
    increases (see the sketch below)
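A closing sketch (with invented constants) of this condition: if m grows with p so that h_p·p/m stays constant, the efficiency t_1/(p·t_p) settles at a constant value instead of decaying, using the honest time model from slide 9 with an assumed h_p ∝ p:

    /* Scalability check: keep h_p*p/m constant by growing the problem size. */
    #include <stdio.h>

    static double h(double p) { return 0.2 * p; }   /* assumed overhead h_p */

    int main(void)
    {
        const double work_per_item = 1e-3;   /* t1 = work_per_item * m */
        const double h_bas = 0.05;           /* p-independent overhead */
        const double m0 = 1000.0;

        for (int p = 1; p <= 4096; p *= 4) {
            double m  = m0 * (double)p * p;  /* chosen so h_p*p/m stays constant */
            double t1 = work_per_item * m;
            double tp = t1 / p + h_bas + h(p);
            printf("p = %4d  h_p*p/m = %.1e  efficiency = %.3f\n",
                   p, h(p) * p / m, t1 / (p * tp));
        }
        return 0;   /* efficiency approaches a constant (~0.83 here) */
    }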