Tuning Libraries to Effectively Exploit Memory

About This Presentation

Title:

Description:

Number of Views:25

Avg rating:3.0/5.0

Slides: 12

Provided by: sec62

Learn more at: http://www.cs.tufts.edu

Category:

more less

Transcript and Presenter's Notes

Title: Tuning Libraries to Effectively Exploit Memory

1
Tuning Libraries to Effectively Exploit Memory

2
A Project in Numerical Linear Algebra

An understanding of mathematics (linear algebra)
An understanding the movement of data in the
computer memory to constructefficient algorithms
for solving large-scale linear systems of
equations where the matrices are sparse (have
lots ofzero entries)

3
Storage of Arrays

4
Optimization and BLAS

The idea is to isolate frequently occurring code
into subprograms where it can be optimized
BLAS basic linear algebra subprograms
Original dot code
i 1
for k 1 to n
x(k) b(k)
for j 1 to k-1
x(k) x(k) l(i)x(k)
i i 1
end for j
x(k) x(k)/l(i)
i ip-k
end for k

5
Optimization and BLAS

6
Algorithm

Assuming n, m, and k are divisible by 2, matrices
can be partitioned into blocks such that a matrix
consists of blocks A11, A12, A21, A22. Then, when
multiplying matrices A and B to get matrix C, the
upper left hand block of C A11B11A12B21.

A11
B12
B11
A12
A11B11 A12B21
A12B12 A11B22
B22
B21
A22
A21
A21B12 A22B22
A21B11 A22B21
Matrix A
Matrix B
Matrix C
7
Localities

Locality in Time - The concept that a resource
that is referenced at one point in time will be
referenced again sometime in the near future.
Locality in Space - The concept that likelihood
of referencing a resource is higher if a resource
near it was just referenced.
Cache Coherency - The concept that memory is
accessed sequentially from the cache.

8
Results
9
Problems

Memory access faults with sufficiently large
matrices, potentially due to algorithm.
Relatively small variance in timing, presumably
due to other processes on server

10
Future Work