Parallel Sparse Operations in Matlab: Exploring Large Graphs - PowerPoint PPT Presentation

About This Presentation
Title:

Parallel Sparse Operations in Matlab: Exploring Large Graphs

Description:

Parallel Sparse Operations in Matlab: Exploring Large Graphs John R. Gilbert University of California at Santa Barbara Aydin Buluc (UCSB) Brad McRae (NCEAS) – PowerPoint PPT presentation

Number of Views:121
Avg rating:3.0/5.0
Slides: 36
Provided by: John61
Category:

less

Transcript and Presenter's Notes

Title: Parallel Sparse Operations in Matlab: Exploring Large Graphs


1
Parallel Sparse Operations in Matlab Exploring
Large Graphs

John R. Gilbert University of California at Santa
Barbara Aydin Buluc (UCSB) Brad McRae
(NCEAS) Steve Reinhardt (Interactive
Supercomputing) Viral Shah (ISC UCSB) with
thanks to Alan Edelman (MIT ISC) and Jeremy
Kepner (MIT-LL)
Support DOE, NSF, DARPA, SGI, ISC
2
3D Spectral Coordinates
3
2D Histogram RMAT Graph
4
Strongly Connected Components
5
Social Network Analysis in Matlab 1993
Co-author graph from 1993 Householdersymposium
6
Combinatorial Scientific Computing
  • Emerging large scale, high-performance
    applications
  • Web search and information retrieval
  • Knowledge discovery
  • Computational biology
  • Dynamical systems
  • Machine learning
  • Bioinformatics
  • Sparse matrix methods
  • Geometric modeling
  • . . .
  • How will combinatorial methods be used by
    nonexperts?

7
Outline
  • Infrastructure Array-based sparse graph
    computation
  • An application Computational ecology
  • Some nuts and bolts Sparse matrix multiplication

8
MatlabP
  • A rand(4000p, 4000p)
  • x randn(4000p, 1)
  • y zeros(size(x))
  • while norm(x-y) / norm(x) gt 1e-11
  • y x
  • x Ax
  • x x / norm(x)
  • end

9
Star-P Architecture
Star-P
MATLAB
processor 0
client manager
processor 1
package manager
processor 2
dense/sparse
sort
processor 3
ScaLAPACK
FFTW
Ordinary Matlab variables
. . .
FPGA interface
MPI user code
UPC user code
processor n-1
server manager
Distributed matrices
matrix manager
10
Distributed Sparse Array Structure
31
1
41
P0
53
59
26
3
2
P1
Each processor stores local vertices edges in
a compressed row structure. Has been scaled to
gt108 vertices, gt109 edges in interactive session.
P2
Pn
11
Sparse Array and Matrix Operations
  • dsparse layout, same semantics as ordinary full
    sparse
  • Matrix arithmetic , max, sum, etc.
  • matrix matrix and matrix vector
  • Matrix indexing and concatenation
  • A (13, 4 5 2) B(, J) C
  • Linear solvers x A \ b using SuperLU (MPI)
  • Eigensolvers V, D eigs(A) using PARPACK
    (MPI)

12
Large-Scale Graph Algorithms
  • Graph theory, algorithms, and data structures are
    ubiquitous in sparse matrix computation.
  • Time to turn the relationship around!
  • Represent a graph as a sparse adjacency matrix.
  • A sparse matrix language is a good start on
    primitives for computing with graphs.
  • Leverage the mature techniques and tools of
    high-performance numerical computation.

13
Sparse Adjacency Matrix and Graph
?
AT
x
ATx
  • Adjacency matrix sparse array w/ nonzeros for
    graph edges
  • Storage-efficient implementation from sparse data
    structures

14
Breadth-First Search sparse mat vec
?
AT
x
ATx
  • Multiply by adjacency matrix ? step to neighbor
    vertices
  • Work-efficient implementation from sparse data
    structures

15
Breadth-First Search sparse mat vec
?
AT
x
ATx
  • Multiply by adjacency matrix ? step to neighbor
    vertices
  • Work-efficient implementation from sparse data
    structures

16
Breadth-First Search sparse mat vec
?
AT
x
ATx
  • Multiply by adjacency matrix ? step to neighbor
    vertices
  • Work-efficient implementation from sparse data
    structures

17
HPCS Graph Clustering Benchmark
Fine-grained, irregular data access Searching and
clustering
  • Many tight clusters, loosely interconnected
  • Input data is edge triples lt i, j, label(i,j) gt
  • Vertices and edges permuted randomly

18
Clustering by Breadth-First Search
  • Grow local clusters from many seeds in parallel
  • Breadth-first search by sparse matrix matrix
  • Cluster vertices connected by many short paths
  • Grow each seed to vertices
  • reached by at least k
  • paths of length 1 or 2
  • C sparse(seeds, 1ns, 1, n, ns)
  • C A C
  • C C A C
  • C C gt k

19
Toolbox for Graph Analysis and Pattern
Discovery
  • Layer 1 Graph Theoretic Tools
  • Graph operations
  • Global structure of graphs
  • Graph partitioning and clustering
  • Graph generators
  • Visualization and graphics
  • Scan and combining operations
  • Utilities

20
Typical Application Stack
Computational ecology, CFD, data exploration
Applications
CG, BiCGStab, etc. combinatorial
preconditioners (AMG, Vaidya)
Preconditioned Iterative Methods
Graph querying manipulation, connectivity,
spanning trees, geometric partitioning, nested
dissection, NNMF, . . .
Graph Analysis PD Toolbox
Arithmetic, matrix multiplication, indexing,
solvers (\, eigs)
Distributed Sparse Matrices
21
Landscape Connnectivity Modeling
  • Landscape type and features facilitate or impede
    movement of members of a species
  • Different species have different criteria,
    scales, etc.
  • Habitat quality, gene flow, population stability
  • Corridor identification, conservation planning

22
Pumas in Southern California
Habitat quality model
23
Predicting Gene Flow with Resistive Networks
24
Early Experience with Real Genetic Data
  • Good results with wolverines, mahogany, pumas
  • Matlab implementation
  • Needed
  • Finer resolution
  • Larger landscapes
  • Faster interaction

5km resolution(too coarse)
25
Circuitscape Combinatorics and Numerics
  • Model landscape (ideally at 100m resolution for
    pumas).
  • Initial grid models connections to 4 or 8
    neighbors.
  • Partition landscape into connected components via
    GAPDT
  • Use GAPDT to contract habitats into single graph
    nodes.
  • Compute resistance for pairs of habitats .
  • Direct methods are too slow for largest problems.
  • Use iterative solvers via Star-PHypre (PCGAMG)

26
Parallel Circuitscape Results
  • Pumas in southern California
  • 12 million nodes
  • Under 1 hour (16 processors)
  • Original code took 3 days at coarser resolution
  • Targeting much larger problems
  • Yellowstone-to-Yukon corridor

Figures courtesy of Brad McRae, NCEAS
27
Sparse Matrix times Sparse Matrix
  • A primitive in many array-based graph algorithms
  • Parallel breadth-first search
  • Shortest paths
  • Graph contraction
  • Subgraph / submatrix indexing
  • Etc.
  • Graphs are often not mesh-like, i.e. geometric
    locality and good separators.
  • Often do not want to optimize for one repeated
    operation, as in matvec for iterative methods

28
Sparse Matrix times Sparse Matrix
  • Current work
  • Parallel algorithms with 2D data layout
  • Sequential and parallel hypersparse algorithms
  • Matrices over semirings

29
ParSpGEMM
B(K,J)
J
K
K


C(I,J)
I
A(I,K)
  • Based on SUMMA
  • Simple for non-square matrices, etc.

C(I,J) A(I,K)B(K,J)
30
How Sparse? HyperSparse !
nnz(j)
nnz(j) c
blocks
  • Any local data structure that depends on local
    submatrix dimension n (such as CSR or CSC)
    is too wasteful.

31
SparseDComp Data Structure
  • Doubly compressed data structure
  • Maintains both DCSC and DCSR
  • C AB needs only A.DCSC and B.DCSR
  • 4nnz values communicated for AB in the worst
    case (though we usually get away with much less)

32
Sequential Operation Counts
Required non- zero operations (flops)
  • Matlab O(nnnz(B)f)
  • SpGEMM O(nzc(A)nzr(B)flogk)

Number of columns of A containing at least one
non-zero
Break-even point
33
Parallel Timings
time vs n/nnz, log-log plot
  • 16-processor Opteron, hypertransport, 64
    GB memory
  • R-MAT R-MAT
  • n 220
  • nnz 8, 4, 2, 1, .5 220

34
Matrices over Semirings
  • Matrix multiplication C AB (or
    matrix/vector)
  • Ci,j Ai,1?B1,j Ai,2?B2,j Ai,n?Bn,j
  • Replace scalar operations ? and by
  • ? associative, distributes over ?, identity 1
  • ? associative, commutative, identity 0
    annihilates under ?
  • Then Ci,j Ai,1?B1,j ? Ai,2?B2,j ? ?
    Ai,n?Bn,j
  • Examples (?,) (and,or) (,min) . . .
  • Same data reference pattern and control flow

35
Remarks
  • Tools for combinatorial methods built on parallel
    sparse matrix infrastructure
  • Easy-to-use interactive programming environment
  • Rapid prototyping tool for algorithm development
  • Interactive exploration and visualization of data
  • Sparse matrix sparse matrix is a key primitive
  • Matrices over semirings like (min,) as well as
    (,)
Write a Comment
User Comments (0)
About PowerShow.com