Project Name - PowerPoint PPT Presentation

1 / 1
About This Presentation
Title:

Project Name

Description:

Harnessing the data parallelism within each SM. Orchestrating the execution across SMs to exploit task and pipeline parallelism ... – PowerPoint PPT presentation

Number of Views:85
Avg rating:3.0/5.0
Slides: 2
Provided by: Danie292
Category:

less

Transcript and Presenter's Notes

Title: Project Name


1
High Performance Applications on Heterogeneous
Windows Clusters http//www.hpc.serc.iisc.ernet.in
/MSP
Supercomputer Education Research Centre, Indian
Institute of Science
R. Govindarajan, Sreepathi Pai, Matthew J.
Thazhuthaveetil, Abhishek Udupa
Accelerators for HPC StreamIt on GPU
The Process
  • Speedups of upto 33.82X over a single threaded
    CPU execution
  • Software Pipelined scheme outperforms a naive
    serial SAS schedule in most cases
  • Coalesced Accesses results in huge performance
    gains, as evident from the poor performance of
    SWPNC
  • Further details in Software Pipelined
    Execution of Stream Programs on GPUs (CGO 2009)

The Results
Accelerators for HPC Programming Models
Enter PLASMA
  • Operator
  • Add, Mult,
  • Vector
  • 1-D bulk data type of base types
  • E.g. lt1, 2, 3, 4, 5gt
  • Distributor
  • Distributes operator over vector
  • Example
  • par add lt1,2,3,4,5gt lt10,15,20,25,30gt returns
    lt11, 17, 23, 29, 35gt
  • Vector composition
  • Concat, slice, gather, scatter,
  • What a Solution Needs to Provide
  • Rich abstractions for Functionality
  • Not a lowest common denominator
  • Independence from any single architecture
  • Portability without compromises on efficiency
  • Don't forget high-performance goals of the ISA
  • Scalability Both ways, from micro engines to
    massively parallel devices
  • Single core embedded processor to multi-core
    workstation
  • Take advantage of Accelerators (GPU, Cell, etc.)
  • Transparent Access to Distributed Memory
  • Management and load balance across heterogeneous
    devices

Initial Performance Results
  • The PLASMA Framework
  • CPLASM, a prototype high-level assembly
    language
  • Prototype PLASMA IR Compiler
  • Currently Supported Targets
  • C (Scalar), SSE3, CUDA (NVIDIA GPUs)?
  • Future Targets
  • Cell, ATI, ARM Neon, ...
  • Compiler Optimizations for Vector IR

Microsoft External Research Initiative
Write a Comment
User Comments (0)
About PowerShow.com