The CMU Reconfigurable Computing Project - PowerPoint PPT Presentation

About This Presentation
Title:

The CMU Reconfigurable Computing Project

Description:

(the basic processing elements compute on 1 bit) word-based architectures: PipeRench (CMU) ... (basic PE is a MIPS 2000 core) SSS 4/9/99. CMU Reconfigurable ... – PowerPoint PPT presentation

Number of Views:36
Avg rating:3.0/5.0
Slides: 39
Provided by: Mihai49
Learn more at: http://www.cs.cmu.edu
Category:

less

Transcript and Presenter's Notes

Title: The CMU Reconfigurable Computing Project


1
The CMU Reconfigurable Computing Project
  • April 9, 1999
  • Mihai Budiu
  • mihaib_at_cs.cmu.edu

2
Current Project Members
ECE Department Herman Schmit Srihari
Cadambi Matt Moe Robert Taylor Ronald Laufer
CS Department Seth Copen Goldstein Mihai Budiu
3
Why Study Reconfigurable Hardware?
  • It is a nice computation paradigm
  • (wire your own computer)

4
Why Study Reconfigurable Hardware
5
Commercial Players
Source In-stat April 1998  Does not include
software, hardwire or support EPROMs
6
What Is Reconfigurable Hardware?
Interconnection network
Universal gates and/or storage elements
Switches
7
Basic Ingredient RAM cell
8
Basic Ingredients (ctd)
0
1
1
1
A switch is controlled by a 1-bit RAM cell
9
Outline
  • What is reconfigurable hardware
  • RH vs other computation paradigms
  • Challenges in RH research
  • PipeRench the CMU project
  • the hardware
  • the software
  • Conclusions

10
RH vs ASICs
  • Generally Application-Specific Integrated
    Circuits will be faster than RH
  • RH wires are slow big
  • RH bit-slices are costly to interconnect
  • RH devices must store configuration on the chip
  • but
  • RH can be reprogrammed
  • new algorithms
  • to fix bugs
  • RH cheaper in small production
  • RH tolerates faults better
  • RH sometimes faster with staged computation

11
RH vs Microprocessors
  • RH less flexible (like a VLIW with fixed
    instructions)
  • but
  • RH provides more (customized) computation
    elements
  • RH can decrease memory traffic
  • RH can be tailored for specific algorithms and
    data types
  • RH will not replace mP, but complement them

12
Types of RH
  • FPGAs bit-level logic functionality
  • (the basic processing elements compute on 1 bit)
  • word-based architectures PipeRench (CMU)
  • (basic PE operates on 8 bits)
  • (basic PE is a small ALU)
  • coarse architectures RAW (MIT)
  • (basic PE is a MIPS 2000 core)

13
RH In A System
14
Challenges In RC
  • Software tools
  • Programming RC like software development
  • Automatic compilation from HLL
  • Automatic program partitioning
  • Mapping efficiently algorithms (no ISA)
  • System issues
  • interfaces
  • find ideal RC fabric

15
The CMU Reconfigurable Computing Project
16
Hardware Goals
  • To build a complete reconfigurable hardware
    device
  • To build the system integration hardware
  • To host the device in a PC

17
Our Device
  • Word processing elements
  • Pipelined architecture
  • Virtualized hardware
  • Local interconnection network
  • Wide pipelined bus

18
Configuration memory
Data Config controller
Stripes
Processing elements
19
Hardware Virtualization
Actual available hardware
Instructions currently in hardware
Program
Instructions paged out
20
Hardware Virtualization (2)
Page out
compute
compute
Program in configuration memory
compute
configure
Page in
hardware
Overlap configuration with computation.
21
Processing Elements
a
b
Cin
PE2
PE0
PE1
out
  • Look-up table
  • Any 3-to-1 function

22
The Interconnection Network
23
The PCI Board
24
Our Target Applications
v9
Input data
  • Pipelineable applications
  • Stream processing (e.g. DSP, encryption)
  • Multimedia processing
  • Vector processing
  • Limited data dependencies

v8
v7
v6
v5
HW
v4
v3
v2
Output data
v1
Computational power stems from massive parallelism
25
Software Goal
  • To program reconfigurable devices using the
    standard software development processes
  • Compile C or Java
  • Do it quickly

Java
Partitioner
Data-flow Intermediate Language
DIL
Built
Configuration
CPU
Reconfigurable HW
26
Building Circuits From DIL
  • a b c d
  • e c - d
  • variables wires
  • operators gates

d
c
b


-
a
e
27
Mapping Circuits To
a
b
c
a

b
c
c
a
b
-

-

-
c
a
b
-

28
The DIL Compiler Front-End
Circuit
Parser Evaluator Loader
Dil input file
Backend
Loader
component library
Component circuits
29
The DIL Compiler Backend
Circuit (expanded)
Circuit (placed)
Circuit
Optimizer
Placer- Router
Front-end
The whole compilation process is very fast
(compared to classical CAD tools). We can
compile two orders of magnitude faster.
Code generator
Asm
C
C
xfig
30
Processing Element Size Tradeoffs
31
Stripe Width Tradeoffs
32
Bus Width Tradeoffs
33
Clock Speed Tradeoffs(run-time)
24
24
24
24


8
8
24

8

24
34
(No Transcript)
35
(No Transcript)
36
Project Status
  • Operational
  • Behavioral and structural models of Piperench in
    Verilog
  • Assembler, simulator
  • Tools for visualization and debugging
  • One tile fabricated and tested
  • Very fast compiler from intermediate language
  • In work
  • Prototype PipeRench to be taped this summer
  • PCI board to host PipeRench in a PC

37
Simulated Speed-up vs. UltraSparc _at_ 300Mhz
38
Future Work
  • Build the PCI board
  • Build the OS device drivers
  • Start investigating HLL issues
  • automatic partitioning
  • translation to DIL
  • special code transformations

39
Conclusions
  • A set of important applications can benefit from
    RC devices
  • RC offer potential for substantial performance
    improvement at a low cost
  • RC devices will soon be mainstreamin the
    embedded computing world perhaps in the future
    they will also permeate the desktop

U V R
Pentium V
Write a Comment
User Comments (0)
About PowerShow.com