AccelChip Presentation - PowerPoint PPT Presentation

1 / 18
About This Presentation
Title:

AccelChip Presentation

Description:

An Automated System for Floating- to Fixed-Point Conversion of High Performance of MATLAB Algorithms in FPGAs and ASICs Eric Cigan and Robert Anderson – PowerPoint PPT presentation

Number of Views:28
Avg rating:3.0/5.0
Slides: 19
Provided by: klabsOrgm
Learn more at: http://klabs.org
Category:

less

Transcript and Presenter's Notes

Title: AccelChip Presentation


1

An Automated System for Floating- to Fixed-Point
Conversion of High Performance of MATLAB
Algorithms in FPGAs and ASICs Eric Cigan and
Robert Anderson September 2004
2
Outline
  • Motivation
  • For FPGA / ASIC implementation
  • For fixed-point arithmetic
  • Alternatives
  • Related approaches methods used
  • Changshun Shi performance criteria and
    optimization
  • Approach used
  • MATLAB-based algorithmic synthesis
  • Heuristic-based approach
  • Advantages of approach
  • Single design source
  • Verification flow
  • Example
  • FIR filter

3
Rationale for DSP Algorithms in Silicon
Function TI C6416Tat 1 GHz Xilinx Virtex-II Pro Platform
8x8 Multiply Accumulate (MAC) 8.0 Billion MAC/s 1 Trillion MACs/s fclk 300 MHz
FIR Filter  - 256 Taps, Linear phase  - 16-bit data/coefficients 15.5 MSPS 300 MSPS  fclk 300 MHz
Complex FFT  - 1024 point, 16-bit data 6 µs 1 µs fclk 150 MHz
Viterbi Decoding Throughput 600 channels at 7.95-Kbps for a total of 4.77 Mbps 155 Mbps (OC-3 rates)
Reed-Solomon Decoding Throughput 6.8 Mbps 10 Gbps (OC-192 rates) fclk 85 MHz
Turbo Convolutional Decoder Throughput Seven 2-Mbps data streams (6 iterations) 5.4 Mbps (6 iterations)
Dedicated coprocessor 16 cores running in parallel in a single device.
Source Xilinx website (1/2004) and AccelChip estimates
4
Fixed-Point Implications
  • Increases in performance (throughput, frequency,
    etc.)
  • In communications a 1 dB increase in the link
    power budget results in a 25 increase in
    coverage
  • Power consumption
  • Word size silicon area/memory gt Cost

Source The MathWorks, 2003
5
Previous Work
  • Rule-of-thumb methods
  • Ad-hoc methods such as rounding and truncation.
  • Manual scaling to and from integer
    representations.
  • Recoding the source code in a hardware
    description language and then verifying
    performance in the RTL code.
  • Substitution of floating-point functions with
    fixed-point equivalents, such as in C fixed-point
    libraries.
  • Recent research
  • Keding, et al (RWTH Aachen, 1998)
  • Algorithm described in C/C
  • Oriented toward minimizing all wordlengths at
    same time
  • Can require large number of iterations
  • Shi and Brodersen (UC Berkeley, 2002-2004)
  • Algorithm described in graphical design tool
  • Optimization-based methodology
  • Requires Monte Carlo simulation runs

6
MATLAB-Based Algorithmic Synthesis Flow
Algorithm Exploration
Floating Point Design
DSP IP Libraries
Fixed Point Models
SignalProcessing
System-Level Verification
Automated Fixed Point Model Generation
Project Directory
Communications
Fixed-Point Model Generation Report
Model Export
Implementation Exploration
Verification Report
IC Design Tools
RTL Model
Place Route
Final IC Design Verification
Algorithmic Synthesis Environment
Implementation Optimization
7
Synthesizable MATLAB Coding Basics
  • MATLAB script file applies input vectors to the
    design function in a loop called the streaming
    loop
  • The design to be synthesized needs to exist in a
    function called the design function
  • Additional MATLAB files or external data files
    may exist

8
Fixed-Point Quantization
  • Tool supports two fixed point datatypes
  • Fixed 2s complement
  • Required for negative numbers
  • Much easier to multiply (dont have to worry
    about sign bit)
  • ufixed (unsigned)
  • Provides a greater range with same hardware when
    numbers are all positive
  • To optimize the dynamic range of a number
  • Use a minimal of integer bits to accommodate
    the range of possible values
  • Use a minimal of fraction bits to accommodate
    acceptable precision

9
Automated Quantization Method
  • Quantization performed on floating-point MATLAB
  • Bit widths are derived from
  • Stimulus waveforms
  • Text files loaded to initialize constants
  • Bit growth propagated from arithmetic operations
  • MATLAB quantizer statements in M-file
  • Explicit quantizer directives

10
Fixed-Point Generation Process
  • Process is automated, yet requires designers
    knowledge

11
Generated Fixed-Point MATLAB Files
  • Tool generates fixed-point M-files from source
    floating point M-files
  • The MATLAB quantize and quantizer functions
    are inserted into the original source code
  • Optimization of quantizer functions reduces
    simulation overhead of fixed-point simulation
    up to 100x faster

Floating Point MATLAB
Generated Fixed-Point Output
12
The Quantize Directive
  • The Quantize directive allows users to override
    the automatic quantization
  • Applying directive allows use of unmodified
    MATLAB source
  • Directives can be typed at the command line or
    added to an .add directives file
  • AccelChip will create this file automatically
    when Save -gt Project is executed

Directives file located here
Quantize directives can be applied interactively
from the design browser
SetDirective Variables.indatabuf -quantize fixed
floor wrap 32 16
13
Auto-Quantization Report
MATLAB Editor
All variables in a design are displayed in a
single, flat format
Directives Editor
Double-Click
Specifies the source of the directive Flags
directives that are set to maximum
Double-Click
14
Overflow and Underflow
  • An underflow is defined as a number that is
    nonzero before it is quantized, and zero after it
    is quantized.
  • Example of an underflow
  • X quantize(quantizer(ufixed,floor,wrap,2
    2), 0.1)
  • 00b 0
  • 01b 0.25
  • 10b 0.50
  • 11b 0.75
  • An overflow occurs when the magnitude of a number
    exceeds the dynamic range allowable by the
    integer bits as defined by the quantizer
  • Example of an overflow
  • X quantize(quantizer(ufixed,floor,wrap,2
    0), 5)
  • 00b 0

X will be zero because the smallest non-zero
number will be too large
X will be one because the binary representation
of 5, 101 will be wrapped to 01
15
Managing Underflow and Overflow Issues
  • Often a design has a large number of underflows
    and only a few overflows.
  • Displaying overflow and underflow messages
    together often makes it difficult to locate the
    overflows
  • Recommended Practice
  • Begin the debug process by setting Show
    Underflows to False
  • Once overflows are addressed then set Show
    Underflows to True and address any remaining
    fidelity issues

Project options are available to selectively
disable displaying overflow or underflow message
16
Addressing Bit Growth Due to Constants
A constant introduced here
Will affect all downstream hardware
Will affect all downstream hardware
  • Constants are represented in binary form with
    maximum accuracy
  • This can lead to unnecessary bit growth
  • Constants used directly in expressions cant be
    quantized directly
  • Change in coding style allows bit growth
    management

Y x quantize( quantizer(ufixed','floor','wrap
', 10,9), 1.3 )
or
Y x 1.3
Const 1.3 Y x cost
Recommended
17
Improving Hardware through Quantization
  • Quantization can have a dramatic affect on
    performance and area
  • Tool will attempt to auto-quantize to preserve
    signal fidelity
  • A maximum of 53 bits will be used for internal
    variables
  • Consistent with abscissa of MATLAB double
    number
  • A maximum of 32 bits will be used for IO ports
  • Start by reviewing the fractional bits of the
    default quantization assigned to constants and
    the input data stream
  • Tool will use as much precision as necessary to
    represent number
  • E.g., 0.375 ltbinary ptgt011 and 0.3 ltbinary
    ptgt01100110011001100110011001100110011001100110011
    001100
  • Designer uses tool directives to trim bits

Example 16-tap FIR filter
Quantization on Filter Coefficients of Slices
Fixed, wrap, floor, 10 8 55
Fixed, wrap, floor, 53 51 169
Default quantization used 53 bits
18
Conclusion
  • Algorithmic synthesis tool enables MATLAB design
    to be design source throughout process
  • Tool aids in automating process of converting
    floating point designs to fixed-point
  • Provides design exploration to increase
    performance and reduce size/power

MATLAB Domain (Pure algorithmicnon-implementation
-specific)
m language (used by MATLAB)
More abstract, lessimplementation-specific
Untimed C Domain (non-implementation-specific)
Standard C (used by Catapult C)
Handel-C
Timed C Domain (implementation-specific)
SystemC
Less abstract, moreimplementation-specific
RTL Domain (implementation-specific)
Verilog and/or VHDL
Different levels of synthesis abstractionsource
Mentor Graphics white paper, Catapult C
Synthesis-based Design Flow, October 2003
Write a Comment
User Comments (0)
About PowerShow.com