An Automated Design Flow for LowPower, HighThroughput Dedicated Signal Processing Systems - PowerPoint PPT Presentation

1 / 24
About This Presentation
Title:

An Automated Design Flow for LowPower, HighThroughput Dedicated Signal Processing Systems

Description:

DSSS TDMA w/ length 31 spreading code & 25 MHz chip rate ... Routing began after 2 months. No modification to dataflow graph from switch-level sims. ... – PowerPoint PPT presentation

Number of Views:50
Avg rating:3.0/5.0
Slides: 25
Provided by: BobBro8
Category:

less

Transcript and Presenter's Notes

Title: An Automated Design Flow for LowPower, HighThroughput Dedicated Signal Processing Systems


1
An Automated Design Flow for Low-Power,
High-ThroughputDedicated Signal Processing
Systems
W. Rhett Davis, Ning Zhang, Kevin Camera, Dejan
Markovic, Tina Smilkstein, Nathan Chan, M. Josie
Ammer, Engling Yeo, Borivoje Nikolic, Robert W.
Brodersen
  • http//bwrc.eecs.berkeley.edu

2
Energy-Efficiency of Architectures
1000
Dedicated HW
Direct mapped 100-1000 MOPS/mW
100
ReconfigurableProcessor/Logic
Reconfiguration (???) Potential of 10-100 MOPS/mW
Energy Efficiency MOPS/mW (or MIPS/mW)
10
1
Embedded mProcessors
Microprocessor .1-1 MIPS/mW
0.1
Flexibility (Coverage)
3
Results in fully parallel solutions
Reducing supply voltage saves energy E CV2
(numbers taken from vendor-published
benchmarks) Orders of magnitude lower efficiency
even for an optimized processor architecture
4
Standard DSP-ASIC Design Flow
Problems
  • Three translations of design data
  • Requirements for re-verification at each stage
  • Uncontrolled looping when pipeline stalls

Prohibitively Long Design Time for Direct Mapped
Architectures
5
Direct Mapping Design Flow
Algorithm/System
Simulation
Back-End
Front-End
Floorplan
RTL Libraries
Automated Flow
Mask Layout
Performance Estimates
  • Encourages iterations of layout
  • Controls looping
  • Reduces the flow to a single phase
  • Depends on fast automation

6
Outline
  • Chip-in-a-Day Flow
  • User Perspective
  • Design Decisions
  • Estimation
  • Automation
  • Design Example
  • Baseband Receiver
  • Design Effort
  • Problems

7
Capturing Design Decisions
  • Categories
  • Function - basic input-output behavior
  • Signal - physical signals and types
  • Circuit - transistors
  • Floorplan - physical positions

Layout and performance estimates in a day
8
Performance Estimates
  • Estimates vary in accuracy and execution time
  • Different estimates should be available depending
    on how many decisions have been made

9
Push-Button Automation
  • Automation similar to MAKE
  • No decisions made after button is pressed
  • No translation of design data
  • No decisions expressed more than once

10
Example Multiple Graphs
  • Early versions inferred pads from ports
  • Another copy of design was maintained to examine
    internal signals (decisions expressed twice)
  • Todays version allows SimOnly ports

11
Choice of Computation Model
  • Balance between conflicting needs
  • Abstraction of Hardware (simulation speed)
  • Clear Verification Strategy (detailed hardware)

12
Example Dataflow Graph
  • Discrete-Time(cycle accurate)
  • Fixed-Point Types(bit true)
  • No need for RTL simulation
  • Embed macro choices
  • Used SimulinkTM

Multiply / Accumulate
13
Hierarchical Dataflow Graphs
  • Want the hardware hierarchy to match dataflow
    graph hierarchy
  • Grouping needed to hide complexity
  • Referencing needed to save design effort

14
Modeling Control Logic
  • Extended finite state-machine editor
  • Co-simulation with dataflow graph
  • New SoftwareStateflow-VHDL translator
  • No need for RTL

Address Generator / MAC Reset
15
Floorplan Merged on Each Iteration
(Function, Signal, Circuit decisions)
(Floorplan decisions)
save the state
16
Outline
  • Chip-in-a-Day Flow
  • User Perspective
  • Design Decisions
  • Estimation
  • Automation
  • Design Example
  • Baseband Receiver
  • Design Effort
  • Problems

17
TDMA Baseband Receiver
  • DSSS TDMA w/ length 31 spreading code 25 MHz
    chip rate
  • 806 kHz symbol rate, w/ QPSK gives 1.6 Mb/s data
    rate
  • 7 bit I Q streams at 200 MHz, 8 parallel
    streams at 25 MHz

Q
18
Design Effort
  • Spec. Changes required no modification of
    datapath macros
  • Routing began after 2 months
  • No modification to dataflow graph from
    switch-level sims.
  • Flow under development
  • Reuse is crucial

19
Automation Statistics
  • Assuming automated flow and libraries are
    debugged, design time is little more than a day

20
Chip Layout Plot
  • 600k transistors
  • 0.18 mm
  • 1.0 V
  • 25 MHz
  • 3.7 mm x 3.7 mm(w/ pads)
  • 1.8 mm x 1.3 mm(core only)
  • 21 mW

21
Constant Shift Discrepancy
  • Incorrect for right-shift and negative values
    only
  • Breaks the verification methodology
  • Solved by creating a new primitive

22
Disabled Clock Discrepancy
  • State is updated but output is held when disabled
  • Solved by specifying certain cycles as dont
    care

23
Moving Beyond Cycle-Accuracy
  • Using dont care windows could relax
    cycle-accuracy constraint
  • Must be automated into verification process

24
Conclusions
  • Direct-Mapped hardware is the most efficient use
    of silicon
  • Design with dataflow graphs, not sequential code
  • Dont translate design data, refine it
  • Verification methodology determines level of
    abstraction
  • Chip-in-a-Day design flows are feasible
Write a Comment
User Comments (0)
About PowerShow.com