Title: ??? (Youn-Long Lin)
1High-Level Synthesis of VLSIs
- ??? (Youn-Long Lin)
- Department of Computer Science
- National Tsing Hua University
THEDA Tsing Hua Electronic Design Automation
2VLSI Design Tools
- Design Capturing/Entry
- Analysis and Characterization
- Synthesis/Optimization
- Physical (Floor planning, Placement, Routing)
- Logic (FSM, Retiming, Sizing, DFT)
- High Level(RTL, Behavioral)
- Management
3Design Methodology Progress
Capture and Simulate
4Why not Synthesis?
Why Synthesis?
Productivity
Performance Loss
Correctness
Unsynthesizability
Re-Targetability
Inertial
5Structural
Behavioral
Block
Algorithm
FSM
RTL
Boolean
Gate
Xtor
GDSII
Placement
Y-Chart Dan D Gajski
Floorplan
Physical
6Structural
Behavioral
Block
Algorithm
FSM
RTL
Boolean
Gate
Xtor
GDSII
Placement
Layout Synthesis
Floorplan
Physical
7Structural
Behavioral
Block
Algorithm
FSM
RTL
Boolean
Gate
Xtor
GDSII
Placement
Logic Synthesis
Floorplan
Physical
8Structural
Behavioral
Block
Algorithm
FSM
RTL
Boolean
Gate
Xtor
GDSII
Placement
High-Level Synthesis
Floorplan
Physical
9High Level Synthesis
Behavioral Description
10What Went Wrong?
- Too much emphasis on incremental work on
algorithms and point tools - Unrealistic assumption on component capability,
architectures, timing, etc - Lack of quality-measurement from the low level
- Too much promising on fully automation (silicon
compiler??)
11Essential Issues
- Behavioral Specification Languages
- Target Architectures
- Intermediate Representation
- Operation Scheduling
- Allocation/Binding
- Control Generation
12Behavioral Specification Languages
- Add hardware-specific constructs to existing
languages - HardwareC
- Popular HDL
- Verilog, VHDL
- Synthesis-oriented HDL
- UDL/I
13Target Architectures
- Bus-based
- Multiplexer-based
- Register file
- Pipelined
- RISC, VLIW
- Interface Protocol
14Design Space Exploration
Delay
Arch I
Arch II
Arch III
Area
15FSM with Data Path (FSMD)
16Intermediate Representation
Data Flow Graph
Control Flow Graph
17Scheduling (Temporal Binding)
- Time Resource Tradeoff
- Time-Constrained
- Integer Linear Programming (ILP)
- Force-Directed
- Resource-Constrained
- List Scheduling
- Other Heuristics
- Simulated Annealing, Tabu Search, ...
18Allocation/Binding
19(No Transcript)
20Controller Specification Generation
Scheduled CDFG
Allocated Datapath
21HLS Quality Measures
- Performance
- Area Cost
- Power Consumption
- Testability
- Reusability
22Hardware Variations
- Functional Units
- Pipelined, Multi-Cycle, Chained, Multi-Function
- Storage
- Register, RF, Multi-Ported, RAM, ROM, FIFO,
Distributed - Interconnect
- Bus, Segmented Bus, Mux, Protocol-Based
23Functional Unit Variations
Step 1
Step 2
-
Step 3
Step 4
24Storage/Interconnect Variations
25Architectural Pipelining
FSM
Data Path
26THEDAs Work on HLS
- ILP-based Scheduling
- Bipartite Weighted Matching for Datapath
Allocation - Performance-Driven Interconnect Synthesis
- Loop Folding Retiming
- Integrating Synthesis and Layout
- DSP Core Generation
- Book on HLS
27Integer Linear Programming for Scheduling
- Given Control Steps
- ASAP ALAP gt Possible Steps for each
Operations - Tight Constraints on
- Dependency
- One Scheduled Step per Op
- Resource Usage per Step
- Many Extensions
28Advanced Scheduling for Loop Folding
1
1
2
2
3
3
1 iteration per 3 cycles
1 iteration per 2 cycles
29Loop Folding(cont.)
1
2
3
30Retiming and Loop Folding
B
A
C
D
E
F
B
A
C
D
E
F
E
A
C
E
B
A
C
D
F
B
D
F
31Integrating Layout and Synthesis
HDL Description
PR
RC Extraction Delay Calculation
HDL Synthesis
Soft-Macro Formation
Soft-Macro Formation
Post-Layout Timing Analysis
Block Placement
Timing Ok no more area improvement
No
Soft-Macro Placement
Soft-Macro Placement
Module Resynthesis
Module Resynthesis
Chip Layout
32HLS Techniques for DSP Code Generation
Memory Allocation
Scheduling
Address Generation
33Applications of HLS Technology
- Code generation for embedded processors
- Retargetable compilers for application-specific
instruction-set processors (ASIP) - Reconfigurable computing
- Advanced features in logic synthesizer
34System-on-a-Chip
Processor
Memory
Wireless
Bridge
External Memory Interface
USB
IP
Bus Master
UART
35SOC with PLDs
Processor
Memory
Wireless
Bridge
External Memory Interface
USB
FPGA
Bus Master
FPGA
36System Houses/ IC Vendors (Fabless)
Wafer Foundry
Library/ IP Vendors (Chipless)
Integrators
EDA Vendors
Paradigm Shift
37IP and Synthesis
- Authoring IP for Synthesis
- Synthesis utilizing IP
- Synthesizing IPs
Executable Data Sheets
38Executable Data Sheets
More than just the Port Interface
IP Wrapper
IP
39Future Directions
- Realistic Methodology
- Evolutional Transition from Current Practice
- Domain Specific
- IP-Centric
- As both Authoring Aid and Integrator
- Software
- Co-design and Code Generation
40IC
Value
IP
EDA
Time