Title: A Robust Celllevel Crosstalk Delay Change analysis
1A Robust Cell-level Crosstalk Delay Change
analysis
- Igor Keller, Ken Tseng, Nishath Verghese
- Cadence Design Systems
- San Jose, CA
2Some Technology Trends
- Interconnect is a key player
- Cc/Ctot, Rline/Rdrv grow
- Timing closure is harder to reach
- Clock cycle smaller
- Tighter margins
- Bigger uncertainty
- Waveform effects important
- Resistive interconnect
- X-Talk
- Skewed gates
3Modern SI-STA problems
- Biggest inaccuracy factors
- Modeling switching aggressors (linear ramp? ?)
- Aggressor selection
- Alignment
- Superposition of noiseless transition and glitch
? - Irregular waveform
- Measurement (slew delay) ? ?
- Downstream effect ?
- Aggressor driver net impedance ? ?
- Linear(ized) driver model
- Ceff Line resistance and x-talk ?
- Non-robustness
- Amplifies small inaccuracy ?
- Discontinuities in results ?
4Today SI-STA must be
- Accurate
- Reference WC realizable setup spice
- Conservative
- Slightly conservative (10)
- Efficient
- Big designs overnight
- Run time linear with number of nets, gates
- Robust with respect to variations of
- Waveforms on aggressors/inputs
- Alignment
- Voltage supply, load on receiver
5Building blocks of new SI-STA
Accurate driver model
Interconnect reduction
Robust delay measurement
PWL wave propagation
Constrained optimization of alignment
6Driver model
7Interconnect
Reduction
8Robust delay measurement
Delay change
9PWL Waveform
10Alignment optimization
delta
?2
11Delay Change problem formulation
12Driver Current Model
accurate
Fast
- Current is a function of voltages
- Like Spice
13ViVo Current Model
Vo
14Standard Cell Library
ViVo models Input and output CCCs Miller and
ground caps
15Interconnect reduction
- RC-network
- Distributed victim aggressors
- Soft-grounded
- Block-Arnoldi
- Proven passivity
Rcv2
16Flow
nets
Fast PWL propagation
nets
arcs
Driver output response
Noise less
Receiver input response
Receiver output response
Aggressor responses
Alignments, receivers
Select alignment
noisy
Driver output response
Receiver input response
Receiver output response
Delay changes
17Fast PWL transition
- BFS traverse of whole design
- One-stage cells
- Driver is modeled using Vivo model
- Multi-stage cells
- Last CCC is modeled using Vivo model
- Slew propagation table till last CCC
VivoSim
Slew prop table from .cdB
18Flow
nets
Fast PWL propagation
nets
arcs
Driver output response
Noise less
Receiver input response
Receiver output response
Aggressor responses
Alignments, receivers
Select alignment
noisy
Driver output response
Receiver input response
Receiver output response
Delay changes
19Driver output noiseless response
- Truncate Z(s) to PI-Load
- 2 ODEs solved efficiently
- 2d order implicit numerical scheme
(Crank-Nicholson) - Newton iterations employ predictor-corrector
idea
20Noiseless response at receiver
j
Voltage Substitution
Y(s) H(s)
j
vj0Hj0.v0
v0
21nets
Fast PWL propagation
nets
arcs
Driver output response
Noise less
Receiver input response
Receiver output response
Aggressor responses
Alignments, receivers
Select alignment
noisy
Driver output response
Receiver input response
Receiver output response
Delay changes
22Voltage and current responses from aggressors
k
S vjk(t-tk)
Si0k(t-tk)
i0
k
k
j
Open circuit voltage
Short circuit current
k
Y (s) H (s)
S vjk(t-tk)
Si0k(t-tk)
i0
k
j
k
Y0k.vk and Hjk.vk are found once for all
alignments!
i0kY0k.vk
vjkHjk.vk
23nets
Fast PWL propagation
nets
arcs
Driver output response
Noise less
Receiver input response
Receiver output response
Aggressor responses
Alignments, receivers
Select alignment
noisy
Driver output response
Receiver input response
Receiver output response
Delay changes
24Driver output response with x-talk
Y (s) H (s)
Iagg
25Noisy response at receiver
j
Voltage Substitution
vj0Hj0.v0
Y(s) H(s)
vj0 Hj0.v0 is found once for each alignment !
j
v0
Linear superposition
S vjk(t-tk)
vj vj0
26ViVoSim accuracy vs. Spice
r200 Ohm xc20 fF
Miller Effect
1.1u/1.5u
27ViVoSim accuracy on path
28Results 40K, 1.3um, 200sec on Linux 2Gh
29Pessimism reduction with ROP
Without Noise
Path delay without noise 4ns Path delay with
noise Rcv Input Probing 8.5ns Rcv Output
Probing 6ns Pessimism reduction 2.5ns!
6ns
RIP
6ns
4ns
3.5ns
ROP
30Path-Based vs Net-Based analysis 0.13um, 500K
instances
Net Based Worst Case Slack -1.56ns
Path Based Worst Case Slack -1.33ns
31Conclusions
- A new delay change analysis is based on
- Current model
- Efficient linear and nonlinear solvers
- PWL waves propagation
- ROP idea
- Nonlinear constrained optimization for alignment
- Accurate
- 2-3 off linear sweep in Spice
- Robust, due to
- Optimization, use of low-pass filter (ROP)
- Reduces pessimism, due to
- ROP, PWL on aggressors
- Fast Less than 1 hour for 1M gates design
- Implemented in CeltIC
32(No Transcript)
33Alignment search
- Nonlinear optimization under constraints
- 2-step process
- Aggressor cluster aligned with victim
- Weighted average of individual alignements
- Refinement per each aggressor
- Rarely needed
34ROP vs RIP slacks
Worst RIP slack -1.56ns
Worst ROP slack -1.33ns
Missing noisy slew annotation?