Title: Soft errors in adder circuits
1Soft errors in adder circuits
- Rajaraman Ramanarayanan, Mary Jane Irwin,
Vijaykrishnan Narayanan, Yuan Xie - Penn State University
- Kerry Bernstein
- IBM
2Talk Overview
- Introduction
- Soft errors
- Introduction
- Impact on data-path circuits
- Modeling soft errors in logic circuits
- Experimental setup and methodology
- Error injection mechanism
- Adder circuits considered
- Methodology
- Results
- Conclusions and Future work (Lessons learned)
3Talk Overview
- Introduction
- Soft errors
- Introduction
- Impact on data-path circuits
- Modeling soft errors in logic circuits
- Experimental setup and methodology
- Error injection mechanism
- Adder circuits considered
- Methodology
- Results
- Conclusions and Future work (Lessons learned)
4Introduction
- Soft errors, which are transient errors caused
due to external radiations, affected mainly
memory circuits. - Soft error rates (SER) in data-path structures
and combinational logic have been increasing due
to - Continuous device scaling.
- Voltage scaling and increased speed.
- increasing pipeline lengths.
5Introduction
- Adder circuits form an integral part of
data-path. - Hence, in this work, we
- Analyze SER in different types of adder circuits.
- Analyze the effect of voltage and frequency
scaling on SER. - Experimented techniques to improve the error
rates in adder circuits based on above results.
6Talk Overview
- Introduction
- Soft errors
- Introduction
- Impact on data-path circuits
- Modeling Soft errors in logic circuits
- Experimental setup and methodology
- Error injection mechanism
- Adder circuits considered
- Methodology
- Results
- Conclusions and Future work (Lessons learned)
7Soft errors - Introduction
- Soft errors or transient errors are circuit
errors caused due to excess charge carriers
induced primarily by external radiations. - These errors cause an upset event but the circuit
it self is not damaged.
8Soft errors - Sources
- At ground level, there are three major
contributors to Soft errors. - Alpha particles emitted by decaying radioactive
impurities in packaging and interconnect
materials. - Cosmic Ray induced neutrons cause errors due the
charge induced due to Silicon Recoil. - Neutron induced 10B fission which releases a
Alpha particle and 7Li.
9Soft Errors - The Phenomena
Current
A particle strike
G
D
S
n
p substrate
B
10Soft Errors - The Phenomena
A particle strike
Bit Flip !!!
11Soft errors - Impact on data-path circuits
- In data-path circuits, an error is caused when
the pulse generated by a particle is latched on
at the output by a flip-flop. - Here, the critical charge (Qcritical), can be
defined, as the minimum charge required to latch
on to the pulse. - There are three masking effects in combinational
circuits that affect the propagation of any given
pulse - Logical masking
- Electrical masking
- Latching window masking
12Masking effects in data-path
13Modeling soft errors
Simulated 150 MEV Proton-induced charge
collection for 90 nm and 130 nm bulk
technologies per-bit SER per unit collection
Courtesy - K. Bernstein
14Modeling soft error in logic circuits
- Massengill et al. developed a model for a tool,
which would predict the probability of an error
occurring in a given combinational circuit. - Probability that a random particle hit (resulting
in a current pulse) at a node N in a clock cycle
C will be latched on by the output latch or
flip-flop (PSFC,N). - Probability of soft errors occurring in a given
circuit can be determined using the above
probability.
15Modeling soft error in logic circuits
- In this work, we propose a model that can
accurately models the above probability. - We borrow the term Timing Vulnerability defined
in the work by S. Mukherjee et al. - This is defined as the fraction of time in a
clock cycle in which a given node in a circuit is
vulnerable (tv). - For example, a latch has a tv of 50.
- Thus, PSFC,N ? PQcoll tv , where PQcoll is
the collected charge at a given node.
16Talk Overview
- Introduction
- Soft errors
- Introduction
- Impact on in data-path circuits
- Modeling Soft errors in logic circuits
- Experimental setup and methodology
- Error injection mechanism
- Adder circuits considered
- Methodology
- Results
- Conclusions and Future work (Lessons learned)
17Error injection mechanism
- Based on the models provided by previous works,
- We modeled our current pulse as an exponential
wave form with a pulse width of 50 ps in our
HSPICE simulations. - Charge colleted at a node can be determined using
the following expression - Q ? Id dt, where Id Drain Current.
18Adder Designs Considered
19Methodology
- Our analysis consists of
- Measuring Qcritical at different nodes affecting
different outputs in B-K adders. - Comparing B-K with K-S and Ripple Carry (RC)
adders. - Measuring Qcritical and tv after scaling voltage
and frequency. - Next we consider techniques to improve the above
two quantities to increase the robustness of
adders - Use a Flip-Flop with better tv values.
- Using a Semi-dynamic Flip-Flop (SDFF) instead of
a Transmission gate Flip-Flop (TGFF) used
initially. - Increase threshold voltage, which increases
Qcritical. - All circuits were custom designed and laid out in
70nm technology.
20Talk Overview
- Introduction
- Soft errors
- Introduction
- Impact on data-path circuits
- Modeling Soft errors in logic circuits
- Experimental setup and methodology
- Error injection mechanism
- Adder circuits considered
- Methodology
- Results
- Conclusions and Future work (Lessons learned)
21Qcritical for B-K, K-S and RC adders
22Results B-K adders
- For B-K adders (at node C0)
- Qcriticals for a node to cause a flip at all the
sum outputs are similar. - Qcritical for all outputs flipping together is
higher.
23Adder comparisons
- For B-K and K-S, Qcritical at a node for flipping
different outputs are comparable while RC has
progressively increasing Qcritical. - Qcritical is smaller in K-S adders due to
shortest path carry chains. - Also KS adders have greater area susceptible to
soft errors due to larger number of carry cells. - B-K adder has more nodes that fans out equally
to many outputs - Hence, a single particle strike at a node can
cause multi-bit errors.
24Voltage and frequency scaling
25Voltage and Frequency scaling
- The adders were run at 1 GHz, 0.833GHz and 0.5
GHz with 1V, 0.8 V and 0.6V as supply voltages
respectively. - As both voltage and frequency are scaled,
Qcritical reduces slightly at many nodes. - Reducing frequency reduces tv, but reduction in
Qcritical plays a much important role.
26Optimization Techniques
- Using Flip-Flop with lesser set-up and hold time
(SDFF) - Improves timing vulnerability.
- Also improves Qcritical for multi-bit errors.
- Increasing threshold voltage
- Increases Qcritical as it increases the gain of
the logic circuit. - But it increases the timing vulnerability also.
27Conclusions and lessons learnt
- The timing vulnerability determines the
occurrences of multi-bit errors in adders. - Trade-offs have to be considered in using
different type of adders. - K-S adders have lesser Qcritical and higher soft
error rate. - Multi-bit errors can be more common in B-K adders
- Voltage and frequency scaling worsens the soft
error rate. - Techniques to improve Qcritical and tv were
presented. - Trade-offs in choosing these techniques.
28References
- L. W. Massengill, A. E. Baranski, D. O. V. Nort,
J. Meng, and B. L. Bhuva. Analysis of
Single-Event Effects in Combinational Logic
Simulation of the AM2901 Bitslice Processor. IEEE
Trans. on Nuclear Science, 47(6)26092615,
December 2000. - S. K. Reinhardt and S. Mukherjee. Transient Fault
Detection via Simultaneous Multithreading.
International Symposium on Computer Architecture,
pages 2536, July 2000.