Melanie Berg - PowerPoint PPT Presentation

About This Presentation
Title:

Melanie Berg

Description:

Embedding Asynchronous FIFO Memory Blocks in Xilinx Virtex ... After GTMR, SEUs become insignificant. MBUs may be insignificant (still under investigation) ... – PowerPoint PPT presentation

Number of Views:134
Avg rating:3.0/5.0
Slides: 39
Provided by: melani55
Category:

less

Transcript and Presenter's Notes

Title: Melanie Berg


1

Embedding Asynchronous FIFO Memory Blocks in Xilinx Virtex Series FPGAs Targeted for Critical Space System Applications
  • Melanie Berg
  • MEI Technologies- NASA/GSFC Radiation Effects and
    Analysis Group

2
Agenda
  • Asynchronous FIFO (AFIFO) Basics Operation and
    Architecture
  • Single Event Upsets (SEUs) and Xilinx FPGA Error
    Signatures
  • Mitigating Radiation Effects is AFIFOs with
    Triple Modular Redundancy (TMR)
  • Xilinx Generated Intellectual Property (IP) Cores
  • User Customized Design
  • Conclusion

3
AFIFO Basics
  • Used to pass data between two separate clock
    domains
  • Write Domain
  • Read Domain
  • Each domain has its own
  • Clock
  • Enable
  • Data port
  • Address Pointer Protection
  • Domains share memory space

Basic AFIFO
WCLK
RCLK
WEN
REN
DIN
DOUT
EMPTY
FULL
Write Protection Flag
Read Protection Flag
4
AFIFO Operation Internal Address Pointers
Write Pointer (WP)
Read Pointer (RP)
  • There are 2 internal address pointers
  • Write Pointer (WP)
  • Read Pointer (RP)
  • Pointers increment independently
  • Pointers are inaccessible by user
  • At Start
  • RP WP 0
  • Empty 1
  • Full 0
  • If Empty RP will not increment
  • If Full WP will not increment

Dual-Port Memory
5
Whats the Problem Regarding Critical Mission
Application?
  • Asynchronous nature of the AFIFO leads to
    non-deterministic behavior
  • Empty and full flag can be generated on either
    cycle(t) or cycle(t1)
  • Redundant AFIFOs can be out of step with each
    other
  • Critical Missions will generally require
    redundancy mitigation due to SEEs
  • IP Cores have limited protection capability if
    redundancy is required
  • Designers must be aware of asynchronous corner
    cases when inserting redundancy and mitigation
  • Design for Radiation Effects, M. Berg Short
    Course MAPLD 2008
  • A Comparative Study of Field Programmable Gate
    Array Error Cross Sections Putting Data into
    Perspective, M. Berg, C. Perez April, SEE 2008

6
AFIFO Operation Internal FULL Detection Example
(WP)
(RP)
Traditional
Look Ahead
7
WCLK Example Comparing WP and RP Across
Asynchronous Clock Domains
  • WP changes _at_ WCLKedge(tclk?q)
  • RP changes _at_ RCLKedge(tclk?q)
  • Compare RP Crosses into WCLK

Metastable Event
8
Reducing the Number of Metastable bits in Compare
Path
  • Problem With binary encoding too many address
    bits can change simultaneously during an
    increment 11111? 00000
  • Resolution Use gray encoding
  • Only used to cross domains
  • Only one bit changes per clock cycle
  • Comparison must still be performed in binary

Binary Gray
000 000
001 001
010 011
011 010
100 110
101 111
110 101
111 100
9
Scheme Reducing the Number of Metastable bits in
Compare Path using Converters
Gray
Binary
10
Xilinx AFIFO IP Core Block Diagram
Previously Illustrated Example
11
Single Event Upsets (SEUs) and Xilinx FPGA Error
Signatures
After Nikkei Science, Inc. of Japan, by K. Endo
12
Xilinx Architecture Basics
  • Configuration
  • programmable switches
  • static during operation
  • Logic
  • Switches during operation
  • Performs expected functionality

13
Programmable Switch Implementation and (SEU)
Susceptibility
SRAM (Reprogrammable Configuration)
Sensitive Volume
14
Xilinx XQR4VSX55 Radiation Test Data
Xilinx Consortium VIRTEX-4VQ STATIC SEU
CHARACTERIZATION SUMMARY April/2008
Probability Error Rate LEO GEO
Configuration Memory XQR4VSX55 Pconfiguration 7.43 4.2
Combined SEFIs per device PSEFI 7.5x10-5 2.7x10-5
  • For non-mitigated designs the most significant
    upset factor is

M Berg, Trading ASIC and FPGA Considerations for
System Insertion IEEE Nuclear Science Radiation
Effects Conference 2009
15
Configuration Upset Effects
16
DFF Configuration Upset Effects
17
Routing Configuration Upset Effects
18
AFIFO Error Signatures
  • Will Read Invalid locations
  • Can write over unread data
  • Fixing the configuration bit will not fix the
    address pointer
  • Most importantly, a large amount of data can be
    lost

19
AFIFO IP Core Error Signatures
  • Address Pointers are internal to FIFO and cannot
    be fixed
  • Pointer Corruption (and data loss) is not
    detectable
  • Error and Full Flags become invalid

FIFO Core
WCLK
RCLK
WEN
REN
DIN
DOUT
FULL
EMPTY
20
Mitigating Radiation Effects in AFIFOs with
Triple Modular Redundancy (TMR)
21
IP Cores Manual Core TMR
22
Implications of Core Triplication Lack of
Visibility and Control
V O T I N G M A T R I X
  • Inability to correct read and internal address
    pointers
  • More than one core will go bad over time
  • Scrubbing will not fix logic state of IP Core
    (need a voter)
  • Inactivity will accumulate and the system will
    eventually not be valid
  • Asynchronous issues negate effectiveness of
    mitigation

FPGA IP CORE
FPGA IP CORE
FPGA IP CORE
23
User Customized Design Creating Correction
Control
24
User AFIFO Custom Design and Global TMR (GTMR)
  • Triplicate everything in circuit Clocks, I/O,
    logic, resets
  • Place Voter after every DFF (or only DFFs with
    feedback)
  • Level of triplication can vary (e.g. no clock
    redundancy, I/O redundancy, or only voters in
    feedback paths)

Mitigated
User Non-Mitigated Design
25
Mitigation Windows Significantly Decrease
Susceptibility
Small mitigation windows Greater Protection
Low Probability of upset
DFF
DFF
0
1
2
V
DFF
DFF
0
1
2
V
DFF
6
7
8
V
DFF
DFF
0
1
2
V
DFF
6
7
8
V
DFF
6
7
8
V
26
Upper-Bound Error Prediction Xilinx FPGA XTMR
  • What about PConfiguration ???
  • After GTMR, SEUs become insignificant
  • MBUs may be insignificant (still under
    investigation)

Assumes Unmitigated SEFIs are the most
predominant source Error Rates migrate from Days
to Years
Years - Decades
Source M. Berg Trading ASIC and FPGA
Considerations for System Insertion, NSREC Short
Course 2009, Quebec Canada
27
Global TMR (GTMR) Continued
  • Redundancy and Mitigation created via a tool
  • Tool creates redundancy and inserts voters
    post-synthesis (at gate level)
  • Can be difficult to validate tool TMR insertion
  • did anything break?
  • is everything mitigated as expected?
  • Commercially Available tools
  • Xilnx XTMR
  • Mentor Graphics (soon to be available)
  • Will have a formal checker to validate RTL
    matches tool
  • Redundancy and Mitigation created via RTL
  • RTL implementation Best for design reviews
  • Difficult to implement and may be impractical

28
User Custom AFIFO GTMR Design and Address
Correction
  • Address Pointers are protected
  • VotersCorrection
  • 1 Voter per Address Bit
  • Voters Feed Memory Block
  • Memory Triplication is Optional

29
GTMR Example Non Redundant Flag Generation Logic
Path We Start Here!
Gray
USER non-Redundant Design
Binary
30
Full Flag Example Non-voted Asynchronous Domain
Crossings
No Feedback No Voter
31
Asynchronous Triggering Lagging or Leading
Circuitry
Downstream Voter Insertion
Too Late!
V O T I N G M A T R I X
FLAG Trigger
State Machine
Will Never Match
FLAG Trigger
LAG Behind other two
State Machine
FLAG Trigger
SEU
State Machine
  • Downstream Logic can get triggered in different
    clock cycles (skewed redundant chains)
  • SEU event will not get corrected if chains have
    skew

32
Asynchronous Flags and Voter Placement
Voter Placed downstream on feedback-only DFFs
flag is not mitigated
Voted Flag can Trigger logic at the same clock
cycle
State Machine starts early
Domains are out of Sync
All State Machines start
33
GTMR User Implementation Solution 1 Insert
Voters after Metastability Filter
Voters Will Correct out of sync Redundant String
34
GTMR User Implementation Solution 2 Insert
Voters after Flag DFFs
  • Voters can be inserted by hand or.
  • Voters can be instantiated by making the flag DFF
    have feedback (use of enable)

Voters Will Correct Out of sync Redundant String
35
Risk XTMR Clock Skew and Race Conditions
  • Example Red Clock domain (as seen by DFFs) has
    skew relative to other domains

Race condition on feedback path if its delay is
faster than clock skew
Tfb
Total Skew Tsk Sio Sroute Sint_max
Sio Skew Measured at Input Boundary Sroute
Skew of route from Input to Clock tree
buffer Sint_max Static Timing Analysis max Skew
36
Advantages of Custom Designed AFIFOs with GTMR
  • Addresses can be corrected with voters
  • Addresses will be synchronized for all three
    clock domains
  • FIFOs will always write to the same location
  • Availability of operation is prolonged due to
    internal correction capability
  • Design reviews are enhanced because actual AFIFO
    implementation can be evaluated in the RTL (VHDL
    or Verilog).

37
Conclusion
  • AFIFOs pass data from one clock domain to another
  • AFIFOs SEU susceptibility
  • Configuration can have several upsets per day
    (Xilinx)
  • Large amounts of data can be lost
  • AFIFO IP Cores have no internal correction and
    can be significantly susceptible to SEUs
  • Implementing a customized GTMR AFIFO design
    reduces SEU susceptibility and availability of
    operation
  • Asynchronous concerns must be addressed
  • Most tools will not handle these situations
  • Design reviewers must validate domain crossings

38
Thank You
  • NASA Electronics Parts and Packaging (NEPP)
  • Kenneth LaBel and NASA REAG
  • Chris Perez
  • Mark Friendlich
Write a Comment
User Comments (0)
About PowerShow.com