Download presentation
Presentation is loading. Please wait.
Published byBeatrice Gardner Modified over 9 years ago
1
Electronic Parts Engineering Ramin Roosta Jet Propulsion Laboratory Office 514 Xilinx SRAM Based FPGA Testing, Testability, and Reliability Issues New Electronic Technologies and Insertion into Flight Programs Workshop January 30- February 1, 2007 at NASA/GSFC in Greenbelt, MD
2
Electronic Parts Engineering 2-1-2007 Ramin Roosta 2 ● FPGA Testing FPGA Test Goals FPGA Testing Phases Why FPGA Testing and Testability Analysis is Difficult? FPGA testing approach requirements ● Virtex Product Test Flow ● Application-independent Testing Problems of Application-independent Testing ● Application Dependent Testing Interconnect Testing Configurable Logic Blocks (CLB) Testing The fault models in FPGA testing Related works ● Deep Submicron process and its effects on Testing ● FPGA (Virtex-4 Power Reduction) ● FPGA Reliability Analysis/Concerns ● Conclusion Table of Contents
3
Electronic Parts Engineering 2-1-2007 Ramin Roosta 3 FPGA Test Goals ● Fully verify all parameters and the functionality of all features and resources to ensure full compliance with the data sheet ● Product features are fully characterized across temperature and voltages, with key parameters measured to guarantee performance
4
Electronic Parts Engineering 2-1-2007 Ramin Roosta 4 FPGA Testing Phases ● Design verification phase: behavioral simulation, static timing simulation (analysis), post layout functional and timing simulation, back-annotated testing and prototyping testing. (Relies heavily on design automation tools, such as simulation, logic/physical synthesis, and place & route tools). How Accurate are these tools? ● Production phase test: includes screening tests such as; burn in test, functional test, fault coverage analysis, internal speed test, at speed test, external speed test including verifications of set up/hold and delay characteristics of the IC, IO level test, and finally analog parametric tests including gain, noise, delay, time constants, precision and margins.
5
Electronic Parts Engineering 2-1-2007 Ramin Roosta 5 Why FPGA Testing and Testability Analysis is Difficult? ● To Xilinx the FPGA looks like an ASIC. To the consumers it is an FPGA. This distinction should be kept in mind when testing the device ● Today's FPGAs are practically “System on a Chip”, thus testing the chip thoroughly is a daunting task, especially without the benefit of DFT ● Re-programmability of Xilinx FPGAs should be used to make several (different) images to test specific resource(s) of the FPGA
6
Electronic Parts Engineering 2-1-2007 Ramin Roosta 6 FPGA Testing Approach Requirements ● Test methodology must be generic, uniform and application independent ● Test methodology must be scalable and independent of array size ● Test methodology must be reusable and lend itself to automation ● Test methodology must be must have measurable test quality metrics Source[1]
7
Electronic Parts Engineering 2-1-2007 Ramin Roosta 7 Xilinx Test Flow
8
Electronic Parts Engineering 2-1-2007 Ramin Roosta 8 ● I/O testing Opens and Shorts Icc and Leakage I/O Parametric ● Functional tests CLB Test BRAM memory test Configuration memory test ● Router Driven Test Methods ● Layout Driven Metal Test Methods ● Speed tests FPGA Functional Test Descriptions
9
Electronic Parts Engineering 2-1-2007 Ramin Roosta 9 FPGA Architecture (Virtex-II Fabric)
10
Electronic Parts Engineering 2-1-2007 Ramin Roosta 10 Every instance of LUT, SELRAM, Flip-Flops, TBUF, BRAM, DCM, Global Clocks, Carry, etc are tested FPGA Architecture (Virtex-II Fabric)
11
Electronic Parts Engineering 2-1-2007 Ramin Roosta 11 Virtex-4 Architecture
12
Electronic Parts Engineering 2-1-2007 Ramin Roosta 12 Testing the Slice Using Serial Shift Register ● Easy to understand and document ● Quick diagnostics of a failure ● Consistency across the array ● Few I/Os required for test environment
13
Electronic Parts Engineering 2-1-2007 Ramin Roosta 13 BlockRAM Memory
14
Electronic Parts Engineering 2-1-2007 Ramin Roosta 14 Testing Configuration Memory ● Readback Process of reading back the contents of configuration memory ● Four Readback test patterns for Configuration Memory All Zeros for Stuck-At-1 All ones for Stuck-At-0 Checkerboard for Coupling (AND/OR) Inverted Checkerboard for Coupling (AND/OR)
15
Electronic Parts Engineering 2-1-2007 Ramin Roosta 15 ● Address Fault (AF) Caused by defects in the address lines and address decoder ● Stuck-at Fault (SAF) The logic value of a stuck-at memory cell is always 0 or 1 ● Transition Fault (TF) A faulty cell or line with a rising (falling) transition fault fails to undergo a 0-1 (1-0) transition when written ● Stuck Open Fault (SOF) The word line retains the previous value when certain cells are accessed (typically, an open in word line access transistors) ● Coupling Fault (CF) Shorts and crosstalk between memory cells or lines Idempotent (forces a cell), Inversion (flips a cell), or Bridging (AND/OR) ● Passive Neighborhood Pattern Sensitive Fault (PNPSF) The contents of a memory cell cannot be changed due to a certain neighborhood pattern Memory Fault Models
16
Electronic Parts Engineering 2-1-2007 Ramin Roosta 16 Router Driven Test Method -Patterns routed using same software as customer (Xilinx PAR) -Most Favorable Interconnect routed first (not all interconnects routed equal) -Utilization of interconnect is <3% for a single design, customer or test -99% interconnect coverage when compared to customer design utilization Test Pattern generation flow - Input routed design to PAR - Route the design using the PAR - Routing information enter DB -PAR references DB to route next design
17
Electronic Parts Engineering 2-1-2007 Ramin Roosta 17 Configurable Logic Block Tile
18
Electronic Parts Engineering 2-1-2007 Ramin Roosta 18 Routing Phases
19
Electronic Parts Engineering 2-1-2007 Ramin Roosta 19 FPGA Routing Resources ● The nature and availability of “Routing Resources” ultimately dictates the interconnect scenarios within the FPGA. ● Interconnects is a major impediment to the performance/Power consumption -Wires consume power, threatening chip performance. Main routing components are : Wire segment Switching Matrix Low-skew (clock) Low-skew distribution IEEE Spectrum - June 2006
20
Electronic Parts Engineering 2-1-2007 Ramin Roosta 20 FPGA Test Coverage Claims (Xilinx) ● Features Coverage 100% of FPGA features are tested Every instance of LUT RAM, Flip-Flops, Carry, Tbuf, BlockRAM, DCM, etc. are tested ● Interconnect Coverage Overall Interconnect Coverage is > 99.7% For Customer Designs, Coverage is > 99.9% Interconnect is SAF and TF coverage by pattern construction, structured (BIST) method ● Today’s (Xilinx) test program contains 1800+ test configurations ● Zero customer returns related to missing interconnect coverage CF coverage is in development
21
Electronic Parts Engineering 2-1-2007 Ramin Roosta 21 Layout Driven Metal Test (LDMT) ● Test Metal Lines based on the physical Layout ● Attach test logic to each line Proven to be very effective to detect metal short
22
Electronic Parts Engineering 2-1-2007 Ramin Roosta 22 The Fault Models in FPGA Testing ● Bridging Fault A short between a group of signals The logic value of the shorted 1-dominant (OR bridge) 0-dominant (AND bridge) Indeterminate ● Stuck-at Fault A fixed (0 or 1) value to a signal line in the circuit single stuck-at faults: Most popular form (classical fault model) ● Delay Fault The fault by the combinational delay of a circuit to exceed clock period
23
Electronic Parts Engineering 2-1-2007 Ramin Roosta 23 The Functional Defects ● Interconnect defect Modeled by Bridging faults and/or stuck-at faults ● CLB defect A faulty CLB can be detected through “the functional test” of the CLB. ● IOB defect The information exchange with other components in the system may not be possible or reliable CLB: configurable logic block, IOB: input/output block
24
Electronic Parts Engineering 2-1-2007 Ramin Roosta 24 Problems of Application-independent Testing ● Low efficiency in detecting timing-related faults It is impossible to test even a small fraction of “all possible” interconnection patterns that may occur in the user-defined configurations ● The decreased yield of FPGA vendors Some defected chips are used in some designs The defected resources are not used by the designs
25
Electronic Parts Engineering 2-1-2007 Ramin Roosta 25 Application (Specific) dependent Testing ● Only resources used by a specific configuration (Design) is tested ● Avoids the disadvantage of the application-independent FPGA testing ● Time (to test) saving ● The increased yield of FPGA vendors ● Xilinx uses this approach “A Dynamic Platform for Reliability and Environmental Test of Re-programmable Xilinx Virtex-II 3000 FPGA”; Ramin Roosta, Ph.D. et al, Electronic Parts Engineering, NASA/JPL ( Sponsored By NASA Electronic Parts and Packaging Program (NEPP))
26
Electronic Parts Engineering 2-1-2007 Ramin Roosta 26 Multi-Configuration Strategy (MCS) The MCS have three test configurations: ● Interconnect testing (2) -All the (built-in) LUTs in the used CLBs are reconfigured to implement logic “AND” or logic “OR” functions ( All-0/1 pattern test vectors at the PIs) -All the flip-flops in the application need to be preset to value “1” or “0” ● CLB Testing (1) Reprogramming the interconnect network and make each used CLB controlled by the primary inputs (PIs) MCS: Multi-configuration strategy, CLB: configurable logic block, LUT: Look-Up Table
27
Electronic Parts Engineering 2-1-2007 Ramin Roosta 27 General Model of the Interconnect Test Configuration
28
Electronic Parts Engineering 2-1-2007 Ramin Roosta 28 An Example of Interconnect Testing
29
Electronic Parts Engineering 2-1-2007 Ramin Roosta 29 Original Application Configuration
30
Electronic Parts Engineering 2-1-2007 Ramin Roosta 30 Modified Configuration
31
Electronic Parts Engineering 2-1-2007 Ramin Roosta 31 ● Law of physics: leakage current increases as channel and gate oxide thickness decrease Xilinx Triple-Oxide Technology (90nm) ● Two oxide thicknesses are commonly used - Thin oxide in the fast core logic - Thick oxide in the versatile I/O ● Virtex-4 adds a third medium thickness oxide to reduce leakage current without compromising performance Deep Submicron process and its effects on Testing
32
Electronic Parts Engineering 2-1-2007 Ramin Roosta 32 Nanometer-scale CMOS technologies Challenges ● Modeling, simulation and verification of system components ● Accurate prediction of timing and powerdissipation ● Design robustness and fault tolerance in the presence of highly unpredictable device behavior ● Some physical design issues such as floor planning and routing give rise to challenges in system timing and signal integrity
33
Electronic Parts Engineering 2-1-2007 Ramin Roosta 33 Nanometer-scale CMOS technologies Issues ● Design size and complexity ● Timing based on signal integrity and IR drop ● IR Drop ● Crosstalk and Inductance ● Electro-migration ● Digital/Analog Integration ● Power consumption ● System signal transmission ● Manufacturing rules ● Yield optimization
34
Electronic Parts Engineering 2-1-2007 Ramin Roosta 34 Deep Submicron process and its effects on Testing ● Increased variability (on chip) ● Decreased reliability ● Leakage Current, Power Consumption ● Loss of operating margin ● Junction Temperature and Thermal Issues (Thermal Run away) ● Signal Integrity Issues (caused by faster I/O) at Chip/Board Level ● Design Entry and Power prediction tools’ Accuracy
35
Electronic Parts Engineering 2-1-2007 Ramin Roosta 35 Static Power variation & Saving ● At 90nm process technology static power becomes the dominant power factor (I/O’s are drawing minimal power) - Some FPGAs offer a lower power mode feature that disables the I/O putting it into a sleep mode that further reduces static power ● Static Power from leakage increases exponentially with temperature -Proportional to voltage (0.3 VCCINT /1.2) -Increases exponentially due to source → Drain leakage -At 1.26V static power for V CCINT is ~20% higher than at 1.2V (Try to use V CCINT close to 1.2V) -Keep junction temperature as low as possible ● Static power scales linearly with part size -Use smallest part to reduce leakage (Lx60 has 40% less leakage power than LX100) ● Static power is increased with process variation [V T and gate length (2.5x)] -Look at worst case and typical at a given temperature
36
Electronic Parts Engineering 2-1-2007 Ramin Roosta 36 Dynamic Power Variation & Saving Dynamic power consumption (and performance) is very sensitive to switched capacitance, (mainly routing capacitance in Xilinx FPGAs ) -Dynamic Power = N*CV 2 f N = Number of nodes switching C = Capacitive load V = Voltage swing f = Switching rate – Dynamic power varies linearly with frequency ● Tighten VCCINT and run at center of range or down to 5% below center (Reduces leakage by better than 10% over Run at 1.2 V vs. 1.26V) ● Run non-critical functions with a low speed clock (rather than an arbitrary high speed clock present in the design). ●
37
Electronic Parts Engineering 2-1-2007 Ramin Roosta 37 Overall Power Minimization in Virtex-4 ● Power Minimization fall into a few areas - Static & Dynamic Power (Adjustment to operating environment) -Design Code Optimization -Interconnect transistors --Bump up performance target for XST router (maybe able to gain 5-10% power improvement --Minimize path length (capacitance and power is lowered) --Minimize interconnect hops (capacitance and power is lowered) --Interconnect capacitance --Use a Relationally Placed Macros (RPM) or other placement method to guide tighter placement and help reduce routing length, especially on repeated macros --Number of nodes switching into a capacitive load --Minimize logic levels, Try to pack logic (if possible) --Clocks Driving Loads (Use BUFGMUX), reduces switching at target flip-flops, a common practice in ASIC design
38
Electronic Parts Engineering 2-1-2007 Ramin Roosta 38 FPGA Reliability Analysis/Concerns ● Transient errors due to complexity and feature size reduction in FPGAs thru redundancy based techniques ● The rate of degradation due to the accelerated aging phenomena is dependent on: Supply voltage, temperature, switching activity, and leakage currents ● Impact of different aging phenomenon resulting in permanent failures of the FPGAs’ components/interconnect circuitry such as; TDDB (reduces as gate leakage increases), Impact on HCE (as function of switching activities) and EM (interconnect) ● Aging impact of TDDB, EM and HCE on Xilinx style (SRAM Based) FPGAs using a set of benchmarks show that a significant portion of the FPGA resources (LUTs) may fail in the first 3 to 5 years of operation (commercial) [8]
39
Electronic Parts Engineering 2-1-2007 Ramin Roosta 39 Conclusion (FPGA Testing) ● Multi-Configuration Strategy (MCS) provides a simple way to perform the interconnect and CLB testing in the application- dependent testing ● FPGAs are really SOCs requiring some DFT to be built in ● Use BIST for Memory (MBIST) Megacells (ROMs, RAMs, FIFO) ● Imbedded Scan to improve manufacturability ● I ddq measurement as a substitute or complementing Burn-In ● Signal Integrity related issues will dominate ● Power consumption, Junction Temperature, Thermal Runaway ● Some Design Code & Place/Route Optimization is required ● Leave plenty of Margin
40
Electronic Parts Engineering 2-1-2007 Ramin Roosta 40 References [1] M. B. Tahoori, E. J. McCluskey, M. Renovell, P. Faure, “A Multi-Configuration Strategy for an Application Dependent Testing of FPGAs,” Proc. VLSI Test Symp., 2004. [2] M. B. Tahoori, “Application-Dependent Testing of FPGA Interconnects,” Proc. Int’l Symp. On Defect and Fault Tolerance, 2003. [3] C. Jordan, W. P. Marnane, “Incoming inspection of FPGAs”, Proc. European Test Conf. pp. 371-377, 1993. [4] W. K. Huang, F. J. Meyer, X.-T. Chen, F. Lombardi, “Testing Configurable LUT-Based FPGAs,” IEEE Trans. on VLSI Systems, pp. 276-283, June 1998. [5] M. Abramovici, C. Stroud, “BIST-Based Detection and Diagnosis of Multiple Faults in FPGAs,” Proc. of Int’l Test Conf., 2000. [6] A. Krasniewski, “Application-Dependent Testing of FPGA Delay Faults,” Proc. 25th EUROMICRO Conf., vol. 1, pp. 260-267, 1999. [7] Das, N. A. Touba, “A Low Cost Approach for Detecting, Locating, and Avoiding Interconnect Faults in FPGA-Based Reconfigurable Systems,: Proc. of Int’l Conf. On VLSI Design, 1999. [8] S. Srinivasan, N. Vijaykrishnan,K. Sarpatvari, “ FLAW: FPGA Lifetime Awareness”,DAC 2006, July 24-28, 2006, San Francisco, [9] S. Mahapatra, V. R. Rao, B. Cheng, M. Khare, C. D. Parikh, J. C. S. Woo and J. M. Vasi. “Performance and hot-carrier reliability of 100 nm channel length jet vapor deposited Si3N4 MNSFETs” IEEE Transactions on Electron Devices, vol.48, (no.4), April 2001. pp 679-84. [10] S. M. Alam, C. L. Gan, D. E. Troxel, and C. V. Thompson “Circuit-Level Reliability Analysis of Cu Interconnects” In Proceedings of International Symposium on Quality Electronics Design (ISQED), 2004. [11] J. Srinivasan, S. V. Adve, P. Bose and J. A. Rivers, “The Impact of Technology Scaling on Lifetime Reliability ” In Proceedings of International Conference on Dependable Systems and Networks (DSN), 2004. [12] X. Xuan, A. Chatterjee, and A. D. Singh “Local Redesign for Reliability of CMOS Digital Circuits Under Device Degradation” In proceedings of International Reliability Physics Symposium (IRPS), 2004. [13] F. N. Najm “Transition density, a stochastic measure of activity in digital circuits” In Proceedings of Annual ACM IEEE Design Automation Conference, 1991. [14] J. H. Anderson, F. Najm, and T. Tuan. “Active leakage power optimization for FPGAs,” In Proceedings of ACM/SIGDA International Symposium on Field-programmable gate arrays, 2004. [15] “Critical Reliability Challenges for the International Technology Roadmap for Semiconductors” In International Sematech Technology transfer 03024377A-TR, 2003. [16] S. Srinivasan, A. Gayasen, N. VijayKrishnan and T. Tuan “Leakage control in FPGA routing fabric” In Proceedings of Asia-Pacific Design Automation Conference (ASPDAC), 2005.
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.