Using Memory to Cope with Simultaneous Transient Faults Authors: Universidade Federal do Rio Grande do Sul Programa de Pós-Graduação em Engenharia Elétrica.

Slides:



Advertisements
Similar presentations
Embedded Systems Laboratory and Electrical Engineering Department Informatics Institute and Engineering School Federal University of Rio Grande do Sul.
Advertisements

PhD Student: Carlos Arthur Lang Lisbôa Advisor: Luigi Carro VLSI-SoC PhD Forum Low overhead system level approaches to deal with multiple and long.
Survey of Detection, Diagnosis, and Fault Tolerance Methods in FPGAs
Defect Tolerance for Yield Enhancement of FPGA Interconnect Using Fine-grain and Coarse-grain Redundancy Anthony J. YuGuy G.F. Lemieux September 15, 2005.
Baloch 1MAPLD 2005/1024-L Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan 1,2.
10/14/2005Caltech1 Reliable State Machines Dr. Gary R Burke California Institute of Technology Jet Propulsion Laboratory.
April 30, Cost efficient soft-error protection for ASICs Tuvia Liran; Ramon Chips Ltd.
Microprocessor Reliability
Fault Injection in Mixed-Signal Environment Using Behavioral Fault Modeling in Verilog-A Seyed‌ Nematollah Ahmadian, Seyed Ghassem Miremadi Behavioral.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
DSD 2007 Concurrent Error Detection for FSMs Designed for Implementation with Embedded Memory Blocks of FPGAs Andrzej Krasniewski Institute of Telecommunications.
NATW 2008 Using Implications for Online Error Detection Nuno Alves, Jennifer Dworak, R. Iris Bahar Division of Engineering Brown University Providence,
Self-Checking Carry-Select Adder Design Based on Two-Rail Encoding
Defect Tolerance for Yield Enhancement of FPGA Interconnect Using Fine-grain and Coarse-grain Redundancy Anthony J. Yu August 15, 2005.
Defect Tolerance for Yield Enhancement of FPGA Interconnect Using Fine-grain and Coarse-grain Redundancy Anthony J. Yu August 15, 2005.
Slide 1/20 Fault Tolerant Approaches to Nanoelectronic Programmable Logic Arrays Authors: Wenjing Rao, Alex Orailoglu, Ramesh Karri Conference: DSN 2007.
Logic Simulation 4 Outline –Fault Simulation –Fault Models –Parallel Fault Simulation –Concurrent Fault Simulation Goal –Understand fault simulation problem.
Carlos Arthur Lang Lisbôa, Luigi Carro, Erika Cota ETS 2005 RobOps - Arithmetic Operators for Future Technologies Future technologies, bellow 90nm, will.
4/20/2006ELEC7250: Alexander 1 LOGIC SIMULATION AND FAULT DIAGNOSIS BY JINS DAVIS ALEXANDER ELEC 7250 PRESENTATION.
Address comments to Robust FPGA Resynthesis Based on Fault-Tolerant Boolean Matching Yu Hu 1, Zhe Feng 1, Lei He 1 and Rupak Majumdar 2.
Carlos Arthur Lang Lisbôa, Luigi Carro DATE EDAA PhD Forum Dealing with Multiple Simultaneous Faults in Future Technologies INFORMÁTICA Universidade.
Embedded Systems Laboratory Informatics Institute Federal University of Rio Grande do Sul Porto Alegre – RS – Brazil SRC TechCon 2005 Portland, Oregon,
Carlos Arthur Lang Lisbôa, Erik Schüler, Luigi Carro SRC TechCon 2005 Dealing with Multiple Simultaneous Faults in Future Technologies in Future Technologies.
Carlos Arthur Lang Lisbôa, Erik Schüler, Luigi Carro SRC TechCon 2005 Dealing with Multiple Simultaneous Faults in Future Technologies in Future Technologies.
FPGA Defect Tolerance: Impact of Granularity Anthony YuGuy Lemieux December 14, 2005.
ED 4 I: Error Detection by Diverse Data and Duplicated Instructions Greg Bronevetsky.
Spring 07, Apr 17, 19 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Soft Errors and Fault-Tolerant Design Vishwani.
1 Efficient Analytical Determination of the SEU- induced Pulse Shape Rajesh Garg Sunil P. Khatri Department of ECE Texas A&M University College Station,
Roza Ghamari Bogazici University.  Current trends in transistor size, voltage, and clock frequency, future microprocessors will become increasingly susceptible.
Reduced Cost Reliability via Statistical Model Detection Jon-Paul Anderson- PhD Student Dr. Brent Nelson- Faculty Dr. Mike Wirthlin- Faculty Brigham Young.
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
Instituto de Informática and Dipartimento di Automatica e Informatica Universidade Federal do Rio Grande do Sul and Politecnico di Torino Porto Alegre,
IVEC: Off-Chip Memory Integrity Protection for Both Security and Reliability Ruirui Huang, G. Edward Suh Cornell University.
Reconfiguration Based Fault-Tolerant Systems Design - Survey of Approaches Jan Balach, Jan Balach, Ondřej Novák FIT, CTU in Prague MEMICS 2010.
SiLab presentation on Reliable Computing Combinational Logic Soft Error Analysis and Protection Ali Ahmadi May 2008.
THE TESTING APPROACH FOR FPGA LOGIC CELLS E. Bareiša, V. Jusas, K. Motiejūnas, R. Šeinauskas Kaunas University of Technology LITHUANIA EWDTW'04.
Title of Selected Paper: IMPRES: Integrated Monitoring for Processor Reliability and Security Authors: Roshan G. Ragel and Sri Parameswaran Presented by:
SET Fault Tolerant Combinational Circuits Based on Majority Logic
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs Ghazanfar (Hossein) Asadi and Mehdi B. Tahoori Why Soft Error Rate (SER) Estimation?
Hrushikesh Chavan Younggyun Cho Structural Fault Tolerance for SOC.
Copyright © 2010 Houman Homayoun Houman Homayoun National Science Foundation Computing Innovation Fellow Department of Computer Science University of California.
Evaluating Logic Resources Utilization in an FPGA-Based TMR CPU
Eduardo L. Rhod, Álisson Michels, Carlos A. L. Lisbôa, Luigi Carro ETS 2006 Fault Tolerance Against Multiple SEUs using Memory-Based Circuits to Improve.
1 Advanced Digital Design Reconfigurable Logic by A. Steininger and M. Delvai Vienna University of Technology.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
IPR: In-Place Reconfiguration for FPGA Fault Tolerance Zhe Feng 1, Yu Hu 1, Lei He 1 and Rupak Majumdar 2 1 Electrical Engineering Department 2 Computer.
Paper by F.L. Kastensmidt, G. Neuberger, L. Carro, R. Reis Talk by Nick Boyd 1.
Defect-tolerant FPGA Switch Block and Connection Block with Fine-grain Redundancy for Yield Enhancement Anthony J. YuGuy G.F. Lemieux August 25, 2005.
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
Winter Semester 2010 ”Politehnica” University of Timisoara Course No. 5: Expanding Bio-Inspiration: Towards Reliable MuxTree  Memory Arrays – Part 2 –
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
Fault-Tolerant Resynthesis for Dual-Output LUTs Roy Lee 1, Yu Hu 1, Rupak Majumdar 2, Lei He 1 and Minming Li 3 1 Electrical Engineering Dept., UCLA 2.
Robust FPGA Resynthesis Based on Fault-Tolerant Boolean Matching
CFTP ( Configurable Fault Tolerant Processor )
MAPLD 2005 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan Dr. V. Kamakoti.
nZDC: A compiler technique for near-Zero silent Data Corruption
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Computer Architecture & Operations I
MAPLD 2005 BOF-L Mitigation Methods for
M. Aguirre1, J. N. Tombs1, F. Muñoz1, V. Baena1, A. Torralba1, A
Arithmetic Operators Robust to Multiple Simultaneous Upsets
Approximate Fully Connected Neural Network Generation
Sequential circuits and Digital System Reliability
Soft Error Detection for Iterative Applications Using Offline Training
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs
Avidan Efody, Mentor Graphics Corp.
Hardware Assisted Fault Tolerance Using Reconfigurable Logic
Fixed-point Analysis of Digital Filters
Seminar on Enterprise Software
Presentation transcript:

Using Memory to Cope with Simultaneous Transient Faults Authors: Universidade Federal do Rio Grande do Sul Programa de Pós-Graduação em Engenharia Elétrica Eduardo L. Carlos A. L. Luigi

2 The Problem Due to the technology scaling, future (an actual) technologies will be heavily influenced by electromagnetic noise causing SEU and SET inducted errors; The ocurence of multiple SEU and SET, which was not a problem in the past, must have to be considered; We must guarantee robustness at lowest cost; Some usual protection techniques like TMR and N-MR might not work properly;

3 Motivations Memory comes with intrinsic protection against manufacturing errors (spare columns and spare rows); There are protection techniques with low area and latency overhead like Reed Solomon that can be applied;

4 Our Proposal Use Reed-Solomon protected memory to replace combinational circuit; Reducing the area sensible to faults; Reducing the SER (soft error rate) of the circuit;

5 Outline Case Studies; Results; Conclusions; Future Work.

6 Replacing Combinational Circuit by Memory (ROM memory) Example: 4x4 bit multiplier -Fully combinational: Total area = 304 transistors Fully memory: Memory Input A Input B 4 4 result 8 Total area = 2,048 transistors considering 1 transistor per bit 8 inputs and 8 outputs 2 8 x 8 = 2,048 bits EXPENSIVE X

7 Replacing Combinational Circuit by Memory (ROM memory) Example: 4x4 bit multiplier -Fully combinational: Total area = 304 transistors Let’s Replace just some part of the circuit !!! 1 column Area cost = 512 transistors Latency = 7 cycles Memory 512 bits x 4 = 512 bits 7 inputs and 4 outputs

8 Case Studies 4x4 bit multiplier Two memory based solutions were proposed: Column multiplier; Line multiplier; These two solutions were compared with the TMR and N-MR techniques.

9 Case Studies 4 taps 8 bit FIR Filter Memory based solution compared with the combinational one

10 Case Studies 4x4 bit multiplier -Column Solution Protected by RS code Sensitive to Faults

11 Case Studies 4x4 bit multiplier - Line Solution Sensitive to Faults Protected by RS code

12 Memory With coef. Input 1 Input 2 Input 3 Input 4 Result Case Studies 8-bits FIR Filter with 4 taps Just using memory: Memory size 2 4*8 x 18 = 77 Gb Memory + comb sol.: Memory size 2 4 x 10 = 160 bits Latency = 8 cycles Sensitive to faults Protected by RS code

13 Fault Injection Process Fault injection Steps: Run the circuit fault free with the 1st input; Run the circuit with “single event level 0” at the 1st gate; Compare the fault free and the “single event level 0” results to detect if the fault have propagated; Run the circuit with “single event level 1” at the 1st gate; Compare the fault free and the “single event level 1” results to detect if the fault have propagated; Repeat the process for all gates; Repeat the process for all inputs; Repeat the process for double faults;

14 Results CircuitTotal Area # of gates that fail Latency (ns) Fault rate (%) Proportional fault rate (%) 5-MR TMR Combinational Column Line x4 Bit Multiplier Fault Rate Results for SINGLE Fault Injection 3 x 7 x 2 x more area The voter Is too big

15 Results CircuitTotal Area # of gates that fail Latency (ns) Proportional fault rate (%) 5-MR TMR Combinational Column Line x4 Bit Multiplier Fault Rate Results for DOUBLE Fault Injection 5 x 13 x 2 x more area 2 x 5 x The voter Is too big 4 x more area

16 Results CircuitTotal Area# of gates that fail Latency (ns) Proportional fault rate (%) Combinational Memory FIR Filter Fault Rate Results for SINGLE Fault Injection 3.5 x less area 18 x CircuitTotal Area# of gates that fail Latency (ns) Proportional fault rate (%) Combinational Memory FIR Filter Fault Rate Results for DOUBLE Fault Injection 3.5 x less area 22.5 x

17 Conclusions This work showed that replacing combinational circuit by memory based circuit can be used to improve circuit reliability against single and double faults, with some penalties in area and computational time; The presented technique, permits different memory based solutions with different costs and gains; Results showed that 5-MR technique may not work as expected.

18 Future Work Implement this technique using magnetic memory (no area overhead); Test the presented approach with different case studies; Develop a tool that chooses between different memory based solutions, which best fit for each application; Implement this technique to develop a memory based processor.

19 Thank You !!! Questions ??? s: Eduardo L. Carlos A. L. Luigi

20 Fault Injection Process Tools: 4x4 bit multiplier Caco-ps – Cycle Accurate Configurable Power Simulator - combinational; - column; - line; Synthesized solutions* (for more than 100 gates failing): - TMR; - 5-MR; FIR Filter - combinational; - memory based; *using Altera FPGA EP20K200EFC484-2X.