
Presentation transcript:

CPE 731, 4-Principles (5/18/2015)

Define and quantify dependability (1/3)

How do we decide when a system is operating properly?

Infrastructure providers now offer Service Level Agreements (SLAs) to guarantee that their networking or power service will be dependable.

Systems alternate between 2 states of service with respect to an SLA:
1. Service accomplishment, where the service is delivered as specified in the SLA
2. Service interruption, where the delivered service is different from the SLA

Failure = transition from state 1 to state 2
Restoration = transition from state 2 to state 1
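The two-state view above can be sketched in a few lines of Python. This is only an illustration, not part of the original slides: the names `ServiceState` and `classify_transitions` and the sample observation sequence are hypothetical.

```python
from enum import Enum


class ServiceState(Enum):
    ACCOMPLISHMENT = 1  # state 1: service delivered as specified in the SLA
    INTERRUPTION = 2    # state 2: delivered service differs from the SLA


def classify_transitions(states):
    """Label each change between consecutive observations.

    A move from state 1 to state 2 is a failure; a move from
    state 2 to state 1 is a restoration. Staying in the same
    state produces no event.
    """
    events = []
    for prev, curr in zip(states, states[1:]):
        if prev is ServiceState.ACCOMPLISHMENT and curr is ServiceState.INTERRUPTION:
            events.append("failure")
        elif prev is ServiceState.INTERRUPTION and curr is ServiceState.ACCOMPLISHMENT:
            events.append("restoration")
    return events


UP, DOWN = ServiceState.ACCOMPLISHMENT, ServiceState.INTERRUPTION

# Hypothetical sequence of periodic SLA checks.
observed = [UP, UP, DOWN, DOWN, UP, DOWN, UP]
events = classify_transitions(observed)
print(events)  # ['failure', 'restoration', 'failure', 'restoration']
```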

Define and quantify dependability (2/3)

Module reliability = measure of continuous service accomplishment (or time to failure). 2 metrics:
1. Mean Time To Failure (MTTF) measures reliability
2. Failures In Time (FIT) = 1/MTTF, the rate of failures, traditionally reported as failures per billion hours of operation

Mean Time To Repair (MTTR) measures service interruption
– Mean Time Between Failures (MTBF) = MTTF + MTTR

Module availability measures service as the system alternates between the 2 states of accomplishment and interruption (a number between 0 and 1, e.g. 0.9)

Module availability = MTTF / (MTTF + MTTR)
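A minimal Python sketch of these definitions follows. The function names are hypothetical, and the sample figures (a 1,000,000-hour MTTF and a 24-hour MTTR) are assumptions chosen only to exercise the formulas. Since FIT is reported per billion hours, the sketch scales 1/MTTF (failures per hour) by 10^9.

```python
HOURS_PER_BILLION = 1e9  # FIT is reported per 10^9 hours of operation


def fit_from_mttf(mttf_hours):
    """Failures In Time: the failure rate 1/MTTF, scaled to a billion hours."""
    return HOURS_PER_BILLION / mttf_hours


def mtbf(mttf_hours, mttr_hours):
    """Mean Time Between Failures = MTTF + MTTR."""
    return mttf_hours + mttr_hours


def availability(mttf_hours, mttr_hours):
    """Module availability = MTTF / (MTTF + MTTR), a number between 0 and 1."""
    return mttf_hours / (mttf_hours + mttr_hours)


# Assumed example values, not taken from the slides.
example_mttf = 1_000_000  # hours
example_mttr = 24         # hours

print(fit_from_mttf(example_mttf))               # 1000.0 FIT
print(mtbf(example_mttf, example_mttr))          # 1000024 hours
print(availability(example_mttf, example_mttr))  # roughly 0.999976
```

Note that availability depends only on the ratio of MTTF to MTTR, so shortening repairs improves it as much as lengthening the time to failure by the same factor.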

Example calculating reliability (3/3)

If modules have exponentially distributed lifetimes (the age of a module does not affect its probability of failure), the overall failure rate is the sum of the failure rates of the modules.

Calculate FIT and MTTF for 10 disks (1M hour MTTF per disk), 1 disk controller (0.5M hour MTTF), and 1 power supply (0.2M hour MTTF):

FailureRate = 10 × (1/1,000,000) + 1/500,000 + 1/200,000
            = (10 + 2 + 5) / 1,000,000
            = 17 / 1,000,000 failures per hour
            = 17,000 FIT

MTTF = 1,000,000,000 / 17,000 ≈ 59,000 hours
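The calculation this slide asks for can be checked with a short Python sketch. The variable names are hypothetical. It assumes, as the slide does, exponentially distributed lifetimes, and additionally that the failure of any single module counts as a system failure, so the rates simply add.

```python
# MTTF per module type in hours, and how many of each the system contains.
mttf_hours = {"disk": 1_000_000, "controller": 500_000, "power_supply": 200_000}
counts = {"disk": 10, "controller": 1, "power_supply": 1}

# Sum of the per-module failure rates (failures per hour).
system_failure_rate = sum(counts[name] / mttf_hours[name] for name in counts)

# FIT is the same rate expressed per billion hours of operation.
system_fit = system_failure_rate * 1e9

# System MTTF is the reciprocal of the system failure rate.
system_mttf = 1 / system_failure_rate

print(f"FIT  = {system_fit:,.0f}")          # FIT  = 17,000
print(f"MTTF = {system_mttf:,.0f} hours")   # MTTF = 58,824 hours
```

The system MTTF of roughly 59,000 hours is well below that of the weakest single module (the 200,000-hour power supply), because every added module contributes its own failure rate to the total.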
