Reliability & Maintainability Engineering An Introduction Robert Brown Electrical & Computer Engineering Worcester Polytechnic Institute.

Slides:



Advertisements
Similar presentations
Chapter 12 Maintainability and Availability introduction Availability analysis of the system requires a knowledge: (1)How the components are functional.
Advertisements

Design Constraints For Engineering Projects
These slides are designed to accompany Software Engineering: A Practitioner’s Approach, 6/e (McGraw-Hill 2005). Slides copyright 2005 by Roger Pressman.1.
© 2009 EMC Corporation. All rights reserved. Introduction to Business Continuity Module 3.1.
Software Reliability Engineering
Software Quality Assurance (SQA). Recap SQA goal, attributes and metrics SQA plan Formal Technical Review (FTR) Statistical SQA – Six Sigma – Identifying.
SMJ 4812 Project Mgmt and Maintenance Eng.
James Ngeru Industrial and System Engineering
Introduction to Maintainability
Overview Lesson 10,11 - Software Quality Assurance
1 ITC242 – Introduction to Data Communications Week 12 Topic 18 Chapter 19 Network Management.
Testing Metrics Software Reliability
1 Review Definition: Reliability is the probability that a component or system will perform a required function for a given period of time when used under.
THE MANAGEMENT AND CONTROL OF QUALITY, 5e, © 2002 South-Western/Thomson Learning TM 1 Chapter 13 Reliability.
Quality Control and Improvement, Reliability, Liability P. King/Chapter 13 overview.
3. Software product quality metrics The quality of a product: -the “totality of characteristics that bear on its ability to satisfy stated or implied needs”.
BPT2423 – STATISTICAL PROCESS CONTROL.  Fundamental Aspects  Product Life Cycle Curve  Measures of Reliability  Failure Rate, Mean Life and Availability.
1 Product Reliability Chris Nabavi BSc SMIEEE © 2006 PCE Systems Ltd.
Slide 1 Availability Management. Slide 2 Goal – Primary Objective To optimise the capability of the IT infrastructure and supporting organisations to.
Software Project Management
4. Dynamic reliability models Objectives Be able to find reliability of series, parallel, stand-by and shared load parallel systems, when the reliabilities.
Reliability-with-Repair.PPT Reliability with Repair Last revised 12/06/2006.
System Testing There are several steps in testing the system: –Function testing –Performance testing –Acceptance testing –Installation testing.
 The software systems must do what they are supposed to do. “do the right things”  They must perform these specific tasks correctly or satisfactorily.
1 Fault Tolerance in the Nonstop Cyclone System By Scott Chan Robert Jardine Presented by Phuc Nguyen.
Reliability Engineering
Unit 8 Syllabus Quality Management : Quality concepts, Software quality assurance, Software Reviews, Formal technical reviews, Statistical Software quality.
Chapter 2: Non functional Attributes.  It infrastructure provides services to applications  Many of these services can be defined as functions such.
© 2002 Eaton Corporation. All rights reserved. Designing for System Reliability Dave Loucks, P.E. Eaton Corporation.
Software Project Management Lecture # 10. Outline Quality Management (chapter 26)  What is quality?  Meaning of Quality in Various Context  Some quality.
Chapter 9 Testing the System Shari L. Pfleeger Joann M. Atlee
Reliability Management Benbow and Broome (Ch 1, 2, and 3)
Lean Six Sigma: Process Improvement Tools and Techniques Donna C. Summers © 2011 Pearson Higher Education, Upper Saddle River, NJ All Rights Reserved.
Software Project Management Lecture # 11. Outline Quality Management (chapter 26 - Pressman)  What is quality?  Meaning of Quality in Various Context.
Ch. 1.  High-profile failures ◦ Therac 25 ◦ Denver Intl Airport ◦ Also, Patriot Missle.
Building Dependable Distributed Systems Chapter 1 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
4-1Product and Service Design CHAPTER 4 Product and Service Design McGraw-Hill/Irwin Operations Management, Eighth Edition, by William J. Stevenson Copyright.
Network Management. Network management means monitoring and controlling the network so that it is working properly and providing value to its users. A.
Failures and Reliability Adam Adgar School of Computing and Technology.
DESIGN FOR XXX Design Strategies. Design for X Design Strategies 1/4/ Part shape strategies:  adhere to specific process design guidelines  if.
Unit-3 Reliability concepts Presented by N.Vigneshwari.
Chapter - Software Engineering Fail safe design problems Component jams Operator detected failure Erroneous input Unsafe modes Programming errors Sabotage.
Introduction to Reliability Engineering
Maintainance and Reliability Pertemuan 26 Mata kuliah: J Manajemen Operasional Tahun: 2010.
Software Engineering Lecture 8: Quality Assurance.
Stracener_EMIS 7305/5305_Spr08_ Systems Availability Modeling & Analysis Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7305/5305.
CS203 – Advanced Computer Architecture Dependability & Reliability.
 Software reliability is the probability that software will work properly in a specified environment and for a given amount of time. Using the following.
Intelligent Maintenance Program
The Troubleshooting Process. Hardware Maintenance Make sure that the hardware is operating properly.  Check the condition of parts.  Repair or replace.
PRODUCT RELIABILITY ASPECT RELIABILITY ENGG COVERS:- RELIABILITY MAINTAINABILITY AVAILABILITY.
Module 13 Reliability 1. Key Dimensions of Quality Performance – primary operating characteristics Features – “bells and whistles” Reliability – probability.
LOG 211 Supportability Analysis “Reliability 101”
Software Metrics and Reliability
Hardware & Software Reliability
Fault Tolerance & Reliability CDA 5140 Spring 2006
Software Reliability Definition: The probability of failure-free operation of the software for a specified period of time in a specified environment.
Software Reliability PPT BY:Dr. R. Mall 7/5/2018.
Software Reliability: 2 Alternate Definitions
Reliability and Maintainability
Reliability.
Introduction to Fault Tolerance
Quality Measurable characteristic Cyclomatic complexity Cohesion
QPM delivers value over the life cycle of the product
THE MANAGEMENT AND CONTROL OF QUALITY, 5e, © 2002 South-Western/Thomson Learning TM 1 Chapter 13 Reliability.
RELIABILITY Reliability is -
AVAILABILITY PERFORMANCE
Design Constraints For Engineering Projects
Definitions Cumulative time to failure (T): Mean life:
Presentation transcript:

Reliability & Maintainability Engineering An Introduction Robert Brown Electrical & Computer Engineering Worcester Polytechnic Institute

Reliability / Maintainability *-ability Reliability – Probability that a device will operate within specifications for the specified duration Reliability – Probability that a device will operate within specifications for the specified duration Maintainability – Amount of resources required to remove faults Maintainability – Amount of resources required to remove faults –Diagnose-ability – The ability to locate faults –Repair-ability – The ability to remove faults Availability Probability that a device is operationally ready

Why is it important? Reliability & Maintainability Engineering Mission Success - The product correctly operates for the duration of the mission Mission Success - The product correctly operates for the duration of the mission Customer Satisfaction – Perceived product quality Customer Satisfaction – Perceived product quality –Brand loyalty, customer loyalty Competitive Products – Market requirements Competitive Products – Market requirements Safety Safety Support Legal & Contractual Requirements Support Legal & Contractual Requirements Reduce Cost of Ownership Reduce Cost of Ownership –Reduce Product Cost – Warranty –Reduce Service Cost – Service delivery cost –Insurance

Bathtub Curve Failures over Time InfancySteady StateWearout Failure Rate Time Workmanship Contamination Burn-In Quality Control Random Failures Overload Improper Usage Thermal Fatigue Oxidation Cracking-Shrinking Friction Wear Warranty Period

Product Life Cycle Activities Reliability & Maintainability DesignDevelopDeployManage Identify competitive market requirements Design for reliability Failure analysis Reliability predictions Reliability testing Maintainability testing Develop service strategy Ongoing reliability testing Engineering changes Maintenance releases End of product life End of service life Logistics strategy

Reliability Developing reliable products Quality materials Quality materials Stability – mechanical / electrical Stability – mechanical / electrical Design for anticipated load Design for anticipated load Design for operating environment Design for operating environment Cooling Cooling Redundancy Redundancy –Full functionality –Reduced functionality Self-healing Self-healing

Series and Parallel Predicting System Reliability System requires device “A” AND device “B” to operate R S = R 1 * R 2 * R n Series R S = 0.8 * 0.9 = 0.72 System requires device “A” OR device “B” to operate R S = 1 – ( 1 - R 1 ) * ( 1 – R N ) Parallel R S = 1 – ( ) * ( ) R S =

Complex Configuration Predicting System Reliability System includes series and parallel device configuration R S = R 1 * R 2 * R n R S = R A * R BC R S = 0.8 * 0.98 = R S = 1 – ( 1 - R 1 ) * ( 1 – R N ) R BC = 1 – ( ) * ( ) R BC = 0.98 A B C ABC

Reliability Metrics Reliability Metrics Reliability is specified and measured in several ways Failure Rate  Count of failures during a time period.  Usually 10 6 hours MTBF or MTTF Reliability %  Mean Time Between Failures  Mean Time To Fail  Stated in hours  MTBF =  MTBF = 1 /  Usually MTBF =  Usually MTBF = 10 6 /  R(p) =  e - t  t = Mission time

Reliability Reliability Calculating MTBF Component Duty CycleComponent Failure Rate  100 %CPU   100 %Power Supply  20 %Floppy  MTBF = hrs= Note: in 10 6 Hours  Total 1 =

Maintainability MTTR – Mean Time To Repair MTTR – Mean Time To Repair Measured in hours Measured in hours Major activities Major activities –Diagnoses –Repair Focus on labor and materials Focus on labor and materials Resources required to remove faults

Diagnose-ability Developing diagnose-able products User and service documentation, tools and training User and service documentation, tools and training Operator indicators and error messages. Preserved state indicators accessed by repair depot. Operator indicators and error messages. Preserved state indicators accessed by repair depot. Test points Test points Device error and state change logging Device error and state change logging Environmental measurements Environmental measurements Diagnostic and failure detection tools Diagnostic and failure detection tools –Local and remote diagnosis –“Call Home” –Designed for technical and non-technical users Product labeling and revision management Product labeling and revision management Quickly and Correctly Locate the Fault

Repair-ability Developing repair-able products Repair or replace? Repair or replace? Remote and On-site repair? Remote and On-site repair? –Firmware, software, micro-code Limit specialized tools Limit specialized tools Limit adjustments Limit adjustments Hot-Swap Hot-Swap Larger, functionally dense sub-assemblies Larger, functionally dense sub-assemblies Physical layout Physical layout –Easier to replace less reliable components –Minimal disruption to replace a component Quickly Remove the Fault

Economics Reliability & Maintainability Population (units)20,000 Reliability MTBF Hr10,00015,00010,00015,000 Repair cost$ $ Duty Cycle 12/24 Day = 50% 50 % Operating Time (12*365*20000) 87,600,000 Repair Calls ( /MTBF) 8,7605,8408,7605,840 Service Cost (Calls*Cost) $ 219,000$ 146,000$ 109,500$ 73,000 Savings$ 73,000$ 109,500$ 146,000 Annual

Resources Applied Reliability by Dave Trindade Applied Reliability by Dave Trindade Reliability Magazine magazine.com/index.php Reliability Magazine magazine.com/index.phphttp:// magazine.com/index.phphttp:// magazine.com/index.php IEEE Reliability Society IEEE Reliability Society