1 Using Excel to Implement Software Reliability Models Norman F. Schneidewind Naval Postgraduate School 2822 Racoon Trail, Pebble Beach, California, 93953,

Presentation transcript:

1 Using Excel to Implement Software Reliability Models Norman F. Schneidewind Naval Postgraduate School 2822 Racoon Trail, Pebble Beach, California, 93953, USA Voice: (831) Fax: (831)

2 Outline Introduction; Characteristics of Excel Implementation; Combined Software Reliability Tools – Excel Approach; Structure of Combined Approach; Notation for Prediction Worksheet; Equations for Prediction and Comparison Worksheets; Example Prediction Worksheet; Analysis of Prediction Worksheet; Notation for Actual – Prediction Comparisons Worksheet; Example Actual – Prediction Comparisons Worksheet; Analysis of Comparison Worksheet; Cumulative Failure Prediction Plots; Validation of Failure Count Predictions; Time to Failure Plot; Validation of Time to Failure Predictions; Conclusions; Excel Demo

3 Introduction CASRE and SMERFS, hereafter referred to as SRT (software reliability tools), were developed prior to the availability of mature spreadsheet programs. –Programs like Excel were not an option, but things have changed. In Excel, the user can create equations, do data and statistical analysis, make plots, and do programming using Visual Basic. In SRT, the programming of the models has been done for the user, but the functionality is fixed until the next revision.

4 Characteristics of Excel Implementation #1 Advantages: –Almost all practitioners have Excel. A minority of practitioners have SRT. –Easier for practitioners to use than SRT. –Typically, failure data is provided by practitioners in Excel. –Improve technology transfer: Predictions can be made by the researcher in the spreadsheet and returned to the practitioner in the same spreadsheet. –Formatted Excel data can be imported into Word and PowerPoint for creating reports and presentations.

5 Characteristics of Excel Implementation # 2 Advantages: –User has more control over formatting of data, prediction results, and plots. –A large set of built-in mathematical and statistical functions is available for reliability analysis. SRT is limited to functions like Chi-square. –User can construct his own reliability equations. SRT equations are fixed, based on the models implemented. –More flexibility in changing terms in equations. Change cell values; copy and paste equations.

6 Characteristics of Excel Implementation # 3 Disadvantages: –Column and cell orientation of spreadsheets is cumbersome. It is not a natural mathematical format. Need to repeat parameter entries for iterations of equations. Variable names are not case sensitive. Variable names cannot be the same as column or cell names. –Thus, some variables must be renamed to avoid naming conflicts.

7 Characteristics of Excel Implementation # 4 Disadvantages: –Mathematical library is not as extensive as the Fortran and C++ libraries used in SRT. –Does not have the sophisticated model evaluation criteria of SRT. However, error analysis between actuals and predictions (i.e., validation) can be done in Excel.

8 Combined Software Reliability Tools – Excel Approach Best approach may be to combine SRT with Excel. SRT provides model parameter estimation. –Beyond the capabilities of Excel unless programmed in Visual Basic. –Copy and paste parameters from SRT into spreadsheet. Excel extends capabilities of SRT by allowing user provided equations, statistical analysis, and plots.

9 Structure of Combined Approach Worksheets: –Definitions: Notation, Equations. –Predictions: Analysis. –Actual – Prediction Comparisons: Analysis, Plots, Validation. Examples of this approach follow.

10 Notation for Prediction Worksheet

11 Equations for Prediction and Comparison Worksheets
Time to Next Failure(s) Predicted at Time t: $T_F(t)$ (equation image not captured in the transcript)
Remaining Failures Predicted at Time t: $r(t) = (\alpha/\beta) - X_{s,t}$
Cumulative Number of Failures Detected at Time T: $D(T) = (\alpha/\beta)\,[1 - \exp(-\beta(T - s + 1))] + X_{s-1}$
Cumulative Number of Failures Detected Over Life of Software $T_L$: $D(T_L) = \alpha/\beta + X_{s-1}$
References: [1, 2, 3].
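The three equations that survive on this slide translate directly into code. The following is a minimal Python sketch, not the worksheet itself: the function names and the parameter values (`alpha = 10.0`, `beta = 0.1`, the failure counts) are hypothetical and chosen only to exercise the formulas. In the approach described here, $s$, $\alpha$, and $\beta$ would come from SMERFS.

```python
import math


def remaining_failures(alpha, beta, x_s_t):
    """r(t) = alpha/beta - X_{s,t}."""
    return alpha / beta - x_s_t


def cumulative_failures(alpha, beta, s, big_t, x_s_minus_1):
    """D(T) = (alpha/beta) * [1 - exp(-beta * (T - s + 1))] + X_{s-1}."""
    return (alpha / beta) * (1.0 - math.exp(-beta * (big_t - s + 1))) + x_s_minus_1


def lifetime_failures(alpha, beta, x_s_minus_1):
    """D(T_L) = alpha/beta + X_{s-1}, the limit of D(T) as T grows."""
    return alpha / beta + x_s_minus_1


# Hypothetical inputs, for illustration only (not values from the slides).
alpha, beta = 10.0, 0.1   # model parameters, normally estimated by SMERFS
s = 1                     # first interval used for parameter estimation
x_s_minus_1 = 0           # failures observed before interval s
x_s_t = 60                # failures observed in the range [s, t]

print(remaining_failures(alpha, beta, x_s_t))
print(cumulative_failures(alpha, beta, s, 27, x_s_minus_1))
print(lifetime_failures(alpha, beta, x_s_minus_1))
```

Because the exponential term vanishes as $T$ grows, `cumulative_failures` approaches `lifetime_failures` from below, which is a quick sanity check on any set of parameters.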

12 Example Prediction Worksheet

13 Analysis of Prediction Worksheet # 1 $s$, $\alpha$, and $\beta$ obtained from SMERFS. One interval = one week of calendar time. Project 1: –Optimal $s = 1$ for both failure count and time to failure predictions. –$t = 26$: interval when the time to next failure prediction is made. This is also the last interval of observed failure data. –$X_{26} = 130$: observed failure count in the range [1, 26]. –$F_1 = 1$: given number of failures to occur after interval 26. –$T_F(26) = 3.96$ intervals: time to next failure predicted at time 26 intervals.

14 Analysis of Prediction Worksheet #2 Project 1: –$r(26) = 2.14$: remaining failures predicted at time 26 intervals. –$T = 27$ intervals: test time. –$D(27)$: cumulative number of failures detected at time 27 intervals. –$D(\infty)$: cumulative number of failures detected over life of software (conservatively, infinity). $r(26) = D(\infty) - X_{26} = D(\infty) - 130 = 2.14$ remaining failures, as in the above.
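The identity on this slide can be checked with the two Project 1 figures that do appear in the transcript, $X_{26} = 130$ and $r(26) = 2.14$. The short Python sketch below rearranges $r(26) = D(\infty) - X_{26}$; the resulting lifetime count is a value implied by that arithmetic, not a number read from the worksheet, and the variable names are illustrative.

```python
# Figures stated in the transcript for Project 1.
x_26 = 130    # observed failures in the range [1, 26]
r_26 = 2.14   # remaining failures predicted at interval 26

# Rearranging r(26) = D(inf) - X_26 gives the implied lifetime failure count.
d_inf_implied = r_26 + x_26
print(round(d_inf_implied, 2))  # 132.14

# With s = 1 there are no failures before interval s (X_0 = 0), so under the
# slide's equations alpha/beta would equal this same implied value.
alpha_over_beta_implied = d_inf_implied - 0
```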

15 Analysis of Prediction Worksheet #3 Project 2: –Total range of 35 weeks divided into Parameter Estimation Range = 1, 23 weeks and Prediction Range = 24, 35 weeks for the purpose of model validation. Model fit using historical data does not demonstrate validity! –Estimate model parameters in range 1, 23 weeks. Accuracy of future predictions demonstrates validity. –Predict in range 24, 35 weeks and compare with actuals. –Optimal s = 12 for both failure count and time to failure predictions.

16 Analysis of Prediction Worksheet #4 Project 2: –$t = 23$: interval when the time to next failure prediction is made. –$X_{11} = 39$: observed failure count in the range [1, 11]. –$X_{12,23} = 32$: observed failure count in the range [12, 23]. –$X_{23} = 71$: observed failure count in the range [1, 23]. –$F_1 = 5, \ldots, 20$: given number of failures to occur after interval 23. –$T_F(23) = 2.63, \ldots$ intervals: time to next failures predicted at time 23 intervals.

17 Analysis of Prediction Worksheet #5 Project 2: –$r(23) = 44.96$: remaining failures predicted at time 23 intervals. –$T = 23, \ldots, 35$ intervals: test time. –$D(23, \ldots, 35) = 71.00, \ldots$: cumulative number of failures detected at time 23, …, 35 intervals. –$D(\infty)$: cumulative number of failures detected over life of software (conservatively, infinity). $r(23) = D(\infty) - X_{23} = D(\infty) - 71 = 44.96$ remaining failures, as in the above.
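For Project 2 the optimal $s$ is 12, so the failures observed before interval $s$ enter the equations separately. The Python sketch below uses only the counts stated in the transcript ($X_{11} = 39$, $X_{12,23} = 32$, $X_{23} = 71$, $r(23) = 44.96$) and shows that the two routes to the remaining failures agree. The intermediate values are implied by the arithmetic rather than read from the worksheet, and the variable names are illustrative.

```python
# Figures stated in the transcript for Project 2 (s = 12, t = 23).
x_11 = 39       # failures in [1, 11], i.e. X_{s-1}
x_12_23 = 32    # failures in [12, 23], i.e. X_{s,t}
x_23 = 71       # failures in [1, 23]
r_23 = 44.96    # remaining failures predicted at interval 23

# r(t) = alpha/beta - X_{s,t}  =>  alpha/beta = r(t) + X_{s,t}
alpha_over_beta = r_23 + x_12_23
print(round(alpha_over_beta, 2))  # 76.96

# D(T_L) = alpha/beta + X_{s-1}
d_inf = alpha_over_beta + x_11
print(round(d_inf, 2))  # 115.96

# Cross-check: D(inf) - X_23 should reproduce r(23).
print(round(d_inf - x_23, 2))  # 44.96
```

The check works because $X_{23} = X_{11} + X_{12,23}$, so subtracting the full observed count from the lifetime count leaves exactly $\alpha/\beta - X_{s,t}$.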

18 Notation for Actual – Prediction Comparisons Worksheet
Parameter Estimation Range = 1, 23 weeks; Prediction Range = 24, 35 weeks; s = 12 weeks.
D(T) Actual = Actual Cumulative Count, from Interval 1, in Prediction Range
D(T) Pred = Predicted Cumulative Count, from Interval 1, in Prediction Range
Interval Actual = Difference in D(T) Actual
Interval Pred = Difference in D(T) Pred
Int Act Cum = Interval Actual Cumulative Count, from Interval 24, in Prediction Range
Int Pred Cum = Interval Predicted Cumulative Count, from Interval 24, in Prediction Range
TF(t) Actual = Actual Time to Next Given Number of Failures in the Int Act Cum column
TF(t) Pred = Predicted Time to Next Given Number of Failures in the Int Act Cum column
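The derived columns in this notation can be built from the two cumulative columns alone. The Python sketch below is one way to do it, assuming the cumulative columns start at week 23 (the last estimation week) so that the first difference belongs to week 24. All data values and helper names are hypothetical; the worksheet's actual numbers are not in the transcript.

```python
from itertools import accumulate


def interval_counts(cumulative):
    """Interval column: difference between successive cumulative counts."""
    return [later - earlier for earlier, later in zip(cumulative, cumulative[1:])]


def running_total(values):
    """Int ... Cum column: interval counts accumulated from the first interval."""
    return list(accumulate(values))


# Hypothetical cumulative counts for weeks 23..27 (illustration only).
weeks = [23, 24, 25, 26, 27]
d_actual = [71, 74, 76, 80, 81]
d_pred = [71.0, 73.5, 75.75, 78.0, 80.25]

interval_actual = interval_counts(d_actual)   # weeks 24..27
interval_pred = interval_counts(d_pred)
int_act_cum = running_total(interval_actual)  # accumulated from week 24
int_pred_cum = running_total(interval_pred)

for week, ia, ip, ac, pc in zip(weeks[1:], interval_actual, interval_pred,
                                int_act_cum, int_pred_cum):
    print(week, ia, ip, ac, pc)
```

Accumulating the differences from week 24 is the same as subtracting the week 23 cumulative count from each later one, which is a convenient check on the worksheet columns.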

19 Example Actual – Prediction Comparisons Worksheet

20 Analysis of Comparison Worksheet # 1 Project 2 –D(T) Actual is compared with D(T) Pred. Failure counts are accumulated from Interval 1 in the parameter estimation range, but are compared in the prediction range. –Interval Actual is compared with Interval Pred. Interval failure counts are compared in the prediction range. –Int Act Cum is compared with Int Pred Cum. Interval failure counts are accumulated from Interval 24 in the prediction range and compared in the prediction range.

21 Analysis of Comparison Worksheet # 2 Project 2 Make plots in prediction range: –Actual and Predicted Cumulative Failures in Range 1, 35 Weeks. –Actual and Predicted Cumulative Failures in Range 24, 35 Weeks. –Validation of Failure Count Predictions. Residuals: (Predicted – Actual) versus week. –Residuals do not show bias (i.e., trend in either positive or negative direction). –The average residual (in failures) indicates an optimistic prediction on average.
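The residual analysis described here is a per-week subtraction followed by an average. The Python sketch below illustrates it with hypothetical cumulative counts; the function names and data are assumptions, and the reading of the sign (fewer failures predicted than observed counts as optimistic) is an interpretation, not a statement taken from the slide.

```python
def residuals(predicted, actual):
    """Residual per week, using the slide's convention: Predicted - Actual."""
    return [p - a for p, a in zip(predicted, actual)]


def average(values):
    """Arithmetic mean of the residuals."""
    return sum(values) / len(values)


# Hypothetical cumulative failure counts in the prediction range.
weeks = [24, 25, 26, 27]
actual = [74, 76, 80, 81]
predicted = [73.5, 75.75, 78.0, 80.25]

res = residuals(predicted, actual)
avg = average(res)

for week, value in zip(weeks, res):
    print(week, value)
print("average residual:", avg)

# Assumed interpretation: for failure counts, a negative average means the
# model predicted fewer failures than were observed, i.e. it was optimistic.
print("optimistic on average:", avg < 0)
```

A bias check in the sense of the slide would look for a steady drift in `res` over the weeks rather than at the average alone.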

22 Cumulative Failures in Range 1, 35 Weeks: Parameter Estimation Range plus Prediction Range

23 Cumulative Failures in Range 24,35 Weeks: Prediction Range

24 Validation of Failure Count Predictions Average Residual = failures

25 Analysis of Comparison Worksheet # 3 Project 2 Make plot in prediction range: –Actual and Predicted Time to Next Failures versus given number of failures. –Validation of Time to Failure Predictions. Residuals: (Predicted – Actual) versus given number of failures. –Residuals show bias starting at 15 failures (week 32) as it becomes difficult to predict further out into the future. –Average Residual = 0.87 weeks indicates optimistic prediction on average.
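The actual time to a given number of failures can be read off the Int Act Cum column by finding the first week in which the accumulated count reaches that number. The Python sketch below assumes whole-week resolution with no interpolation inside a week; the helper name and the data are hypothetical. Under this reading, reaching 15 failures in week 32 with $t = 23$ would correspond to an actual time of 9 weeks.

```python
def time_to_failures(int_cum, first_week, t, given_failures):
    """Weeks after interval t until the accumulated count first reaches
    given_failures; None if the data never reach it."""
    for offset, count in enumerate(int_cum):
        if count >= given_failures:
            return first_week + offset - t
    return None


# Hypothetical Int Act Cum column for weeks 24..29 (illustration only).
int_act_cum = [3, 5, 9, 10, 12, 16]
first_week = 24
t = 23  # interval at which the predictions were made

for f in (5, 10, 15, 20):
    print(f, time_to_failures(int_act_cum, first_week, t, f))

# Residual for one point, with a hypothetical predicted time of 2.63 weeks:
# Predicted - Actual > 0 means the failures arrived sooner than predicted.
predicted_tf = 2.63
actual_tf = time_to_failures(int_act_cum, first_week, t, 5)
print(round(predicted_tf - actual_tf, 2))
```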

26 Time to Given Number of Failures

27 Validation of Time to Failure Predictions Average Residual = 0.87 weeks

28 Conclusions Spreadsheet technology can effectively support software reliability modeling and prediction. Advantages relative to SRT are: –Easier transfer of technology to practitioners. –More user control of program’s operation. –Many built-in mathematical and statistical functions. Disadvantages relative to SRT are: –Cell format is not conducive to mathematical modeling. –No built-in model evaluation criteria. SRT and Excel can be combined to advantage: –SRT for reliability model parameter estimation. –Excel for reliability prediction.

29 References
[1] Norman F. Schneidewind, "Reliability Modeling for Safety Critical Software", IEEE Transactions on Reliability, Vol. 46, No. 1, March 1997.
[2] Norman F. Schneidewind, "Software Reliability Model with Optimal Selection of Failure Data", IEEE Transactions on Software Engineering, Vol. 19, No. 11, November 1993.
[3] Norman F. Schneidewind and T. W. Keller, "Application of Reliability Models to the Space Shuttle", IEEE Software, Vol. 9, No. 4, July 1992.