CORRELATION AND FUNCTIONAL DEPENDENCY
28th Annual DoD Cost Analysis Symposium, Leesburg, VA, 22 September 1994
An Overview of Correlation and Functional Dependencies

Presentation transcript:

An Overview of Correlation and Functional Dependencies in Cost Risk and Uncertainty Analysis
Richard L. Coleman
22 September 1994
Paper submitted by Richard L. Coleman and Shishu S. Gupta, 15 August 1994

STATEMENT OF THE PROBLEM
Correctly capturing the effects of correlation raises the total uncertainty and total risk estimates.
Since the intent of uncertainty and risk estimation is to quantify the amount by which the estimate may be wrong, understatement of either cannot be tolerated, and so correlation cannot be ignored.

SCOPE
Correlation and its effects will be briefly described.
Difficulties with implementing correlation will be discussed.
A solution will be proposed for both uncertainty and risk estimation, in turn.
  – Model setup and results will be shown.
  – Impact will be examined.

DEFINITIONS
Cost Growth: the change (positive or negative) in the final cost of a weapon system.
  – Predicted growth at the outset or during the acquisition.
  – Actual growth at the end of acquisition.
Risk: the prediction of cost growth before the end of the acquisition.
Uncertainty: the statistical variability in prediction of the costs (not cost growth).
  – Implicit in the data and procedures (e.g., linear regression) used to generate a cost estimate.

DEFINITIONS (cont’d)
Functional dependency: a dependent variable derived from a functional relationship to an independent variable.
  – “Source variable” and “derivative variable” will replace “independent variable” and “dependent variable,” respectively, to avoid confusion over the algebraic and statistical meanings of “independent.”
Functional correlation: correlation that arises between source and derivative variables as a result of functional dependency.
Derivative correlation: correlation that arises among derivative variables as a result of functional dependency.

ASSUMPTIONS AND LIMITATIONS
This briefing is couched in terms of the Normal (Gaussian) distribution and the triangular distribution.
  – The procedure is in no way limited to any particular distribution.

IMPACT OF CORRELATION
The correlation of costs does not usually affect the point estimate of costs.
  – Correlations of predictive inputs are dealt with in standard ways, e.g., choice of independent inputs.
  – Depending on distributional assumptions, and the percentile used, the point estimate may shift with correlation.
The impact arises in the quantification of the uncertainty and risk of a cost estimate.
  – The estimates of risk and uncertainty are almost always affected.

CORRELATION AND DISPERSION
In positive correlation, correlated costs tend to rise and fall together, increasing dispersion.
In independence, some costs rise while others fall, offsetting the excesses of one another, giving intermediate dispersion.
In negative correlation, correlated cost changes tend to directly offset one another, reducing total dispersion.
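
The three cases above follow from the standard variance identity for the sum of two costs. The symbols X, Y, and the correlation coefficient ρ are introduced here for illustration and are not part of the briefing:

```latex
\sigma_{X+Y}^{2} \;=\; \sigma_X^{2} + \sigma_Y^{2} + 2\,\rho\,\sigma_X\,\sigma_Y
```

With ρ > 0 the cross term adds to the total variance, with ρ = 0 it vanishes (the independence case), and with ρ < 0 it subtracts.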

TYPICAL PRACTICES
Risk and uncertainty are usually estimated at a given level of WBS, and then "rolled up" to higher levels.
At the estimated level, models use probability distributions derived from the underlying data.
  – Dispersion is characterized as large, medium or small, or in the worst cases, simply guessed at.
Correlations are usually not treated.
  – When they are treated, they are usually assumed (based on the analyst's "experience").

OBSTACLES TO CORRELATION ESTIMATION
Discovery of mutual cost correlation is nearly impossible in practice.
  – Most data and the resulting CERs are derived piecemeal, from disparate data sources, which will not yield correlation estimates.
Correlation is useful principally in risk and uncertainty analysis, so cost research does not collect and analyze it, except in individual CERs.
Risk and uncertainty research focuses on models and distributions and does not get to correlation collection and analysis.

OBSTACLES TO CORRELATION IMPLEMENTATION
Estimated or assumed correlations might provide a mathematically intractable correlation matrix.
Production of a full set of random numbers needed to produce a simulation solution is ponderous.
  – Relies on, e.g., the Cholesky factorization method.
The problem becomes even more complex if non-Gaussian distributions are used.
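
For context, the explicit approach referred to above can be sketched as follows. This is a minimal illustration, assuming NumPy and Gaussian inputs; the function name, means, standard deviations, and correlation matrices are all hypothetical and do not come from the briefing. It also shows how a set of individually plausible pairwise assumptions can produce a matrix that cannot be factored.

```python
import numpy as np

def correlated_normals(means, stds, corr, n_draws, seed=0):
    """Draw correlated normal samples via a Cholesky factor of the covariance."""
    means = np.asarray(means, dtype=float)
    stds = np.asarray(stds, dtype=float)
    corr = np.asarray(corr, dtype=float)
    cov = corr * np.outer(stds, stds)
    # Raises numpy.linalg.LinAlgError when cov is not positive definite.
    lower = np.linalg.cholesky(cov)
    rng = np.random.default_rng(seed)
    z = rng.standard_normal((n_draws, len(means)))
    return means + z @ lower.T

# Hypothetical three-element example with a consistent correlation matrix.
corr = [[1.0, 0.5, 0.3],
        [0.5, 1.0, 0.2],
        [0.3, 0.2, 1.0]]
samples = correlated_normals([100, 20, 30], [10, 4, 6], corr, 10_000)
sample_corr = np.corrcoef(samples, rowvar=False)

# Hypothetical inconsistent assumptions: A~B and B~C strongly positive,
# but A~C strongly negative. The factorization fails.
bad_corr = [[1.0, 0.9, -0.9],
            [0.9, 1.0, 0.9],
            [-0.9, 0.9, 1.0]]
try:
    correlated_normals([0, 0, 0], [1, 1, 1], bad_corr, 10)
    bad_is_valid = True
except np.linalg.LinAlgError:
    bad_is_valid = False
```

Extending this sketch to non-Gaussian marginals would need additional machinery, which is the added complexity the slide refers to.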

UNCERTAINTY SETUP
The same WBS and relationships as are used in the cost estimate are embedded into a spreadsheet-based model.
The WBS level varies, to achieve independence among “source” variables.
The model is set up to accept means and standard deviations.
Source variables come from random draws.
Derivative variables fall out of the equations.

EXAMPLE WBS STRUCTURE
[Figure: example WBS with elements Hardware, SE/PM, DevEng, and Total cost]

HOW CORRELATION OCCURS
If a source variable is randomly drawn (Hardware), then a second source variable is drawn (SE/PM Factor), and multiplied by the first variable, the derivative variable (SE/PM) will be correlated to the source variables (Hardware and the Factor variable).
  – Whenever Hardware is high, SE/PM will tend to be high, mitigated by the variability in the SE/PM Factor, which is also an independent random variable.
  – When the SE/PM Factor is high, SE/PM will be high, mitigated by the effect of the value of the independent Hardware variable.

UNCERTAINTY RUN EXAMPLE
Let Hardware be a source variable.
Let SE/PM and DevEng be derivative variables (linear functions of hardware) as follows:
  SE/PM = * Hardware
  DevEng = * Hardware
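
The setup described above can be sketched in a few lines of simulation code. This is a minimal illustration, assuming NumPy and Normal source variables; every mean and standard deviation below is a hypothetical placeholder, since the briefing's input values are not reproduced in this transcript. The independence baseline at the end is also an assumption about how an "all source variables" model would be built.

```python
import numpy as np

rng = np.random.default_rng(1994)
n_draws = 1000  # run length matching the 1000 draws mentioned in the briefing

# Source variables: independent random draws (hypothetical means and std devs).
hardware = rng.normal(100.0, 20.0, n_draws)
sepm_factor = rng.normal(0.30, 0.06, n_draws)
deveng_factor = rng.normal(0.50, 0.10, n_draws)

# Derivative variables fall out of the cost-estimate equations.
sepm = sepm_factor * hardware
deveng = deveng_factor * hardware
total = hardware + sepm + deveng

# Baseline for comparison: treat every element as a source variable,
# drawn independently with the same mean and standard deviation.
indep_total = (hardware
               + rng.normal(sepm.mean(), sepm.std(), n_draws)
               + rng.normal(deveng.mean(), deveng.std(), n_draws))

functional_corr = np.corrcoef(hardware, sepm)[0, 1]   # source vs. derivative
derivative_corr = np.corrcoef(sepm, deveng)[0, 1]     # derivative vs. derivative

print("functional correlation (Hardware, SE/PM):", round(functional_corr, 3))
print("derivative correlation (SE/PM, DevEng):", round(derivative_corr, 3))
print("std of total, functional dependency:", round(total.std(), 2))
print("std of total, all independent:", round(indep_total.std(), 2))
```

No correlation matrix is specified anywhere in the sketch; both kinds of correlation emerge from the equations alone.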

UNCERTAINTY MODEL OUTPUT
Variable Name | Type of Variable | Input Mean | Input Std Deviation | One Simulation Result* | Output Mean | Output Std Deviation
Hardware | Source | | | | |
SE/PM Factor | Source | | | | |
SE/PM | Derivative | | | | |
DevEng Factor | Source | | | | |
DevEng | Derivative | | | | |
Total Cost | Derivative | | | | |
* The result of either a draw of a single random variable from the distribution (in the case of a source variable), or an algebraic expression involving constants and source variables (in the case of a derivative variable).

FUNCTIONAL CORRELATION
The first 30 sets of the full 1000 random draws were captured and regressed, in order to test for relationships.
Results:
  SE/PM = * Hardware
  DevEng = * Hardware
Statistic | SE/PM | DevEng
Std. Err. of Y Est | |
R-Squared | |
Correlation | |
t statistic | |

EXAMPLE WBS STRUCTURE
Functional Correlations
[Figure: example WBS (Hardware, SE/PM, DevEng, Total cost) annotated with functional correlations]

DERIVATIVE CORRELATION
The first 30 sets of points were tested for correlation between the derivative variables, SE/PM and DevEng.
Results:
  Correlation coefficient: 0.484
  t statistic:
(Note: this is arithmetically equivalent to a regression with r-squared of 0.234, and the same t statistic, but represents a different conceptual model.)
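
The arithmetic equivalence noted above can be checked directly. The worked value below assumes the t statistic is the usual test of zero correlation with n − 2 degrees of freedom for n = 30 points; the briefing's own figure is not reproduced in this transcript, so the result is a sketch under that assumption rather than the reported value:

```latex
r^{2} = 0.484^{2} \approx 0.234,
\qquad
t \;=\; \frac{r\sqrt{n-2}}{\sqrt{1-r^{2}}}
  \;=\; \frac{0.484\,\sqrt{28}}{\sqrt{1-0.234}}
  \;\approx\; 2.93
```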

EXAMPLE WBS STRUCTURE
Functional Correlations Plus Derivative Correlation
[Figure: example WBS (Hardware, SE/PM, DevEng, Total cost) annotated with functional correlations and the derivative correlation]

EFFECT ON DISPERSION
In the previous example, the standard deviation of the estimate increased:
  With independence:
  With correlation:
  Percent change: +23%

DIFFERENCES FROM OTHER APPROACHES
The expressions for derivative variables are those of the cost estimate.
  – A more simplistic approach, the way most models seem to operate, is to treat all variables as source variables, i.e., mutually independent.
Correlation effects are correctly captured without the difficulties of explicit treatment.
The values and distributions of derivative variables flow from the expressions.
  – No extra effort is necessary to derive these.

RISK SETUP (using TASC-STAR)
Point estimates are multiplied by random variables drawn from triangular distributions.
Both cost estimating risk and schedule/technical risk are applied.
  – The former is based on the standard error of the estimate.
  – The latter is based on historical Selected Acquisition Reports (SARs).
The adjusted value of each source variable is used as the basis for each derivative variable, which is then recomputed and multiplied by relevant risk factors.
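
The flow described above can be illustrated with a short sketch. It is not TASC-STAR itself: it assumes NumPy, and the point estimate, the SE/PM rate, the triangular parameters, and the `risk_factor` helper are all hypothetical. Risk is taken here to be the mean simulated total minus the point-estimate total, which is also an assumption.

```python
import numpy as np

rng = np.random.default_rng(7)
n_draws = 5000

# Hypothetical point estimates and relationship, for illustration only.
hardware_point = 100.0
sepm_rate = 0.30  # SE/PM point estimate as a fraction of Hardware

def risk_factor(low, mode, high, size):
    """Multiplicative risk factor drawn from a triangular distribution."""
    return rng.triangular(low, mode, high, size)

# Source variable: point estimate times a cost estimating risk factor
# and a schedule/technical risk factor.
hardware = (hardware_point
            * risk_factor(0.90, 1.00, 1.20, n_draws)
            * risk_factor(0.95, 1.00, 1.50, n_draws))

# Derivative variable: recomputed from the adjusted source value,
# then multiplied by its own risk factors.
sepm = (sepm_rate * hardware
        * risk_factor(0.90, 1.00, 1.20, n_draws)
        * risk_factor(0.95, 1.00, 1.30, n_draws))

total = hardware + sepm
point_total = hardware_point * (1.0 + sepm_rate)
risk = total.mean() - point_total          # assumed definition of risk dollars
hw_sepm_corr = np.corrcoef(hardware, sepm)[0, 1]

print("risk:", round(risk, 2), "std of total:", round(total.std(), 2))
```

Because SE/PM is rebuilt from the risk-adjusted Hardware value on every draw, growth in Hardware carries through to SE/PM without any explicit correlation input.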

RISK RUN EXAMPLE
The effect on an actual missile system risk estimate:
 | Independent | Correlated
Cost w/o risk | $1,181M | $1,181M
Risk | $66M | $241M
Total | $1,247M | $1,422M

EFFECT ON DISPERSION
In the previous example, the standard deviation of the estimate increased:
  With independence: $159M
  With correlation: $286M
  Percent change: +80%

CONCLUSIONS
Setting up functional dependencies in risk and uncertainty models
  – results in faithful replication of the functional correlation actually observed in the data, and
  – produces derivative correlation among variables not actually observed jointly.
Derivative correlation arises as an inescapable result of the functional dependencies, and is a natural outcome.
Many, if not all, of the problems of correlation are solved.
  – To proceed further requires data we do not yet have.

FURTHER RESEARCH
Collect a moderately large, “connected” set of data to observe and test actual correlations. Then,
  – Test functional dependency results.
  – Test alternative methods, and compare fidelity and difficulty.

Backup

NEGATIVE CORRELATION: AN EXAMPLE
Suppose two manufacturers build bricks.
  – Their materials costs are similar.
  – Labor costs are similar.
  – Overhead is similar.
One company charges part of the time of its foremen to Quality Assurance (QA) that the other company charges to Test and Evaluation (T&E).
The costs of QA and T&E will be negatively correlated.
Dispersion will be:
  – Large in QA and T&E
  – Small in total cost
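
The brick example can be illustrated numerically. The sketch below extends the two-manufacturer story to a hypothetical population of firms, each booking a different share of the same foreman cost to QA and the remainder to T&E; it assumes NumPy, and all dollar figures and the noise level are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(3)
n_firms = 2000

foreman_cost = 50.0   # hypothetical foreman cost, similar at every firm
base_cost = 20.0      # hypothetical other QA or T&E cost, similar at every firm

# Accounting choice: fraction of foreman time booked to QA (rest goes to T&E).
share_to_qa = rng.uniform(0.0, 1.0, n_firms)

qa = base_cost + share_to_qa * foreman_cost + rng.normal(0.0, 2.0, n_firms)
te = base_cost + (1.0 - share_to_qa) * foreman_cost + rng.normal(0.0, 2.0, n_firms)
qa_plus_te = qa + te

qa_te_corr = np.corrcoef(qa, te)[0, 1]

print("correlation of QA and T&E:", round(qa_te_corr, 3))
print("std of QA:", round(qa.std(), 2), "std of T&E:", round(te.std(), 2))
print("std of QA + T&E:", round(qa_plus_te.std(), 2))
```

The accounting choice moves cost between the two elements without changing their sum, which is why the individual elements are widely dispersed while the combined cost is not.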