Verification & Validation

Slides:



Advertisements
Similar presentations
Introduction into Simulation Basic Simulation Modeling.
Advertisements

T.C ATILIM UNIVERSITY MODES ADVANCED SYSTEM SIMULATION MODES 650
ETM 607 – Output Analysis: Estimation of Absolute Performance
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Variance reduction techniques. 2 Introduction Simulation models should be coded such that they are efficient. Efficiency in terms of programming ensures.
Experimental Design, Response Surface Analysis, and Optimization
G. Alonso, D. Kossmann Systems Group
 1  Outline  Model  problem statement  detailed ARENA model  model technique  Output Analysis.
Output analyses for single system
Chapter 5: Confidence Intervals.
Chapter 8 Estimation: Additional Topics
Output Data Analysis. How to analyze simulation data? simulation –computer based statistical sampling experiment –estimates are just particular realizations.
Analysis of Simulation Experiments
Queuing Models Basic Concepts
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Performance Analysis and Monitoring Facilities in CPN Tools Tutorial CPN’05 October 25, 2005 Lisa Wells.
Components and Organization of Discrete-event Simulation Model
1 Performance Evaluation of Computer Networks Objectives  Introduction to Queuing Theory  Little’s Theorem  Standard Notation of Queuing Systems  Poisson.
Simulation.
Lecture 9 Output Analysis for a Single Model. 2  Output analysis is the examination of data generated by a simulation.  Its purpose is to predict the.
Variance Reduction Techniques
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
1 Validation and Verification of Simulation Models.
Lecture 10 Comparison and Evaluation of Alternative System Designs.
QUEUING MODELS Queuing theory is the analysis of waiting lines It can be used to: –Determine the # checkout stands to have open at a store –Determine the.
Lecture 17 Interaction Plots Simple Linear Regression (Chapter ) Homework 4 due Friday. JMP instructions for question are actually for.
5-3 Inference on the Means of Two Populations, Variances Unknown
WINTER 2012IE 368. FACILITY DESIGN AND OPERATIONS MANAGEMENT 1 IE 368: FACILITY DESIGN AND OPERATIONS MANAGEMENT Lecture Notes #3 Production System Design.
1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.
 We cannot use a two-sample t-test for paired data because paired data come from samples that are not independently chosen. If we know the data are paired,
IE 594 : Research Methodology – Discrete Event Simulation David S. Kim Spring 2009.
Analysis of Simulation Results Andy Wang CIS Computer Systems Performance Analysis.
Introduction to Statistical Inferences
Simulation Output Analysis
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
Simulation Examples ~ By Hand ~ Using Excel
SIMULATION MODELING AND ANALYSIS WITH ARENA
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
 1  Outline  stages and topics in simulation  generation of random variates.
Steady-State Statistical Analysis By Dr. Jason Merrick.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
MS 305 Recitation 11 Output Analysis I
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Agresti/Franklin Statistics, 1 of 106  Section 9.4 How Can We Analyze Dependent Samples?
NETE4631:Capacity Planning (2)- Lecture 10 Suronapee Phoomvuthisarn, Ph.D. /
© 2003, Carla Ellis Simulation Techniques Overview Simulation environments emulation exec- driven sim trace- driven sim stochastic sim Workload parameters.
Structure of a Waiting Line System Queuing theory is the study of waiting lines Four characteristics of a queuing system: –The manner in which customers.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
ETM 607 – Input Modeling General Idea of Input Modeling Data Collection Identifying Distributions Parameter estimation Goodness of Fit tests Selecting.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
1 Terminating Statistical Analysis By Dr. Jason Merrick.
Chapter 10 Verification and Validation of Simulation Models
1 OUTPUT ANALYSIS FOR SIMULATIONS. 2 Introduction Analysis of One System Terminating vs. Steady-State Simulations Analysis of Terminating Simulations.
ETM 607 – Output Analysis: Estimation of Relative Performance Output comparison between two or more alternative systems Common Random Numbers (CRN) Comparison.
Output Analysis for Simulation
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Statistical Significance Hypothesis Testing.
1 Probability and Statistics Confidence Intervals.
 Simulation enables the study of complex system.  Simulation is a good approach when analytic study of a system is not possible or very complex.  Informational,
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Simulation. Types of simulation Discrete-event simulation – Used for modeling of a system as it evolves over time by a representation in which the state.
Building Valid, Credible & Appropriately Detailed Simulation Models
K. Salahpp.1 Chapter 9 Output Analysis for Single Systems.
Variance reduction techniques Mat Simulation
Chapter 10 Verification and Validation of Simulation Models
MECH 3550 : Simulation & Visualization
Modeling and Simulation: Exploring Dynamic System Behaviour
Presentation transcript:

Verification & Validation

Motivation The simulation model needs to work as intended. It should adequately represent the system being simulated.

Verification & Validation

Verification Possible methods … Use all of the prior methods in extreme cases Examples Examine only one part type at a time. Allow only one entity into the system.

Verification Use all of the prior methods in extreme cases Examples Control the distributions used. Set all processing times to constants – You should be able to predict quantities such as the entity avg. TIS. Control arrival rates Extremely high. Extremely low.

Verification Need to run very large models for a long simulated time to check for gradual buildup of queues. May be able to compute utilizations. During model development

Validation In general this is more difficult than verification. Two general cases

Validation System exists. Compare results to historical results. Does the data exist? Are you comparing “apples to apples”? Control model variability Resource availability If a known long maintenance period occurred for machines or if known long failures/repairs occurred, duplicate these in the model. Duplicate resource schedules. Duplicate the arrival pattern over which historical performance data has been collected. Job types. Job arrival times.

Validation System does not exist. Change the data and modify the model so that it simulates a system that still exists. Then compare to historical data. Expert opinion. Run extreme cases.

Output Analysis

Introduction A simulation model has been constructed of a system to generate estimates of one or more performance measures. Random inputs/system components in the simulation imply that the outputs (performance measure estimates) will be observations from probability distributions.

Introduction Questions about starting and running the model.

Types of Simulation “Runs” In general, there are two types of simulation analyses performed that dictate how the previous questions should be answered. Terminating – Steady state –

Example Simulation of a bank from open to close (9AM-5PM). This is an example of a terminating simulation. How do you start the simulation? Empty and idle. How long should the model be run (how much simulated time before stopping the run)? 9AM until all customers depart after 5PM.

Terminating Simulations Time period of interest is defined.

Example Bank simulation from 9AM-5PM. Customers arriving before 5PM are all served. The simulation ending time may vary from replication to replication. What is the termination criteria?

Arena To properly end such a simulation, use Run -> Setup -> Replication Parameters Set ending criteria in “Terminating Condition”. Leave “Replication Length” field blank. The termination criteria will use Arena “syntax”.

Arena

Terminating Simulations A simulation of a service facility with “rush hour” periods is constructed. There is a specific time period of interest – 11AM -1PM. The facility is open at 9AM. How long to simulate? 11AM-1PM.

Terminating Simulations Approaches for starting a simulation with unknown initial conditions. Collect data on the system state at the start of the rush hour period. Initialize the simulation with the “average” system state. Initialize the simulation with a random system state based on the collected data. Both hard to do in Arena, Straightforward in an “event driven” model (e.g., the manual simulation).

Terminating Simulations Approaches for starting a simulation with unknown initial conditions.

Arena

Steady State Simulations Steady state simulations are used to understand how a system performs after being in operations for a long time. The system has reached “steady state” where the performance is independent of the initial conditions. Examples Production line simulations – the line starts where it left off at the end of prior shifts. Emergency rooms. Worst case analysis.

Steady State Simulations How to start the simulation? Typically a warm-up period is used to minimize any impact of initial conditions. How long should the warm-up period be? Can determine from some sample runs of the simulation. How long to run the simulation?

Steady State Simulations

Steady State Simulations What are you looking for in the plots? Impact of initial conditions. Does the plot from time zero look different than other portions of the graph? Does the performance measure seem to be growing without bound?

The Number of Replications and Analysis of Output We will focus on the analysis of terminating simulations. They are more common in practice. The analysis is more straightforward. Simulation models are used for experimentation. One simulation replication → a single realization of each system performance measure. n independent replications → n independent samples from the same distribution.

The Number of Replications and Analysis of Output Consider a single performance measure. Let Xi be the random variable that represents the value of the performance measure for the ith simulation replication. xi = outcome/realization of Xi from the ith simulation replication. Since the Xi are independent and identically distributed random variables the performance can be characterized using the “typical” confidence interval.

Analysis of Output The approximate confidence interval Assumes the observations are from a normal distribution.

The Number of Replications How to estimate number of independent replications required for a desired precision expressed as a confidence interval half-width. The half-width h of this confidence interval is This cannot be used to precisely calculate n to get the desired precision h since the t-value is a function of n.

The Number of Replications Substitute Use this formula to approximate the number of replications needed to get a desired half-width (precision) for some performance measure.

The Number of Replications Example – Average Time In System (TIS) for entities being processed.

The Number of Replications Use Arena to send TIS average results from independent replications to a text file. Use this data to estimate n – the number of independent replications needed for a desired precision.

The Number of Replications

The Number of Replications

The Number of Replications

The Number of Replications

Comparing Two Alternatives In many situations where simulation is used, changes to an existing or “base” system are explored. There is a need to do a comparison of two systems. Simulation has been used to generate estimates of performance measure for the base system and a changed system. Analysis Two sample t-confidence interval. Paired t-confidence interval.

Comparing Two Alternatives

Comparing Two Alternatives “Standard” two-sample t-confidence interval. Assumptions X1j’s and X2j’s are independent. Var(X1j) = Var(X2j) . n1 and n2 need not be equal. Not recommended for simulation output analysis since the equal variance assumption is often not met (Kelton and Law 2000). However, the test is robust if n1 = n2.

Comparing Two Alternatives “Welch” two-sample t-confidence interval. Assumptions X1j’s and X2j’s are independent and normally distributed. n1 and n2 need not be equal.

Comparing Two Alternatives

Comparing Two Alternatives Paired t-confidence interval. Assumptions n1 and n2 are equal.

Comparing Two Alternatives

Comparing Two Alternatives The paired t-confidence interval forms a new sample as the difference between corresponding outputs from the same replication number for system 1 and system 2. X1j and X2j may be dependent. The Di’s must be independent. Var(X1j) = Var(X2j) is not necessary.

Example Experiment with the single server model arrival process. t 9,.975 = 2.26, Performance measure is avg. TIS.

In-class Exercise For the following data. Form 95% CIs for system 1 and system 2 output separately. Check for overlap. Form a 95% paired t-confidence interval. Check for the inclusion of zero. Construct a Welch two-sample t-confidence interval.

T(58,.975) = 2.0

In-class Exercise

Comparing >2 Systems Comparisons with a base system. All pairwise comparisons. Simple procedure is to apply what is called the Bonferroni inequality.

Comparing >2 Systems Example – One base system and four alternatives => four confidence intervals when comparing each system with the base system. To get 90% overall confidence level each individual confidence interval should be a 97.5% CI.

Comparing >2 Systems Experimental design methods are helpful for exploring a “factor space”. Care must be taken when conducting the experiments and analyzing experimental results due to the non-random nature of generating “random” values in a simulation.