Chapter 6 Statistical Analysis of Output from Terminating Simulations.

Slides:



Advertisements
Similar presentations
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Advertisements

Outline input analysis input analyzer of ARENA parameter estimation
Variance reduction techniques. 2 Introduction Simulation models should be coded such that they are efficient. Efficiency in terms of programming ensures.
Experimental Design, Response Surface Analysis, and Optimization
Simulation Operations -- Prof. Juran.
Objectives (BPS chapter 18) Inference about a Population Mean  Conditions for inference  The t distribution  The one-sample t confidence interval 
Sampling Distributions (§ )
 1  Outline  Model  problem statement  detailed ARENA model  model technique  Output Analysis.
Output analyses for single system
AP Statistics – Chapter 9 Test Review
Analysis of Simulation Experiments
Lecture 9 Output Analysis for a Single Model. 2  Output analysis is the examination of data generated by a simulation.  Its purpose is to predict the.
Statistical Analysis of Output from Terminating Simulations
Variance Reduction Techniques
1 Validation and Verification of Simulation Models.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 7 Sampling.
Chapter 2 Simple Comparative Experiments
Experimental Evaluation
The Analysis of Variance
Inferences About Process Quality
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Graduate Program in Engineering and Technology Management
1 Terminating Statistical Analysis By Dr. Jason Merrick.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Hypothesis Testing in Linear Regression Analysis
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
Determining Sample Size
Chapter 5 Modeling & Analyzing Inputs
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
SIMULATION MODELING AND ANALYSIS WITH ARENA
More About Significance Tests
Steady-State Statistical Analysis By Dr. Jason Merrick.
Verification & Validation
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
The AIE Monte Carlo Tool The AIE Monte Carlo tool is an Excel spreadsheet and a set of supporting macros. It is the main tool used in AIE analysis of a.
CHAPTER 18: Inference about a Population Mean
MS 305 Recitation 11 Output Analysis I
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Simulation with Arena, 4 th ed.Chapter 6 – Stat. Output Analysis Terminating SimulationsSlide 1 of 31 Statistical Analysis of Output from Terminating Simulations.
Simulation with ArenaChapter 5 – Detailed Modeling and Terminating Statistical AnalysisSlide 1 of 88 Chapter 6 Statistical Analysis of Output From Terminating.
1 Statistical Distribution Fitting Dr. Jason Merrick.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
1 Terminating Statistical Analysis By Dr. Jason Merrick.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Nonparametric Statistics
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Within Subjects Analysis of Variance PowerPoint.
6.1 Inference for a Single Proportion  Statistical confidence  Confidence intervals  How confidence intervals behave.
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
1 OUTPUT ANALYSIS FOR SIMULATIONS. 2 Introduction Analysis of One System Terminating vs. Steady-State Simulations Analysis of Terminating Simulations.
MeanVariance Sample Population Size n N IME 301. b = is a random value = is probability means For example: IME 301 Also: For example means Then from standard.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
ETM 607 – Output Analysis: Estimation of Relative Performance Output comparison between two or more alternative systems Common Random Numbers (CRN) Comparison.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
K. Salahpp.1 Chapter 9 Output Analysis for Single Systems.
Comparing Systems Using Sample Data
Chapter 2 Simple Comparative Experiments
Statistical Analysis of Output from Terminating Simulations
Chapter 10 Verification and Validation of Simulation Models
Lesson Comparing Two Means.
CHAPTER 18: Inference about a Population Mean
Statistical Analysis of Output from Terminating Simulations
CHAPTER 18: Inference about a Population Mean
Presentation transcript:

Chapter 6 Statistical Analysis of Output from Terminating Simulations

Simulation with Arena Chapter 5 – Detailed Modeling and Terminating Statistical Analysis 2 Statistical Analysis of Output from Terminating Simulations Random input leads to random output (RIRO) Run a simulation (once) — what does it mean? – Was this run “typical” or not? – Variability from run to run (of the same model)? Need statistical analysis of output data – From a single model configuration – Compare two or more different configurations – Search for an optimal configuration Statistical analysis of output is often ignored – This is a big mistake – no idea of precision of results – Not hard or time-consuming to do this – it just takes a little planning and thought, then some (cheap) computer time

Simulation with Arena Chapter 5 – Detailed Modeling and Terminating Statistical Analysis 3 Output Analysis Output analysis is concerned with Designing replications Obtain most reliable info with minimum number of replications and minimum run length. Computing statistics Point and confidence interval estimation Size and independency issues Presenting them in a textual and graphical format. Aim is to understand the system behavior and generate predictions for it!

Simulation with Arena Chapter 5 – Detailed Modeling and Terminating Statistical Analysis 4 Time Frame of Simulations Terminating: Specific starting, stopping conditions – Run length will be well-defined (and finite) Steady-state: Long-run (technically forever) – Theoretically, initial conditions don’t matter (but practically they usually do) – Not clear how to terminate a simulation run This is really a question of intent of the study Has major impact on how output analysis is done Sometimes it’s not clear which is appropriate

Model 6.1 Same as Model 5.3 Number of trunk lines=26 No additional staff during 5-8 hrs. 10 runs are made For terminating case, make IID replications Run>Setup>Replication Parameters: Number of Replications =10 Check both boxes for Initialize Between Replications Outputs are saved to.dat files Statistics Module, Type=output, Data file name= Filename.dat Asli Sencer5

Outputs of Model 6.1 Asli Sencer6 Category Overview report will have some statistical- analysis results of the output across the replications

Output Precision in Model 6.1 Asli Sencer7 This information (except standard deviation) is in Category Overview report If > 1 replication specified, Arena uses cross-replication data as above For other confidence levels or graphics – Output Analyzer

8 Confidence Interval Estimation

Simulation with Arena Chapter 5 – Detailed Modeling and Terminating Statistical Analysis 9 Interpretation of a Confidence Interval

Required Number of Replications to Achieve a Certain Precision 10 Want this to be “small,” say < h where h is prespecified Chapter 5 – Detailed Modeling and Terminating Statistical Analysis Simulation with Arena

Chapter 5 – Detailed Modeling and Terminating Statistical Analysis 11 Half Width and Number of Replications s = sample standard deviation from “initial” number n 0 of replications h 0 = half width from “initial” number n 0 of replications n grows quadratically as h decreases.

Number of Replications Needed Asli Sencer12 If we require h=$250 rather than $812 for total cost, replications are needed.

Model 6.2 Asli Sencer13

Model Runs-as a trial Save the output to Total cost.dat Open Output analyzer as a separate application File>Data File>Export Export binary data in.dat file to a plain ASCII text file and save. Open Arena Input Analyzer Plot the histogram of the Total Costs Asli Sencer14

Histogram of 1000 Total Cost Values Since Total Cost values is a sum, law of large numbers apply. We see that the distribution approaches normal as the number of replications increase! Same is true for average statistics due to central limit theorem. It is not true for extreme value statistics like maximum or minimum. Asli Sencer15

Simulation with Arena Chapter 5 – Detailed Modeling and Terminating Statistical Analysis 16 Confidence Intervals (cont’d) Usual formulas assume normally-distributed data  Never true in simulation  Might be approximately true if output is an average, rather than an extreme  Central limit theorem  Issues of robustness, coverage, precision – details in book

Simulation with Arena Chapter 5 – Detailed Modeling and Terminating Statistical Analysis 17 Comparison of Alternatives Statistical Hypothesis Test Reject Ho if is significantly large or small, i.e., performance of system 1 is significantly different than system 2! Here: : Total Cost of Base Model (110 observations) : Total Cost of Alternative Model (110 observations) is the mean performance of system i

Comparing Two Scenarios Base Scenario: Model 6.4 (Same as in Model 5.3) -110 runs -26 Trunk Lines, No New Staff between 12:00-16:00 Alternative scenario: Model 6.4 (More-resources scenario) -110 runs -29 Trunk Lines, (Change the capacity from 26 to 29) -Hire three for each of Larry, Moe, Curly, Hermann and Sales Resources. (Change these variables from 0 to 3) Tradeoff is between increased salary cost but decreased excess waiting costs. Will the total costs decrease? Percent Rejected calls will decrease, but how much? Asli Sencer18

Comparison of Scenarios Runs both models for 110 times. Statistics Data Module Save output files –BaseCase.dat or -MoreResources.dat. 95% CI for total costs are Base model: 22, =[21,805, 22,544] Increased resources: 24, =[24,213, 24,871] Intervals do NOT overlap, hence Total Costs are significantly different at 5% significance level. 95% CI for percent rejected are Base model: =[11.23, 12.25] Increased resources: =[1.42, 2.04] Intervals do NOT overlap, hence Percent Rejected are significantly different at 5% significance level. Asli Sencer19

Arena Output Analyzer Separate application in Arena Operates in output files (.dat) generated by Arena through the Statistics data module Data in.dat file is in binary format to be opened by Arena Output Analyzer only! Provides confidence intervals on expected output statistics as also appear in Arena output reports. Provides statistical comparison of two scenarios, and others. Asli Sencer20

Comparison of Scenarios with Arena Output Analyzer Open Output Analyzer Select File>New to open a data group, i.e., list of.dat files Add TotalCost-BaseCase.dat TotalCost-MoreResources.dat PercentRejected-BaseCase.dat PercentRejected-MoreResources.dat Can save this data group as.dgr file to refer easily afterwards Analyze>Compare Means Add each pair of comparisons by choosing ‘lumped’ so that all 110 values are considered in the analysis Asli Sencer21

Hypothesis Tests Ho: Mean TC of base case = Mean TC of more resources Ha: Mean TC of base case ≠ Mean TC of more resources Ho: Mean % rejected of base case = Mean % rejected of more resources case Ha: Mean % rejected of base case ≠ Mean % rejected of more resources case Asli Sencer22

Output Report-Compare Means Asli Sencer23 Confidence interval on difference misses 0, so conclude that there is a (statistically) significant difference between the base model and the alternative at α=5%

Evaluating Many Scenarios with Process Analyzer Separate application in Arena Allows making multiple pairwise scenario comparisons at a time. PAN operates on Arena program files with.p extension, generated when.doe model is run. A PAN scenario includes a program file, a set of values for the input controls (decision variables in the form of variables and resources), a set of output responses. A PAN project is a collection of such scenarios that can be saved by.pan extension for future reference. Asli Sencer24

Development of a PAN Project Use Model runs Output data files are deleted since they will be useless in PAN. Open a PAN project File > New, File > Open Add a new scenario. Double click on the raw Name=Base Case, Program File=Model 6.5.p Replications=110 Add contols Right click in this line OR Insert > Control Under Resources: The capacity of trunk line Under User Specified: New Tech 1, New Tech 2, New Tech 3, New Tech All, New Sales Add responses Right click on this line OR Insert > Responses Under user specified: Total Cost, Percent Rejected Asli Sencer25

New Scenarios Suppose you have $1360/week to spend on all additional resources. To which of the six expandable resources should you allocate the new money? Then following 6 alternative scenarios apply in addition to Base Case. 13 more trunk lines ($98 each) 4 more tech 1,2,3 people ($320 each) 3 more tech all people ($360 each) 4 more sales people ( 340 each) Run the scenarios Check the scenarios to run Run > Go OR play button OR F5 function key Asli Sencer26

PAN screen Asli Sencer27

Generating Reports for Multiple Comparison in PAN Insert > Chart OR right click on a response column. Chart type=Box whiskers Check Identify Best Scenarios box Select ‘smaller is the better’ Red boxes are significantly better than blue ones at 5% significance level. To decrease the half width of a scenario, increase the number of replications of that specific one. Error tolerance is a positive value that represents an amount small enough that you don’t care if the selected scenarios are actually inferior to the true best one by at most this amount. A positive error reduce the number of selected scenarios at the risk of being off by a little bit. Asli Sencer28

A PAN Report Asli Sencer29