Output Analysis: Variance Estimation

Slides:



Advertisements
Similar presentations
COMM 472: Quantitative Analysis of Financial Decisions
Advertisements

T.C ATILIM UNIVERSITY MODES ADVANCED SYSTEM SIMULATION MODES 650
 1  Outline  terminating and non-terminating systems  theories for output analysis  Strong Law of Large Numbers  Central Limit Theorem  Regenerative.
G. Alonso, D. Kossmann Systems Group
 1  Outline  Model  problem statement  detailed ARENA model  model technique  Output Analysis.
10 Further Time Series OLS Issues Chapter 10 covered OLS properties for finite (small) sample time series data -If our Chapter 10 assumptions fail, we.
STAT 497 APPLIED TIME SERIES ANALYSIS
Output analyses for single system
1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing.
Output Data Analysis. How to analyze simulation data? simulation –computer based statistical sampling experiment –estimates are just particular realizations.
Analysis of Simulation Experiments
Output Analysis and Experimentation for Systems Simulation.
The AutoSimOA Project Katy Hoad, Stewart Robinson, Ruth Davies Warwick Business School WSC 07 A 3 year, EPSRC funded project in collaboration with SIMUL8.
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
A gentle introduction to fluid and diffusion limits for queues Presented by: Varun Gupta April 12, 2006.
SUMS OF RANDOM VARIABLES Changfei Chen. Sums of Random Variables Let be a sequence of random variables, and let be their sum:
#9 SIMULATION OUTPUT ANALYSIS Systems Fall 2000 Instructor: Peter M. Hahn
Lecture 9 Output Analysis for a Single Model. 2  Output analysis is the examination of data generated by a simulation.  Its purpose is to predict the.
The AutoSimOA Project Katy Hoad, Stewart Robinson, Ruth Davies Warwick Business School OR49 Sept 07 A 3 year, EPSRC funded project in collaboration with.
4. Review of Basic Probability and Statistics
1 Simulation Modeling and Analysis Output Analysis.
191 Drawing Statistical Inference from Simulation Runs......the "fun" stuff!
Review of Probability and Statistics
15 Statistical Quality Control CHAPTER OUTLINE
Methods and Philosophy of Statistical Process Control
X-bar and R Control Charts
Chapter 51Introduction to Statistical Quality Control, 5th Edition by Douglas C. Montgomery. Copyright (c) 2005 John Wiley & Sons, Inc.
Lecture II-2: Probability Review
Control charts : Also known as Shewhart charts or process-behaviour charts, in statistical process control are tools used to determine whether or not.
Statistical Process Control
1 Terminating Statistical Analysis By Dr. Jason Merrick.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
References for M/G/1 Input Process
Analysis of Simulation Results Andy Wang CIS Computer Systems Performance Analysis.
Simulation Output Analysis
Fall CSC 446/546 Part 10: Estimation of Absolute Performance.
SIMULATION MODELING AND ANALYSIS WITH ARENA
 1  Outline  stages and topics in simulation  generation of random variates.
Steady-State Statistical Analysis By Dr. Jason Merrick.
Verification & Validation
Chapter 11 Output Analysis for a Single Model Banks, Carson, Nelson & Nicol Discrete-Event System Simulation.
Chapter 11 Output Analysis for a Single Model Banks, Carson, Nelson & Nicol Discrete-Event System Simulation.
MS 305 Recitation 11 Output Analysis I
© 2003, Carla Ellis Simulation Techniques Overview Simulation environments emulation exec- driven sim trace- driven sim stochastic sim Workload parameters.
0 K. Salah 2. Review of Probability and Statistics Refs: Law & Kelton, Chapter 4.
TESTING THE ASSUMPTIONS OF VARIABLES CONTROL CHARTS AND AN APPLICATION ON FOOD INDUSTRY Berna YAZICI Department of Statistics, Anadolu University Eskisehir,Turkey.
STEADY-STATE SYSTEM SIMULATION(2). REVIEW OF THE BASICS Initial Transient a.k.a. Warm-Up Period.
Lecture 2 Basics of probability in statistical simulation and stochastic programming Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius,
Elements of Stochastic Processes Lecture II
Simulation & Confidence Intervals COMP5416 Advanced Network Technologies.
1 OUTPUT ANALYSIS FOR SIMULATIONS. 2 Introduction Analysis of One System Terminating vs. Steady-State Simulations Analysis of Terminating Simulations.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Output Analysis for Simulation
Dr. Dipayan Das Assistant Professor Dept. of Textile Technology Indian Institute of Technology Delhi Phone:
Output Data Analysis. Introduction Most serious disadvantage of simulation –With a stochastic simulation, you don ’ t get exact “ answers ” from a run.
Lecture 13 Dustin Lueker. 2  Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample.
Chapter 61Introduction to Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2012 John Wiley & Sons, Inc.
K. Salahpp.1 Chapter 9 Output Analysis for Single Systems.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Copyright (c) 2005 John Wiley & Sons, Inc.
CPSC 531: System Modeling and Simulation
Chapter 11 Output Analysis for a Single Model
Statistical Methods Carey Williamson Department of Computer Science
ENGM 620: Quality Management
Carey Williamson Department of Computer Science University of Calgary
Special Control Charts II
MECH 3550 : Simulation & Visualization
Modeling and Simulation: Exploring Dynamic System Behaviour
Presentation transcript:

Output Analysis: Variance Estimation Jong-hyun Ryu

Contents Motivation (Quality Control Chart) Collecting output data Type of simulations Steady-State Simulation Stochastic Stationary Process Variance Estimation Methods Replication Method Batch mean methods Non-overlapping vs. Overlapping Standardized Time Series Additional Methods

Motivation Quality Control Chart Design of Shewhart Control Chart Simple Control Chart (Shewhart Control Chart) Design of Shewhart Control Chart Control Limits UCL & LCL Results from incorrect control limits If overestimated  detection delay If underestimated  high false alarm rate Estimating process variance (standard deviation) is critical.

Type of simulations Terminating vs. non-terminating simulations Runs for some duration of time TE, where E is a specified event that stops the simulation. Starts at time 0 under well-specified initial conditions. Ends at the stopping time TE. Bank example: Opens at 8:30 am (time 0) with no customers present and 8 of the 11 teller working (initial conditions), and closes at 4:30 pm (Te) The simulation analyst chooses to consider it a terminating system because the object of interest is one day’s operation.

Type of simulations Non-terminating simulation Runs continuously, or at least over a very long period of time. Examples: assembly lines that shut down infrequently, telephone systems, hospital emergency rooms. Initial conditions defined by the analyst. Runs for some analyst-specified period of time TE. Study the steady-state (long-run) properties of the system, properties that are not influenced by the initial conditions of the model. Terminating or non-terminating The objectives of the simulation study The nature of the system.

Output Analysis for Steady-State Simulation Consider a single run of a simulation model to estimate a steady-state or long-run characteristics of the system. The single run produces observations Y1, Y2, ... (generally the samples of an autocorrelated time series). Performance measures: Point estimator and confidence interval Independent of the initial conditions

Output Analysis for Steady-State Simulation The sample size is a design choice, with several considerations in mind: Any bias in the point estimator that is due to artificial or arbitrary initial conditions (bias can be severe if run length is too short). Desired precision of the point estimator. Budget constraints on computer resources.

Initialization Bias Methods to reduce the point-estimator bias caused by using artificial and unrealistic initial conditions: Intelligent initialization. Divide simulation into an initialization phase and data-collection phase. Intelligent initialization Initialize the simulation in a state that is more representative of long-run conditions. If the system exists, collect data on it and use these data to specify more nearly typical initial conditions. If the system can be simplified enough to make it mathematically solvable, e.g. queueing models, solve the simplified model to find long-run expected or most likely conditions, use that to initialize the simulation.

Initialization Bias No widely accepted, objective and proven technique to guide how much data to delete to reduce initialization bias to a negligible level. Plots can be misleading but they are still recommended. If averages reveal a smoother and more precise trend as the # of replications increases. If averages can be smoothed further by plotting a moving average. Cumulative average becomes less variable as more data are averaged. The more correlation present, the longer it takes for to approach steady state.

Introduction to variance estimation Simulation output analysis Point estimator and confidence interval Variance estimation  confidence interval Independent and identically distributed (IID) Suppose X1,…Xm are iid

Stochastic Stationary Process The stochastic process X is stationary for t1,…,tk, t∈ T, if Stationary time series with positive autocorrelation Stationary time series with negative autocorrelation Nonstationary time series with an upward trend

Stochastic Stationary Process (2) A discrete-time stationary process X = {Xi : i≥1} with mean µ and variance σX2 = Cov(Xi,Xi), Variance of the sample mean Easy to see For iid,

Stochastic Stationary Process (3) The expected value of the variance estimator is: If Xi are independent, then is an unbiased estimator of If the autocorrelation is positive, then is biased low as an estimator of If the autocorrelation is negative, then is biased high as an estimator of

Replication Method Use to estimate point-estimator variability and to construct a confidence interval. Approach: make R replications, initializing and deleting from each one the same way. Important to do a thorough job of investigating the initial-condition bias: Bias is not affected by the number of replications, instead, it is affected only by deleting more data or extending the length of each run (i.e. increasing TE). Basic raw output data {Xrj, r = 1, ..., R; j = 1, …, n} is derived by: Individual observation from within replication r. Batch mean from within replication r of some number of discrete-time observations. Batch mean of a continuous-time process over time interval j.

Replication Method Length of each replication (n) beyond deletion point (d): (n - d) > 10d Number of replications (R) should be as many as time permits, up to about 25 replications. For a fixed total sample size (n), as fewer data are deleted (d decreases) C.I. shifts: greater bias. Decrease Variance Trade off between bias and variance.

Batch means method When using a single, long replication: Problem : Data are dependent so the usual estimator is biased Replication method (more than 25 replications) is expensive One solution: Batch means method

Batch means method (Nonoverlapping batch mean) Non-overlapping batch mean (NBM) m observations with batch mean Y1,m Batch 1 m observations with batch mean Yk,m Batch k

Nonoverlapping batch mean (NBM) Suppose the batch means become uncorrelated as m  ∞ NBM estimator for σ2 Confidence Interval

Overlapping Batch Mean (OBM) Y1,m Y2,m OBM estimator for σ2

NBM vs. OBM Under mild conditions Variance of the estimators Thus, both have similar bias Variance of the estimators Thus, the OBM method gives better MSE

Standardized Time Series Define the ‘centered’ partial sums of Xi as Central Limit Theorem Define the continuous time process Question: How does Zn(t) behave as n increases?

n=100

n=1000

n=10000

n=1,000,000 Unlike S, the S’ curve remains erratic even on larger time scales. However, the sample paths of S’ do converge to a limiting process given by the FCLT

Functional Central Limit Theorem (FCLT) where B(t) is the standard Brownian motion (drift coefficient =0, diffusion coefficient =1) Brownian motion with drift coefficient  and diffusion coefficient 2 is a real valued stochastic process with stationary and independent increments having continuous sample paths where Thus,

Functional Central Limit Theorem (FCLT) While CLT says that for any t, FCLT also shows that Zn(t) converges to an asymptotically continuous stochastic process with independent increments.

The Weighted Area Estimator If we can define the function f such that Continuous [0,1] Normalized so that Then, Define

The Weighted Area Estimator Since and is a variance estimator

Additional Methods Regenerative Method (Fishman 1973; Shedler 1993) Spectral Theory (Heidelberger and Welch 1981, 1983; Damerdji 1991) Quantile estimation (Heidelberger and Lewis 1984; Seila 1982) Estimation of functions of means (Munoz and Glynn 1997) Estimation of multivariate means (Anderson 1984; Charnes 1989, 1995; Seila 1984)

Reference Alexopoulos, C., D. Goldsman, and R. F. Serfozo. 2005. Stationary processes: statistical estimation. In The Encyclopedia of Statistical Sciences, ed. N. L. Johnson, and C. B. Read, 2nd edition. New York: John Wiley & Sons Kim, S.-H., C. Alexopoulos, K.-L. Tsui, and J. R. Wilson. 2006. A distribution-free tabular CUSUM chart for autocorrelated data. IIE Transactions, forthcoming.   Alexopoulos, C., N. T. Argon, D. Goldsman, and G. Tokol. 2004. Overlapping variance estimators for simulations. Technical Report, School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia. Law, A. M., and W. D. Kelton. 2000 Simulation Modeling and Analysis, 3rd ed. New York: McGraw-Hill.