Outline input analysis input analyzer of ARENA parameter estimation

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Point Estimation Notes of STAT 6205 by Dr. Fan.
1 Chi-Square Test -- X 2 Test of Goodness of Fit.
Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
Inference for Regression
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Statistics review of basic probability and statistics.
 1  Outline  generation of random variates  convolution  composition  acceptance/rejection  generation of uniform(0, 1) random variates  linear.
1 The Output Analyzer Separate application, also accessible via Tools menu in Arena Reads binary files saved by Arena Various kinds of output-data display,
INTRODUCTION TO NON-PARAMETRIC ANALYSES CHI SQUARE ANALYSIS.
Evaluation (practice). 2 Predicting performance  Assume the estimated error rate is 25%. How close is this to the true error rate?  Depends on the amount.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Machine Learning CMPT 726 Simon Fraser University
Chapter 16 Chi Squared Tests.
IEEM 3201 One and Two-Sample Estimation Problems.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Inferences About Process Quality
BCOR 1020 Business Statistics
Standard error of estimate & Confidence interval.
AM Recitation 2/10/11.
Regression Analysis (2)
Chapter 5 Modeling & Analyzing Inputs
Graduate Program in Engineering and Technology Management
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
 1  Outline  input analysis  goodness of fit  randomness  independence of factors  homogeneity of data  Model
 1  Outline  stages and topics in simulation  generation of random variates.
Modeling and Simulation CS 313
Statistics for Engineer Week II and Week III: Random Variables and Probability Distribution.
Random Sampling, Point Estimation and Maximum Likelihood.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
2 Input models provide the driving force for a simulation model. The quality of the output is no better than the quality of inputs. We will discuss the.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Tests for Random Numbers Dr. Akram Ibrahim Aly Lecture (9)
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
0 K. Salah 2. Review of Probability and Statistics Refs: Law & Kelton, Chapter 4.
Ch5. Probability Densities II Dr. Deshi Ye
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Goodness-of-Fit Chi-Square Test: 1- Select intervals, k=number of intervals 2- Count number of observations in each interval O i 3- Guess the fitted distribution.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Chapter 7 Inferences Based on a Single Sample: Tests of Hypotheses.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
Tests of Random Number Generators
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.
12.1 Inference for A Population Proportion.  Calculate and analyze a one proportion z-test in order to generalize about an unknown population proportion.
Sampling and estimation Petter Mostad
Statistical Estimation Vasileios Hatzivassiloglou University of Texas at Dallas.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Chapter 18 Chi-Square Tests.  2 Distribution Let x 1, x 2,.. x n be a random sample from a normal distribution with  and  2, and let s 2 be the sample.
Building Valid, Credible & Appropriately Detailed Simulation Models
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Chi Square Chi square is employed to test the difference between an actual sample and another hypothetical or previously established distribution such.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Goodness-of-Fit Tests
Econometric Models The most basic econometric model consists of a relationship between two variables which is disturbed by a random error. We need to use.
Discrete Event Simulation - 4
Modelling Input Data Chapter5.
Simple Linear Regression and Correlation
Chapter 9 Estimation: Additional Topics
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Outline input analysis input analyzer of ARENA parameter estimation maximum likelihood estimator goodness of fit randomness independence of factors homogeneity of data

Topics in Simulation knowledge in distributions and statistics random variate generation input analysis output analysis verification and validation optimization variance reduction

Input Analysis statistical tests to analyze data collected and to build model standard distributions and statistical tests estimation of parameters enough data collected? independent random variables? any pattern of data? distribution of random variables? factors of an entity being independent from each other? data from sources of the same statistical property?

Input Analyzer of ARENA which distribution to use and what parameters for the distribution Start /Rockwell Software/Arena 7.0/Input Analyzer Choose File/New Choose File/Data File/Use Existing to open exp_mean_10.txt Fit for a particular distribution, or Fit/Fit All

Criterion for Fitting in Input Analyzer n: total number of sample points ai: actual # of sample points in ith interval ei: expected # of sample points in ith interval sum of square error to determine the goodness of fit

p-values in Input Analyzer Chi Square Test and the Kolmogorov-Smirnov Test in fitting p-value: a measure of the probability of getting such a set of sample values from the chosen distribution the larger the p-value, the better

Generate Random Variates by Input Analyzer new file in Input Analyzer Choose File/Data file/Generate New select the desirable distribution output expo.dst changing expo.dst to expo.txt

Parameter Estimation two common methods maximum likelihood estimators method of moments

Idea of Maximum Likelihood Estimators a coin flipped 10 times, giving 9 heads & then 1 tail best estimate of p = P(head)? let A be the event of 9 heads followed by 1 tail p 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 P(A|p) 0.000 0.001 0.004 0.012 0.027 0.039 p 0.8 0.825 0.85 0.875 0.9 0.925 0.95 0.975 P(A|p) 0.027 0.031 0.035 0.038 0.039 0.037 0.032 0.02

Maximum Likelihood Estimators let  be the parameter to be estimated from sample values x1, ..., xn set up the likelihood function in  choose  to maximize the likelihood function discrete distribution: where {pi} is the p.m.f. with parameter  continuous distribution: where f(x; ) is the density at x with parameter 

Examples of Maximum Likelihood Estimators Bernoulli Distribution Exponential Distribution

Method of Moments kth moment of X: E(Xk) two ways to express moments from empirical values in terms of parameters estimates of parameters by equating the two ways Examples: Bernoulli Distribution, Exponential Distribution

Is the distribution to represent the data points appropriate? Goodness-of-Fit Test Is the distribution to represent the data points appropriate?

General Idea of Hypothesis Testing coin tosses H0: P(head) = 1 H1: P(head)  1 tossed twice, both being head; accept H0? tossed 5 times, all being head; accept H0? tossed 50 times, all being head; accept H0? to believe (or disbelieve) based on evidence internal “model” of the statistic properties of the mechanism that generates evidence

Theory and Main Idea of the 2 Goodness of Fit Test (X1, X2, ..., Xk) ~ Multinomial (n; p1, p2, ..., pk)

Goodness-of-Fit Test test the underlying distribution of a population H0: the underlying distribution is F H1: the underlying distribution is not F Goodness-of-Fit Test n sample values x1, ..., xn assumed to be from F k exhaustive categories for the domain of F oi = observed frequency of x1, ..., xn in the ith category ei = expected frequency of x1, ..., xn in the ith category

Goodness-of-Fit Test “better” to have ei = ej for i not equal to j for this method to work, ei  5 choose significant level  decision: if , reject H0; otherwise, accept H0.

Goodness-of-Fit Test Example: The lives of 40 batteries are shown below. Category i: Frequency oi 1.45-1.95 2 1.95-2.45 1 2.45-2.95 4 2.95-3.45 15 3.45-3.95 10 3.95-4.45 5 4.45-4.95 3 Test the hypothesis that the battery lives are approximately normally distributed with μ = 3.5 and σ = 0.7.

Goodness-of-Fit Test Solution: First calculate the expected frequencies under the hypothesis: For category 1: P(1.45 < X < 1.95) = P[(1.45-3.5)/0.7 < Z < (1.95-3.5)/0.7] = P(-2.93 < Z <-2.21) = 0.0119. e1 = 0.0119(40)  0.5. Similarly, we can calculate other expected frequencies: ei: 0.5 2.1 5.9 10.3 10.7 7.0 3.5

Goodness-of-Fit Test Similarly, we can calculate other expected frequencies: ei: 0.5 2.1 5.9 10.3 10.7 7.0 3.5 Since some ei’s are smaller than 5, we combine some categories and get the following Category i: Frequency oi Frequency ei 1.45-2.95 7 8.5 2.95-3.45 15 10.3 3.45-3.95 10 10.7 3.95-4.95 8 10.5

Goodness-of-Fit Test accept because calculate statistics: set the level of significance:  = 0.05. degrees of freedom: k-1=3. accept because

Test for Randomness Do the data points behave like random variates from i.i.d. random variables?

Test for Randomness graphical techniques run test (not discussed) run up and run down test (not discussed)

Background random variables X1, X2, …. (assumption Xi  constant) if X1, X2, … being i.i.d. j-lag covariance Cov(Xi, Xi+j)  cj = 0 V(Xi)  c0 j-lag correlation j  cj/c0 = 0

Graphical Techniques estimate j-lag correlation from sample check the appearance of the j-lag correlation