Rutgers Intelligent Transportation Systems (RITS) Laboratory

Slides:



Advertisements
Similar presentations
Mean, Proportion, CLT Bootstrap
Advertisements

Chapter 7 Statistical Data Treatment and Evaluation
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Statistical Estimation and Sampling Distributions
Fundamentals of Data Analysis Lecture 12 Methods of parametric estimation.
POINT ESTIMATION AND INTERVAL ESTIMATION
Experimental Design, Response Surface Analysis, and Optimization
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Errors & Uncertainties Confidence Interval. Random – Statistical Error From:
Topic 6: Introduction to Hypothesis Testing
Output Analysis and Experimentation for Systems Simulation.
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Evaluating Hypotheses
Chapter 8 Estimation: Single Population
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Chapter 6 Numerical Interpolation
Inferential Statistics
Lecture II-2: Probability Review
1 Assessment of Imprecise Reliability Using Efficient Probabilistic Reanalysis Farizal Efstratios Nikolaidis SAE 2007 World Congress.
Review of normal distribution. Exercise Solution.
CHAPTER 15 S IMULATION - B ASED O PTIMIZATION II : S TOCHASTIC G RADIENT AND S AMPLE P ATH M ETHODS Organization of chapter in ISSO –Introduction to gradient.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Analysis of Monte Carlo Integration Fall 2012 By Yaohang Li, Ph.D.
Stochastic Approximation and Simulated Annealing Lecture 8 Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius, Lithuania EURO Working.
AN ITERATIVE METHOD FOR MODEL PARAMETER IDENTIFICATION 4. DIFFERENTIAL EQUATION MODELS E.Dimitrova, Chr. Boyadjiev E.Dimitrova, Chr. Boyadjiev BULGARIAN.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
General Principle of Monte Carlo Fall 2013 By Yaohang Li, Ph.D.
PARAMETRIC STATISTICAL INFERENCE
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Module 1: Statistical Issues in Micro simulation Paul Sousa.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
Stats 845 Applied Statistics. This Course will cover: 1.Regression –Non Linear Regression –Multiple Regression 2.Analysis of Variance and Experimental.
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
FORECASTING METHODS OF NON- STATIONARY STOCHASTIC PROCESSES THAT USE EXTERNAL CRITERIA Igor V. Kononenko, Anton N. Repin National Technical University.
Chapter 7: The Distribution of Sample Means
Chapter 36 Quality Engineering (Part 1) (Review) EIN 3390 Manufacturing Processes Fall, 2010.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
Chapter 19 Monte Carlo Valuation. © 2013 Pearson Education, Inc., publishing as Prentice Hall. All rights reserved.19-2 Monte Carlo Valuation Simulation.
Uncertainties in Measurement Laboratory investigations involve taking measurements of physical quantities. All measurements will involve some degree of.
Virtual University of Pakistan
Estimating standard error using bootstrap
Sampling and Sampling Distribution
OPERATING SYSTEMS CS 3502 Fall 2017
Chapter 5 STATISTICAL INFERENCE: ESTIMATION AND HYPOTHESES TESTING
ESTIMATION.
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
LECTURE 33: STATISTICAL SIGNIFICANCE AND CONFIDENCE (CONT.)
Sampling Distributions and Estimation
36.1 Introduction Objective of Quality Engineering:
Al-Imam Mohammad Ibn Saud University Large-Sample Estimation Theory
Rutgers Intelligent Transportation Systems (RITS) Laboratory
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Unfolding Problem: A Machine Learning Approach
Introduction to Instrumentation Engineering
CONCEPTS OF ESTIMATION
Lecture 2 – Monte Carlo method in finance
Slides for Introduction to Stochastic Search and Optimization (ISSO) by J. C. Spall CHAPTER 15 SIMULATION-BASED OPTIMIZATION II: STOCHASTIC GRADIENT AND.
Virtual University of Pakistan
Introduction to Estimation
Lecture 7 Sampling and Sampling Distributions
Parametric Methods Berlin Chen, 2005 References:
Unfolding with system identification
Advanced Algebra Unit 1 Vocabulary
Presentation transcript:

Rutgers Intelligent Transportation Systems (RITS) Laboratory Department of Civil & Environmental Engineering An Investigation of the Convergence and Accuracy Properties of Latin Hypercube Sampling Technique for Traffic Equilibrium Problem under Capacity Uncertainty Paper: 10-3504 Jian Li, M.Sc. and Kaan Ozbay, Ph.D. Rutgers, The State University of New Jersey Abstract The traffic equilibrium problem under capacity uncertainty (TEPCU) has drawn significant interest in recent years, mainly because of the need to incorporate the stochastic nature of link capacities in the transportation planning process. A common approach for solving TEPCU is to use a sampling technique that randomly selects subsets of the uncertainty set to obtain approximate solutions. Latin hypercube sampling (LHS) is one of the most frequently used sampling methods, and it can provide accurate approximations. However, the main concern when using LHS is the large required sample size, which is important in the application of LHS for TEPUC because of the high computational time required for large networks. The main objective of this paper is to conduct an in-depth analysis of the convergence and approximation accuracy properties of LHS. Several computational tests are conducted using two different networks, with the goal of determining an efficient sample size that can be used to obtain an accurate approximate solution of TEPUC at a given level of confidence. The results provide us with a better understanding of the requirements of an appropriate experimental design for applying LHS to TEPUC on large transportation networks. Methodology Formulation The traffic assignment formulated by Wardrop (1952) has been widely used under the standard assumption of deterministic origin–destination (OD) demand and link capacity conditions. Incorporating above link capacity distributions, and supposing that there are m links in the network, the uncertain capacity parameter vector can be viewed as a variable vector, with each item having a probability distribution. The following stochastic programming problem is then formulated: Where F(x , ξ) is the objective function that will derive the traffic assignment toward either SO or UE, and ξ(ω) is link capacity, a random vector calculated on the basis of certain probability distributions. For solving approach, sample-average approximation (SAA), was employed and then minimized using a deterministic optimization algorithm. FIGURE 2 Flowchart of repeated LHS based SAA Solution Steps NUMERICAL EXPERIMENTS Nguyen–Dupuis Network Performance Measures Error of the Estimate Where S = the sample standard deviation K = the sample size. Accuracy of Approximation Where, = the expected value of K sample size = the value of benchmark The true expected value is approximated by the benchmark obtained by running the MC algorithm for 100,000 capacity realizations. Results Required Sample Size for the Convergence of LHS For both networks, the convergence rate was related only to the sample size and that there was no significant difference among the different number of bin fractions. When the sample size was increased, the error of the estimate decreased and the convergence rate gradually slowed down. The inflection point of the curve shown in Figure 3 (i.e., the point where the slope of the curve changed from sharp to flat) is between 500 and 2,000 sample sizes. FIGURE 3 Change in Error of the Estimate for Different Bin Fractions and Sample Sizes Approximation Accuracy of LHS For test networks with predetermined numbers of bin fractions, the value of approximation accuracy consistently decreased when the sample size is increased. Moreover, the predetermined number of bin fractions had a significant effect on the approximation accuracy. For the same sample size, when the number of bin fractions increased, the value of approximation accuracy improved. However, the approximation accuracy did not change when the number of bin fractions was large. FIGURE 4 Change in the Approximation Accuracy for Different Bin Fractions and Sample Sizes Comparison of Performance of LHS and MC The output results by LHS quickly converged and became stable when an acceptably small sample size was used and when the number of bin fractions was large. This represents a significant advantage over MC. Although the approximation accuracy of LHS is slightly worse than that of MC, this can be considered as a compromise between the approximation accuracy and computational time requirements. FIGURE 5 Comparison of the Approximation Accuracy of LHS and MC for Different Bin Fractions and Sample Sizes Conclusion This study investigated the convergence and approximation accuracy properties of LHS for TEPCU. The results indicate that: Convergence rate was related only to the sample size and that no significant difference was observed as a result of the differences in the size of the predetermined bin fractions. When the sample size was increased, the error of the estimate decreased and the convergence rate gradually slowed down. The predetermined number of bin fractions had a significant effect on approximation accuracy. For the same sample size, when the number of bin fractions increased, the value of approximation accuracy improved. However, the approximation accuracy exhibited a stable trend when the number of bin fractions was large. Introduction For modeling purposes, roadway capacity uncertainty is usually treated as a random variable under certain distribution instead of as a deterministic value. A common approach for solving the TEPCU problem is to use sampling techniques that randomly select subsets of the uncertainty set to obtain approximate solutions. Latin hypercube sampling (LHS) is a stratified sampling method that can reduce the variance in the MC estimate of the integrand significantly. An example of LHS with two input variables and five bin fractions is shown in Figure 1. FIGURE 1 Example of LHS with 2 Variables and 5 Intervals The main shortcoming of the LHS stratification scheme is: “it is one-dimensional and does not provide good uniform properties on a multi-dimensional unit hypercube” (Diwekar, 2003). The outcome of LHS can be considered only as a probabilistic output. Thus, the first and most important questions related to the application of repeated LHS procedure for TEPUC are these: How should one determine the number of bin fractions? How many corresponding sample sizes or additions of LHS are required to produce the desired output accuracy at the minimum level of confidence acceptable to the transportation planner?