Using an emulator. Outline So we’ve built an emulator – what can we use it for? Prediction What would the simulator output y be at an untried input x.

Slides:



Advertisements
Similar presentations
Bayesian tools for analysing and reducing uncertainty Tony OHagan University of Sheffield.
Advertisements

Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Introduction to Monte Carlo Markov chain (MCMC) methods
Macroeconomic Themes:31 Contribution of Monetarism in Macroeconomic Policy Supply of money is the determinant of the national income In the long run, the.
Macroeconomic Analysis 2003
Simple Linear Regression 1. review of least squares procedure 2
Constructing a Task List ITSW 1410 Presentation Media Software Instructor: Glenda H. Easter.
Hypothesis Tests: Two Independent Samples
Uncertainty and Sensitivity Analysis of Complex Computer Codes
Emulators and MUCM. Outline Background Simulators Uncertainty in model inputs Uncertainty analysis Case study – dynamic vegetation simulator Emulators.
Insert Date HereSlide 1 Using Derivative and Integral Information in the Statistical Analysis of Computer Models Gemma Stephenson March 2007.
Dialogue Policy Optimisation
Running a model's adjoint to obtain derivatives, while more efficient and accurate than other methods, such as the finite difference method, is a computationally.
Design of Experiments Lecture I
Commonly Used Distributions
CHAPTER 8 More About Estimation. 8.1 Bayesian Estimation In this chapter we introduce the concepts related to estimation and begin this by considering.
Sensitivity Analysis In deterministic analysis, single fixed values (typically, mean values) of representative samples or strength parameters or slope.
Experimental Design, Response Surface Analysis, and Optimization
Training Manual Aug Probabilistic Design: Bringing FEA closer to REALITY! 2.5 Probabilistic Design Exploring randomness and scatter.
Computational statistics 2009 Sensitivity analysis (SA) - outline 1.Objectives and examples 2.Local and global SA 3.Importance measures involving derivatives.
Climate case study. Outline The challenge The simulator The data Definitions and conventions Elicitation Expert beliefs about climate parameters Expert.
1 PERFORMANCE EVALUATION H Often one needs to design and conduct an experiment in order to: – demonstrate that a new technique or concept is feasible –demonstrate.
Discrete Event Simulation How to generate RV according to a specified distribution? geometric Poisson etc. Example of a DEVS: repair problem.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 14 1 MER301: Engineering Reliability LECTURE 14: Chapter 7: Design of Engineering.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
A general assistant tool for the checking results from Monte Carlo simulations Koi, Tatsumi SLAC/SCCS.
Evaluating Hypotheses
Value of Information Some introductory remarks by Tony O’Hagan.
1 Validation and Verification of Simulation Models.
Lecture 10 Comparison and Evaluation of Alternative System Designs.
Value of Information for Complex Economic Models Jeremy Oakley Department of Probability and Statistics, University of Sheffield. Paper available from.
Robert M. Saltzman © DS 851: 4 Main Components 1.Applications The more you see, the better 2.Probability & Statistics Computer does most of the work.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Lecture II-2: Probability Review
Decision analysis and Risk Management course in Kuopio
Gaussian process modelling
Kalman filtering techniques for parameter estimation Jared Barber Department of Mathematics, University of Pittsburgh Work with Ivan Yotov and Mark Tronzo.
Applications of Bayesian sensitivity and uncertainty analysis to the statistical analysis of computer simulators for carbon dynamics Marc Kennedy Clive.
Sampling Distributions
Monte Carlo Simulation CWR 6536 Stochastic Subsurface Hydrology.
Copyright © 2010 Lumina Decision Systems, Inc. Monte Carlo Simulation Analytica User Group Modeling Uncertainty Series #3 13 May 2010 Lonnie Chrisman,
Probabilistic Mechanism Analysis. Outline Uncertainty in mechanisms Why consider uncertainty Basics of uncertainty Probabilistic mechanism analysis Examples.
PARAMETRIC STATISTICAL INFERENCE
Emulation, Uncertainty, and Sensitivity
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Using an emulator. Outline So we’ve built an emulator – what can we use it for? Prediction What would the simulator output y be at an untried input x?
Engineering Economic Analysis Canadian Edition
Sampling Distributions Chapter 7. The Concept of a Sampling Distribution Repeated samples of the same size are selected from the same population. Repeated.
17 May 2007RSS Kent Local Group1 Quantifying uncertainty in the UK carbon flux Tony O’Hagan CTCD, Sheffield.
Sensitivity and Importance Analysis Risk Analysis for Water Resources Planning and Management Institute for Water Resources 2008.
Measures of variability: understanding the complexity of natural phenomena.
Machine Design Under Uncertainty. Outline Uncertainty in mechanical components Why consider uncertainty Basics of uncertainty Uncertainty analysis for.
RESEARCH & DATA ANALYSIS
Neural Networks Presented by M. Abbasi Course lecturer: Dr.Tohidkhah.
1 1 Slide Simulation Professor Ahmadi. 2 2 Slide Simulation Chapter Outline n Computer Simulation n Simulation Modeling n Random Variables and Pseudo-Random.
Options and generalisations. Outline Dimensionality Many inputs and/or many outputs GP structure Mean and variance functions Prior information Multi-output,
The KOSMOSHOWS What is it ? The statistic inside What it can do ? Future development Demonstration A. Tilquin (CPPM)
Probabilistic Slope Stability Analysis with the
Introduction to emulators Tony O’Hagan University of Sheffield.
Statistical Concepts Basic Principles An Overview of Today’s Class What: Inductive inference on characterizing a population Why : How will doing this allow.
8 Sept 2006, DEMA2006Slide 1 An Introduction to Computer Experiments and their Design Problems Tony O’Hagan University of Sheffield.
Chapter 13 Simple Linear Regression
Market-Risk Measurement
March 9th, 2015 Maxime Lapalme Nazim Ould-Brahim
Monte Carlo Simulation Managing uncertainty in complex environments.
Statistical Methods For Engineers
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Presentation transcript:

Using an emulator

Outline So we’ve built an emulator – what can we use it for? Prediction What would the simulator output y be at an untried input x ? Uncertainty analysis Given uncertainty in x, what is the implied uncertainty in y ? Sensitivity analysis Which inputs influence the output most? Which inputs are responsible for most output uncertainty? Calibration Given observation of the real system, how can we use that to learn about the best input values? EGU short course - session 32

Prediction and UA

Prediction Prediction is simple because that’s precisely what the emulator does For any given x, the emulator mean E(f(x)) is an estimate The emulator variance var[f(x)] expresses uncertainty Known as code uncertainty Similarly, given x and some threshold c we can evaluate P[f(x) > c] EGU short course - session 34

Uncertainty analysis If X has distribution g(x) then UA looks at the implied distribution of Y = f(X) How do we evaluate that? In Session 1 we used Monte Carlo for a simple nonlinear simulator Mean = Median = Std. dev. = But all these are estimates Accuracy depends on the size of the Monte Carlo sample 95% interval for the mean is (0.116, 0.118) EGU short course - session 35

UA with an emulator Consider the expected output M = E[Y] It is uncertain because of code uncertainty The emulator mean value for M is E[M] = ∫ E[f(x)] g(x) dx We can evaluate this by Monte Carlo Sample many values of x, evaluate the emulator mean E[f(x)] for each and average them This is already much faster than making many simulator runs to evaluate f(x) But we can often do the integral exactly EGU short course - session 36

Why emulation is more efficient Similarly we can evaluate var[M] This is code uncertainty and depends on the number of simulator runs to build the emulator We want to compute/estimate M sufficiently accurately, so we want var[M] sufficiently small Emulation is more efficient because we can typically achieve the desired accuracy using far fewer simulator runs to build the emulator than using traditional methods For the simple nonlinear model, using only 25 simulator runs to build the emulator, a 95% interval for M is (0.1173, ) Using the emulator we can also compute/estimate all those other quantities of interest, like var[Y] or P[Y > c] EGU short course - session 37

Sensitivity analysis

Which inputs affect the output most? This is a common question Sensitivity analysis (SA) attempts to address it There are various forms of SA The methods most frequently used are not the most helpful! EGU short course - session 39

Recap – the nonlinear model The simple nonlinear model of the first session y = sin(x 1 )/{1+exp(x 1 +x 2 )} Just two inputs Uncertainty analysis: Normal distributions on inputs Output mean = 0.117, median = Std. dev. = Which of these two inputs influences output most? And in what ways? EGU short course - session 310

Local sensitivity analysis To measure the sensitivity of y to input x i, compute the derivative of y with respect to x i Nonlinear model: At x 1 = x 2 = 0.5, the derivatives are wrt x 1, 0.142; wrt x 2, –0.094 How useful is this? Derivatives evaluated only at the central estimate Could be quite different at other points nearby Doesn’t capture interactions between inputs E.g. sensitivity of y to increasing both x 1 and x 2 could be greater or less than the sum of their individual sensitivities Not invariant to change of units EGU short course - session 311

If you do want use derivatives… Calculating the derivatives (adjoints) of a complex simulator is difficult Even with automatic adjoint compilers Differentiating a GP emulator is trivial The derivative of a GP is another GP So if you build an emulator derivative information is free EGU short course - session 312

One-way SA Vary inputs one at a time from central estimate Nonlinear model: Vary x 1 to 0.25, 0.75, output is 0.079, Vary x 2 to 0.25, 0.75, output is 0.154, Is this more useful? Depends on how far we vary each input Relative sensitivities of different inputs change if we change the ranges But ranges are arbitrary Also fails to capture interactions Statisticians have known for decades that varying factors one at a time is bad experimental design! EGU short course - session 313

Multi-way SA Vary factors two or more at a time Maybe statistical factorial design Full factorial designs require very many runs Can find interactions but hard to interpret Often just look for the biggest change of output among all runs Still dependent on how far we vary each input EGU short course - session 314

Probabilistic SA Inputs varied according to their probability distributions As in Uncertainty Analysis (UA) Sensitivities still depend on ranges of distributions (variances), but these are now not necessarily arbitrary Gives an overall picture and can identify interactions EGU short course - session 315

Variance decomposition One way to characterise the sensitivity of the output to individual inputs is to compute how much of the UA variance is due to each input For the simple non-linear model, we have EGU short course - session 316 InputContribution X1X % X2X % X 1.X 2 interaction2.93 %

Main effects We can also plot the effect of varying one input averaged over the others Nonlinear model Averaging y = sin(x 1 )/{1+exp(x 1 +x 2 )} with respect to the uncertainty in x 2, we can plot it as a function of x 1 Similarly, we can plot it as a function of x 2 averaged over uncertainty in x 1 We can also plot interaction effects EGU short course - session 317

Main effects in the simple nonlinear model Red is main effect of x 1 (averaged over x 2 ) Blue is main effect of x 2 (averaged over x 1 ) EGU short course - session 318

Joint effect in the simple nonlinear model EGU short course - session 319

A more complex example 5 inputs have appreciable influence, and account for 57% of the total UA variance Interactions account for 28% EGU short course - session 320

Amplifying on variances Main effect plots amplify on the information given in the variance decomposition The variance component associated with input x i is equal to the amount by which its main effect varies over the range of uncertainty in x i EGU short course - session 321 The 5 inputs with most influence are dashed

SA summary Why SA? 1. For the model user: SA identifies which inputs it would be most useful to reduce uncertainty about 2. For the model builder: main effect and interaction plots demonstrate how the simulator is behaving Sometimes surprisingly! EGU short course - session 322

Uncertainty and Sensitivity Analysis Doing uncertainty and sensitivity analysis will tell you a lot about your simulators Emulators are a perfect tool for doing such analyses EGU short course - session 323