Uncertainty Analysis Using GEM-SA. GEM-SA course - session 42 Outline Setting up the project Running a simple analysis Exercise More complex analyses.

Slides:



Advertisements
Similar presentations
Document Properties: adding information to your Microsoft Office documents Step 1: Add information to Document Properties What are Document Properties.
Advertisements

Sensitivity Analysis in GEM-SA Jeremy Oakley. Example  ForestETP vegetation model – 7 input parameters – 120 model runs  Objective: conduct a variance-based.
Uncertainty Analysis Using GEM-SA Tony O’Hagan. Outline  Setting up the project  Running a simple analysis  Exercise  More complex analyses.
Getting started with GEM-SA Marc Kennedy. This talk  Starting GEM-SA program  Creating input and output files  Explanation of the menus, toolbars,
Slide 1 John Paul Gosling University of Sheffield GEM-SA: a tutorial.
Sensitivity Analysis in GEM-SA. GEM-SA course - session 62 Example ForestETP vegetation model 7 input parameters 120 model runs Objective: conduct a variance-based.
CS0004: Introduction to Programming Visual Studio 2010 and Controls.
Introduction to Excel 2007 Part 2: Bar Graphs and Histograms February 5, 2008.
Validating uncertain predictions Tony O’Hagan, Leo Bastos, Jeremy Oakley, University of Sheffield.
Descriptive Statistics In SAS Exploring Your Data.
A Visual Introduction to PC SAS. Start SAS by double-clicking on the SAS icon...
Microsoft Excel 2003 Illustrated Complete Excel and Advanced Worksheet Management Customizing.
SPSS Statistical Package for the Social Sciences is a statistical analysis and data management software package. SPSS can take data from almost any type.
Introduction to SPSS Short Courses Last created (Feb, 2008) Kentaka Aruga.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
8/9/2015Slide 1 The standard deviation statistic is challenging to present to our audiences. Statisticians often resort to the “empirical rule” to describe.
Example 16.3 Estimating Total Cost for Several Products.
Introduction to Excel 2007 Part 3: Bar Graphs and Histograms Psych 209.
1 1 Slide © 2001 South-Western/Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
Using the SmartPLS Software
Introduction to VBA. This is not Introduction to Excel We’re going to assume you have a basic level of familiarity with Excel If you don’t, or you need.
Slide 1 SOLVING THE HOMEWORK PROBLEMS Simple linear regression is an appropriate model of the relationship between two quantitative variables provided.
Advanced Tables Lesson 9. Objectives Creating a Custom Table When a table template doesn’t suit your needs, you can create a custom table in Design view.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Downloading and Installing PAF Insight PAF Insight can be easily downloaded Or can be installed from a CD A license is needed t0 activate the program.
Prediction concerning Y variable. Three different research questions What is the mean response, E(Y h ), for a given level, X h, of the predictor variable?
Power Point EDU 271 Microsoft PowerPoint is a powerful tool to create professional looking presentations and slide shows. PowerPoint allows you to construct.
Gaussian process modelling
FW364 Ecological Problem Solving Lab 4: Blue Whale Population Variation [Ramas Lab]
Create Database Tables
Using FunctionUsing Function Lesson 5 © 2014, John Wiley & Sons, Inc.Microsoft Official Academic Course, Microsoft Word Microsoft Excel 2013.
Analysing your data PsychoPy saves several data files for different uses: A Microsoft Excel (spreadsheet) file that you can use for most analyses A ‘psydat’
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 2 1 Microsoft Office Access 2003 Tutorial 2 – Creating And Maintaining A.
SuccessMaker. Where are they? Math: Intranet On a server at Vanhoose Reading Web-based.
Using an emulator. Outline So we’ve built an emulator – what can we use it for? Prediction What would the simulator output y be at an untried input x?
Create Forms Lesson 5. Objectives Software Orientation The Forms group (below) is located on the Create tab in the Ribbon and can be used to create a.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
Page 1 Non-Payroll Cost Transfer Enhancements Last update January 24, 2008 What are the some of the new enhancements of the Non-Payroll Cost Transfer?
Getting started with GEM-SA Marc Kennedy Central Science Laboratory, York Tony O’Hagan, Jeremy Oakley University of Sheffield.
Introduction to SPSS. Object of the class About the windows in SPSS The basics of managing data files The basic analysis in SPSS.
Getting started with GEM-SA. GEM-SA course - session 32 This talk Starting GEM-SA program Creating input and output files Explanation of the menus, toolbars,
Using Microsoft Visual Studio 2005 Original by Suma Rao Revised by John G. McMahon ( 9/6/2008 )
FIX Eye FIX Eye Getting started: The guide EPAM Systems B2BITS.
Course ILT Forms and queries Unit objectives Create forms by using AutoForm and the Form Wizard, and add or modify form headers and footers Open and enter.
June 21, Objectives  Enable the Data Analysis Add-In  Quickly calculate descriptive statistics using the Data Analysis Add-In  Create a histogram.
11/25/2015Slide 1 Scripts are short programs that repeat sequences of SPSS commands. SPSS includes a computer language called Sax Basic for the creation.
Chapter 8 Minitab Recipe Cards. Confidence intervals for the population mean Choose Basic Statistics from the Stat menu and 1- Sample t from the sub-menu.
1 Basic Computing Skills Dr Wenwu Wang Centre for Vision Speech and Signal Processing Department of Electronic Engineering
1 Microsoft Project 2003 Starting a New Project. 2 Creating a new project plan To start Microsoft Project, click the Windows Start menu. Point to All.
Create new project or open existing project (here, we will create a new project)
Work with Tables and Database Records Lesson 3 © 2014, John Wiley & Sons, Inc.Microsoft Official Academic Course, Microsoft Word Microsoft Access.
2/24/2016Slide 1 The standard deviation statistic is challenging to present to our audiences. Statisticians often resort to the “empirical rule” to describe.
Resource Review Excel formula basics Demonstrate how to enter manual formulas Examine some of the available functions and their usage Discuss the.
How to graph your stock project performance vs. the 3 major indexes.
Introduction to emulators Tony O’Hagan University of Sheffield.
BINARY LOGISTIC REGRESSION
Probability and Statistics
Regression Analysis AGEC 784.
International Computer Driving Licence Syllabus version 5.0
Microsoft Official Academic Course, Microsoft Access 2013
DEPARTMENT OF COMPUTER SCIENCE
Microsoft Official Academic Course, Access 2016
Eviews Tutorial for Labor Economics Lei Lei
Summary (Week 1) Categorical vs. Quantitative Variables
Running a Java Program using Blue Jay.
St. Edward’s University
An Introduction to Designing and Executing Workflows with Taverna
Chapter 13 Excel Extension: Now You Try!
Presentation transcript:

Uncertainty Analysis Using GEM-SA

GEM-SA course - session 42 Outline Setting up the project Running a simple analysis Exercise More complex analyses

Setting up the project

GEM-SA course - session 44 Create a new project Select Project -> New, or click toolbar icon Project dialog appears We’ll specify the data files first

GEM-SA course - session 45 Files The “Inputs” file contains one column for each parameter and one row for each model training run (the design) The “Outputs” file contains the outputs from those runs (one column, in this example) Using “Browse” buttons, select input and output files

GEM-SA course - session 46 Our example We’ll use the example “model1” in the GEM-SA DEMO DATA directory This example is based on a vegetation model with 7 inputs RESAEREO, DEFLECT, FACTOR, MO, COVER, TREEHT, LAI The model has 16 outputs, but for the present we will consider output 4 June monthly GPP

GEM-SA course - session 47 Number of inputs Click on Options tab Select number of inputs using Or click “From Inputs File”

GEM-SA course - session 48 Define input names Click on “Names …” The “Input parameter names” dialog opens Enter parameter names Click “OK”

GEM-SA course - session 49 Complete the project We will leave all other settings at their default values for now Click “OK” The Input Parameter Ranges window appears

GEM-SA course - session 410 Close and save project Click “Defaults from input ranges” button Click “OK” Select Project -> Save Or click toolbar icon Choose a name and click “Save”

Running a simple analysis

GEM-SA course - session 412 Build the emulator Click to build the emulator A lot of things now start to happen! The log window at the bottom starts to record various bits of information A little window appears showing progress of minimisation of the roughness parameter estimation criterion The “Main Effects” tab is selected, in which several graphs are drawn Progress bar at the bottom

GEM-SA course - session 413 Focus on the log window The “Main Effects” and “Sensitivity Analysis” tabs are concerned with SA, and will be considered in the next session We are interested just now simply in Uncertainty Analysis (UA) The “Output Summary” tab contains all we need and more But the key things can be seen more simply in the log window at the bottom Diagnostics of the emulator build The basic uncertainty analysis results

GEM-SA course - session 414 Emulation diagnostics Note where the log window reports … The first line says roughness parameters have been estimated by the simplest method The values of these indicate how non-linear the effect of each input parameter is Note the high value for input 4 (MO) Estimating emulator parameters by maximising probability distribution... maximised posterior for emulator parameters: precision = sigma- squared = , roughness =

GEM-SA course - session 415 Uncertainty analysis – mean Below this, the log reports So the best estimate of the output (June GPP) is 24.1 (mol C/m 2 ) This is averaged over the uncertainty in the 7 inputs Better than just fixing inputs at best estimates There is an emulation standard error of in this figure Estimate of mean output is , with variance

GEM-SA course - session 416 Uncertainty analysis – variance The final line of the log is This shows the uncertainty in the model output that is induced by input uncertainties The variance is 73.9 Equal to a standard deviation of 8.6 So although the best estimate of the output is 24.1, the uncertainty in inputs means it could easily be as low as 16 or as high as 33 Estimate of total output variance =

Exercise

GEM-SA course - session 418 A small change Run the same model with Output 11 instead of Output 4 Calculate the coefficient of variation (CV) for this output NB: the CV is defined as the standard deviation divided by the mean

More complex analyses

GEM-SA course - session 420 Input distributions A normal (gaussian) distribution is generally a more realistic representation of uncertainty Range unbounded More probability in the middle Default is to assume the uncertainty in each input is represented by a uniform distribution Range determined by the range of values found in the input file or separately input

GEM-SA course - session 421 Changing input distributions Reopen Project dialog by Project - > Edit … or clicking on Select Options tab Click All unknown, product normal Then OK A new dialog opens to specify means and variances

GEM-SA course - session 422 Model 1 example Uniform distributions from input ranges Normal distributions to match Range about 4 std deviations Except for MO Narrower distribution UniformNormal ParameterLowerUpperMeanVariance RESAEREO DEFLECT FACTOR MO COVER TREEHT LAI

GEM-SA course - session 423 Effect on UA After running the revised model, we see: It runs faster, with no need to rebuild the emulator The mean is changed a little and variance is halved The emulator fit is unchanged Estimate of mean output is , with variance Estimate of total output variance =

GEM-SA course - session 424 Reducing MO uncertainty further If we reduce the variance of MO even more, to 49: UA mean changes a little more and variance reduces again Notice also how the emulation uncertainty has increased (0.004 for uniform) This is because the design points cover the new ranges less thoroughly Estimate of mean output is , with variance Estimate of total output variance =

GEM-SA course - session 425 A homework exercise What happens if we reduce the uncertainty in MO to zero? Two ways to do this Literally set variance to zero Select “Some known, rest product normal” on Project dialog, check the tick box for MO in the mean and variance dialog What changes do you see in the UA?

GEM-SA course - session 426 Cross-validation Reopen the Project dialog and select the Options tab Look at the bottom menu box, labelled “Cross- validation” There are 3 options None Leave-one-out Leave final 20% out CV is a way of checking the emulator fit Default is None because CV takes time

GEM-SA course - session 427 Cross Validation Root Mean-Squared Error = Cross Validation Root Mean-Squared Relative Error = percent Cross Validation Root Mean-Squared Standardised Error = Largest standardised error is for data point 61 Cross Validation variances range from to Written cross-validation means to file cvpredmeans.txt Written cross-validation variances to file cvpredvars.txt Leave-one-out CV After estimating roughness and other parameters, GEM predicts each training run point using only the remaining n-1 points Results appear in log window Close to 1 (Model 1, output 4, uniform inputs)

GEM-SA course - session 428 Leave final 20% out CV This is an even better check, because it tests the emulator on data that have not been used in any way to build it Emulator is built on first 80% of data and used to predict last 20% Standardised error a bit bigger But not bad for just 24 runs predicted Cross Validation Root Mean-Squared Error = Cross Validation Root Mean-Squared Relative Error = percent Cross Validation Root Mean-Squared Standardised Error = Largest standardised error is for data point 22 Cross Validation variances range from to 4.886

GEM-SA course - session 429 Output Summary tab The “Output Summary” tab presents all of the key results in a single list Tidier than searching for the details in the log window Although the log window actually has more information Can print using

GEM-SA course - session 430 Other options There are various other options associated with the emulator building that we have not dealt with See built in help facility for explanations Also slides at the end of session 3 But we’ve done the main things that should be considered in practice And it’s enough to be going on with!

GEM-SA course - session 431 When it all goes wrong How do we know when the emulator is not working? Large roughness parameters Especially ones hitting the limit of 99 Large emulation variance on UA mean Poor CV standardised prediction error Especially when some are extremely large In such cases, see if a larger training set helps Other ideas like transforming output scale A suite of diagnostics is being developed in MUCM See Bastos and O’Hagan on my website Not implemented in GEM-SA yet