Eawag: Swiss Federal Institute of Aquatic Science and Technology Analyzing input and structural uncertainty of deterministic models with stochastic, time-dependent.

Slides:



Advertisements
Similar presentations
Eawag: Swiss Federal Institute of Aquatic Science and Technology Analyzing possible causes of bias of hydrological models with stochastic, time-dependent.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Probability & Statistical Inference Lecture 9
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Correlation and regression
Markov-Chain Monte Carlo
Model Assessment, Selection and Averaging
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
Statistics for the Social Sciences
Chapter 10 Simple Regression.
Simulation Modeling and Analysis
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Chapter Topics Types of Regression Models
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Chapter 11 Multiple Regression.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Simple Linear Regression Analysis
Extreme Value Analysis, August 15-19, Bayesian analysis of extremes in hydrology A powerful tool for knowledge integration and uncertainties assessment.
Ch. 14: The Multiple Regression Model building
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Lecture II-2: Probability Review
Introduction to Regression Analysis, Chapter 13,
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression Analysis
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Correlation & Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Hydrologic Modeling: Verification, Validation, Calibration, and Sensitivity Analysis Fritz R. Fiedler, P.E., Ph.D.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
Eawag: Swiss Federal Institute of Aquatic Science and Technology Problems of Inference and Uncertainty Estimation in Hydrologic Modelling Peter Reichert.
1 Multivariate Linear Regression Models Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
TODAY we will Review what we have learned so far about Regression Develop the ability to use Residual Analysis to assess if a model (LSRL) is appropriate.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
1 G Lect 14M Review of topics covered in course Mediation/Moderation Statistical power for interactions What topics were not covered? G Multiple.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Simple Linear Regression ANOVA for regression (10.2)
Model Selection and Validation. Model-Building Process 1. Data collection and preparation 2. Reduction of explanatory or predictor variables (for exploratory.
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
Eawag: Swiss Federal Institute of Aquatic Science and Technology Mechanism-Based Emulation of Dynamic Simulation Models – Concept and Application in Hydrology.
Correlation & Regression Analysis
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
- 1 - Calibration with discrepancy Major references –Calibration lecture is not in the book. –Kennedy, Marc C., and Anthony O'Hagan. "Bayesian calibration.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Linear Regression Linear Regression. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Purpose Understand Linear Regression. Use R functions.
- 1 - Preliminaries Multivariate normal model (section 3.6, Gelman) –For a multi-parameter vector y, multivariate normal distribution is where  is covariance.
Eawag: Swiss Federal Institute of Aquatic Science and Technology Analyzing input and structural uncertainty of a hydrological model with stochastic, time-dependent.
Eawag: Swiss Federal Institute of Aquatic Science and Technology Use of time-dependent parameters for improvement and uncertainty estimation of dynamic.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Bayesian analysis of a conceptual transpiration model with a comparison of canopy conductance sub-models Sudeep Samanta Department of Forest Ecology and.
1 Simple Linear Regression Chapter Introduction In Chapters 17 to 19 we examine the relationship between interval variables via a mathematical.
The simple linear regression model and parameter estimation
Inference for Least Squares Lines
Linear Regression.
BA 275 Quantitative Business Methods
CHAPTER 29: Multiple Regression*
CHAPTER- 17 CORRELATION AND REGRESSION
Migration and the Labour Market
Our theory states Y=f(X) Regression is used to test theory.
Product moment correlation
Chapter 14 Inference for Regression
Presentation transcript:

Eawag: Swiss Federal Institute of Aquatic Science and Technology Analyzing input and structural uncertainty of deterministic models with stochastic, time-dependent parameters Peter Reichert Eawag Dübendorf, ETH Zürich and SAMSI

SAMSI meeting Feb. 1, 2007 Contents Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion  Motivation  Approach  Implementation [Tomassini, Reichert, Künsch and Borsuk, 2007, subm.]  Results for a simple epidemiological model [Cintron-Arias, Reichert, Lloyd, Banks, 2007, initiated]  Results for a simple hydrologic model [Reichert and Mieleitner, 2007, in progress]  Discussion

SAMSI meeting Feb. 1, 2007 Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Motivation Problem:  In most applications, there is a bias in results of deterministic models.  This bias is typically caused by input and model structure errors, and not by correlated measurement errors.  Bias modelling describes this bias statistically, but does not directly support the identification of its causes and the formulation of improved model structures. Objective:  Search for improved model structures or stochastic influence factors to model input or parameters that reduce/remove the bias. Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Approach Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Approach The proposed approach consists of the following steps: 1)Replace selected (constant) parameters of the deterministic model by a stochastic process in time. 2)Estimate the states of this process jointly with the model parameters. 3)Analyze the degree of bias reduction achieved by doing this for different parameters. 4)Analyze the identified time dependence of the parameters for correlation with external influence factors and internal model variables. 5)Improve the deterministic model if such dependences were found. 6)Explain the remaining stochasticity (if any) of the model by including the appropriate stochastic parameter(s) into the model description. Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Implementation Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Model Deterministc model: Consideration of observation error: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Model Model with parameter i time dependent: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Time Dependent Parameter This has the advantage that we can use the analytical solution: The time dependent parameter is modelled by a mean-reverting Ornstein Uhlenbeck process: or, after reparameterization: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Inference We combine the estimation of  constant model parameters,, with  state estimation of the time-dependent parameter(s),, and with  the estimation of (constant) parameters of the Ornstein-Uhlenbeck process(es) of the time dependent parameter(s),. Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Inference Gibbs sampling for the three different types of parameters. Conditional distributions: Ornstein-Uhlenbeck process (cheap) simulation model (expensive) Ornstein-Uhlenbeck process (cheap) Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Inference Metropolis-Hastings sampling for each type of parameter: Multivariate normal jump distributions for the parameters  and . This requires one simulation to be performed per suggested new value of . The discretized Ornstein-Uhlenbeck parameter,, is split into subintervals for which OU-process realizations conditional on initial and end points are sampled. This requires the number of subintervals simulations per complete new time series of . Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Estimation of Hyperparameters by Cross - Validation Due to identifiability problems we select the two hyperparameters (  ) by cross-validation: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Estimation of Hyperparameters by Cross - Validation For a state-space model of the form we can estimate the pseudo-likelihood from the sample: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Results for Epidemiological Model Results for a Simple Epidemiological Model  Model  Model Application  Preliminary Results Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Model A Simple Epidemiological Model: Dushoff et al S :susceptible I :infected N :„total population“ N-S-I :resistant model parameters 2initial conditions 1standard dev. of meas. err. Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Model Application Influenza Outbreaks in the USA Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Model predictions and data Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Residual analysis – constant parameters Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Residual analysis – beta0 time varying Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Time dependent parameter: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Next Steps  More Fourier terms, more general periodicity.  Cross validation for choice of  and .  Interpretation of remaining stochasticity.  Inclusion of identified forcing in model equations.  Inclusion of stochastic parameter in model equations.  Different data set: measles. Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary for Hydrologic Model Results for a Simple Hydrologic Model  Model  Model Application  Preliminary Results Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Model A Simple Hydrologic Watershed Model (1): Kuczera et al Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Model A Simple Hydrologic Watershed Model (2): Kuczera et al A B 7 model parameters 3 initial conditions 1 standard dev. of meas. err. 3 „modification parameters“ C Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Model Application Model application:  Data set of Abercrombie watershed, New South Wales, Australia (2770 km 2 ), kindly provided by George Kuczera (Kuczera et al. 2006).  Box-Cox transformation applied to model and data to decrease heteroscedasticity of residuals.  Step function input to account for input data in the form of daily sums of precipitation and potential evapotranspiration.  Daily averaged output to account for output data in the form of daily average discharge. Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Model Application Prior distribution: Estimation of constant parameters: Independent uniform distributions for the loga- rithms of all parameters (7+3+1=11), keeping correction factors (f rain, f pet, f Q ) equal to unity. Estimation of time-dependent parameters: Ornstein-Uhlenbeck process applied to log of the parameter. Hyper-parameters:  = 5d,  fixed, only estimation of initial value and mean (0 for log f rain, f pet, f Q ). Constant parameters as above. Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Posterior marginals: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Max. post. simulation with constant parameters: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Residuals of max. post. sim. with const. pars.: Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Residual analysis, max. post., constant parameters Residual analysis, max. post., q_gw_max time-dependent Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Residual analysis, max. post., s_F time-dependent Residual analysis, max. post., f_rain time-dependent Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Residual analysis, max. post., f_pet time-dependent Residual analysis, max. post., f_Q time-dependent Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Residual analysis, max. post., k_et time-dependent Residual analysis, max. post., q_latmax time-dependent Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Residual analysis, max. post., k_s time-dependent NSparameterNSparameter 0.90f_rain0.79f_pet 0.88s_F0.69q_gwmax 0.87k_s0.67q_latmax 0.83f_Q0.67k_et const. parameter: 0.64 Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Time-dependent parameters Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Time-dependent parameters Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Scatter plot of parameter vs model variables Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Scatter plot of parameter vs model variables Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Scatter plot of parameter vs model variables Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Scatter plot of parameter vs model variables Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Preliminary Results Scatter plot of parameter vs model variables Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Next Steps  Search for correlations between time dependent parameters and external influence factors and system variables.  Improve formulation of deterministic model and redo the analysis.  Cross validation for choice of  and .  Inclusion of stochastic parameter in model equations to explain remaining stochasticity. Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Discussion Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

SAMSI meeting Feb. 1, 2007 Discussion 1.Other formulations of time-dependent parameters? 2.Dependence on other factors than time? 3.How to estimate hyperparameters? (Reduction in correlation time always improves the fit.) 4.How to avoid modelling physical processes with the bias term? 5.Learn from more applications. 6.Compare results with methodology by Bayarri et al. (2005). Combine/extend the two methodologies? 7.How to improve efficiency? 8.How to combine statistical with contextual knowledge? 9.? Motivation Approach Implementation Results for epidem. Model Results for hydrol. model Discussion

Eawag: Swiss Federal Institute of Aquatic Science and Technology Thank you for your attention