Presentation is loading. Please wait.

Presentation is loading. Please wait.

GEV Flood Quantile Estimators with Bayesian Shape-Parameter GLS Regression Dirceu Silveira Reis Jr., Jery R. Stedinger and Eduardo Savio Martins Fundação.

Similar presentations


Presentation on theme: "GEV Flood Quantile Estimators with Bayesian Shape-Parameter GLS Regression Dirceu Silveira Reis Jr., Jery R. Stedinger and Eduardo Savio Martins Fundação."— Presentation transcript:

1 GEV Flood Quantile Estimators with Bayesian Shape-Parameter GLS Regression Dirceu Silveira Reis Jr., Jery R. Stedinger and Eduardo Savio Martins Fundação Cearense de Meteorologia e Recursos Hídricos– FUNCEME, Fortaleza, Brazil, and Cornell University, Ithaca, NY, USA.

2 The Challenge Wish to estimate extreme flood quantiles (such as 99 percentile) and flood risk distribution by fitting 3-parameter GEV distribution using annual maximum flood records of length 25 to 100 years. Some records have censored values - only know floods were below or above a threshold, due to “zero” values and recording limitations for historical records.

3 The Problem Would like to use maximum likelihood estimators because of asymptotic properties, and ability to incorporate censored data. But absurd MLEs of   can result in moderate samples Large uncertainty in extreme flood quantile estimates BEWARE: Statisticians often call   .

4 To reduce uncertainty Use regression to derive regional shape parameter  and its variance using basin characteristics. At-site  estimates obtained with L-moments. Small-sample variance and cross-correlations of  - estimators derived by Monte Carlo analysis. Combine at-site data & regional information as a prior probability distribution on  to define Generalized Maximum Likelihood Estimators (GMLE)

5 Outline GLS Regression Model –Classical model error estimator –Bayesian analysis GLS Regional   for Illinois River Basin GEV for Illinois Sites with Regional  Conclusions

6 GLS Regression GOAL: Obtain efficient estimators of a hydrologic statistic as a function of physiographic basin characteristics. MODEL:  =  +  1 log(Area)+  2 (Slope) +... + Error

7 Why use Generalized Least Squares: different record lengths m i => different precisions PLUS cross-correlation among estimators Basic Model for  GLS Model  (    ) = Cov{  i,  j } =    I +   where  computed using average  with record lengths m i and cross-correlations  i,j between concurrent flows. ^ ^

8 GLS Analysis: Solution GLS regression model (Stedinger & Tasker, 1985, 1989)  = X  +  with parameter estimator b for  { X T  (    ) -1 X } b = X T  (    ) -1  Estimate model-error    using moments (  - X b) T  (    ) -1 (  - X b) = n - p with  (    ) =    I +  n = dimension of   vector; p = dimension of b ^ ^ ^ ^ ^ ^

9 Measures of Precision Model error variance Var[  ] = E{ [  i - x i T  ] 2 } =   2 Parameter (sampling) error variance Var[b] = (X T  -1 X) -1 Variance of Prediction for new site VP new = E{ [  0 - x 0 T b] 2 } =   2 + x 0 T (X T  -1 X) -1 x 0

10 Pseudo R 2 for GLS Not interested in total error  that includes sampling error   which cannot hope to explain; how much of critical model error  can we explain, where Var[  ] =    ? Consider the GLS model:

11 Key Lesson For range of problems, GLS provides a flexible procedure for regionalization when only imperfect estimates of hydrologic parameters are available. Generally provides unbiased estimators of model error variance    and Var(b), and efficient estimators b of 

12 Moment Estimator’s Drawbacks Estimated    often equals zero when sampling variance is dominant error. Moments procedure does not provide natural measure of precision of the model error variance    Asymptotic MLE approximations have trouble with bound at zero.

13 Likelihood function - model error     Tibagi River, Brazil, n=17) Maximum of likelihood may be at zero, but larger values are very probable. Zero clearly not in middle of likely range of values.

14 Advantages of Bayesian Analysis Provides posterior distribution of parameters  model error variance   2, and predictive distribution for dependent variable Bayesian Approach is a natural solution to the problem

15 Bayesian GLS Model Prior distribution:  ( ,    ) -Parameter  are multivariate normal (  ) -Model error variance    Exponential dist. ( ); E[    ] = = 24 Likelihood function : Assume data is multivariate N[ X ,  ]

16 Quasi-Analytic Bayesian GLS  Joint posterior distribution  Marginal posterior of    where integrate analytically normal likelihood & prior to determine f in closed-form.

17 Example of a posterior of     Tibagi, Brazil, n =17)

18 Quasi-Analytic Result From joint posterior distribution can compute marginal posterior of  and moments by 1- dimensional num. integrations

19 Example – Illinois River Basin Stations: 62 stations in midwest USA Record length: 14 to 90 years Covariates: Drainage area Main channel slope Length Area of lakes Forest cover Soil permeability index Dummy variables Z 1 and Z 2 for regions

20 Best Regional Models of  Illinois River basin (USA) ASV = average sampling variance = Avg{ x T Var[  ] x } AVP = average variance of prediction =   2 + Avg{ x T Var[  ] x }

21 What did we learn? OLS results in too large a model error variance estimate because includes sampling error. Weighted & Generalized Least Squares produce different results: cross-correlation matters. Moment versus Bayesian estimator of model error variance was important: different model error variances result; different models selected. Obtained informative regional  distribution. Pseudo-R 2 = 55%.

22 GEV  -estimates using prior information (variances in parentheses) Sites are in Illinois Basin. Geophysical prior has mean = -0.1, var = (0.122 2 ).

23 Flood Frequency for Site 2 (90 years)

24 Conclusions  GMLE - regional prior for site with n = 90, reduced by 25% uncertainty variance in 100-yr flood; for site with n = 27, reduced 100-year flood by 50% and variance 60 times!  GLS reflects precision due to different record lengths plus cross-correlations among hydrologic statistics; thus provides more realistic description of sampling errors.  Bayesian GLS: 1. quasi-analytic procedure 2. full realistic posterior of  and    plus moments 3. provides regional estimates of  and their precision

25 References Coles, S.G. and M.J. Dixon, Likelihood-Based Inference for Extreme Value Models, Extremes, 2(1), 5-23, 1999. Martins, E.S. and J.R. Stedinger, “Generalized Maximum Likelihood GEV Quantile Estimators for Hydrologic Data,” Water Resour. Res., 28(11), 3001-3010, 2000. Martins, E.S., and J.R. Stedinger, “Cross-correlation among estimators of shape”, Water Resour. Res., 38(11), doi:10.1029/2002WR001589, 2002. Reis, D.S., Jr., Flood Frequency Analysis Employing Bayesian Regional Regression and Imperfect Historical Information, Ph.D. Thesis, Cornell University, Ithaca, NY, USA, 2005. Reis, D. S., Jr., J.R. Stedinger, and E.S. Martins, Bayesian GLS Regression with application to LP3 Regional Skew Estimation, Proceedings World Water & Environmental Resources Congress 2003, Editors P. Bizier and P. DeBarry, Philadelphia, PA, American Society of Civil Engineers, June 23-26, 2003. Reis, D. S., Jr., J.R. Stedinger, and E.S. Martins, Bayesian GLS Regression with application to LP3 Regional Skew Estimation, accepted Water Resour. Res.,, May 2005. Stedinger, J.R., and G.D. Tasker, Regional Hydrologic Analysis, 1. Ordinary, Weighted and Generalized Least Squares Compared, Water Resour. Res., 21(9), 1421-1432, 1985. [correction, Water Resour. Res. 22(5), 844, 1986.]


Download ppt "GEV Flood Quantile Estimators with Bayesian Shape-Parameter GLS Regression Dirceu Silveira Reis Jr., Jery R. Stedinger and Eduardo Savio Martins Fundação."

Similar presentations


Ads by Google