Creating probability forecasts of binary events from ensemble predictions and prior information - A comparison of methods Cristina Primo Institute Pierre.

Slides:



Advertisements
Similar presentations
A unified linear Model Output Statistics scheme for both deterministic and ensemble forecasts S. Vannitsem Institut Royal Météorologique de Belgique Monterey,
Advertisements

Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss Extended range forecasts at MeteoSwiss: User experience.
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
What is a good ensemble forecast? Chris Ferro University of Exeter, UK With thanks to Tom Fricker, Keith Mitchell, Stefan Siegert, David Stephenson, Robin.
What is a good ensemble forecast? Chris Ferro University of Exeter, UK With thanks to Tom Fricker, Keith Mitchell, Stefan Siegert, David Stephenson, Robin.
Fair scores for ensemble forecasts Chris Ferro University of Exeter 13th EMS Annual Meeting and 11th ECAM (10 September 2013, Reading, UK)
LRF Training, Belgrade 13 th - 16 th November 2013 © ECMWF Sources of predictability and error in ECMWF long range forecasts Tim Stockdale European Centre.
ECMWF long range forecast systems
14 May 2001QPF Verification Workshop Verification of Probability Forecasts at Points WMO QPF Verification Workshop Prague, Czech Republic May 2001.
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
EVSC 495/EVAT 795 Data Analysis & Climate Change Class hours: TuTh 2:00-3:15 pm Instructor: Michael E. Mann.
Bayesian methods for combining climate forecasts (*): Department of Meteorology, The University of Reading 1.Introduction 2.Conditioning and Bayes’ theorem.
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss Improving COSMO-LEPS forecasts of extreme events with.
Introduction to Probability and Probabilistic Forecasting L i n k i n g S c i e n c e t o S o c i e t y Simon Mason International Research Institute for.
Gridded OCF Probabilistic Forecasting For Australia For more information please contact © Commonwealth of Australia 2011 Shaun Cooper.
Probabilistic forecasts of precipitation in terms of quantiles John Bjørnar Bremnes.
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
© Crown copyright Met Office Andrew Colman presentation to EuroBrisa Workshop July Met Office combined statistical and dynamical forecasts for.
Ensemble Post-Processing and it’s Potential Benefits for the Operational Forecaster Michael Erickson and Brian A. Colle School of Marine and Atmospheric.
A Regression Model for Ensemble Forecasts David Unger Climate Prediction Center.
Multi-Model Ensembling for Seasonal-to-Interannual Prediction: From Simple to Complex Lisa Goddard and Simon Mason International Research Institute for.
Caio A. S. Coelho Supervisors: D. B. Stephenson, F. J. Doblas-Reyes (*) Thanks to CAG, S. Pezzulli and M. Balmaseda.
1 Bayesian methods for parameter estimation and data assimilation with crop models Part 2: Likelihood function and prior distribution David Makowski and.
Evaluating decadal hindcasts: why and how? Chris Ferro (University of Exeter) T. Fricker, F. Otto, D. Stephenson, E. Suckling CliMathNet Conference (3.
Climate Forecasting Unit Prediction of climate extreme events at seasonal and decadal time scale Aida Pintó Biescas.
Statistical Decision Theory
Verification of ensembles Courtesy of Barbara Brown Acknowledgments: Tom Hamill, Laurence Wilson, Tressa Fowler Copyright UCAR 2012, all rights reserved.
Bayesian Inference Ekaterina Lomakina TNU seminar: Bayesian inference 1 March 2013.
Tutorial. Other post-processing approaches … 1) Bayesian Model Averaging (BMA) – Raftery et al (1997) 2) Analogue approaches – Hopson and Webster, J.
Intraseasonal TC prediction in the southern hemisphere Matthew Wheeler and John McBride Centre for Australia Weather and Climate Research A partnership.
FORECAST SST TROP. PACIFIC (multi-models, dynamical and statistical) TROP. ATL, INDIAN (statistical) EXTRATROPICAL (damped persistence)
Caio A. S. Coelho, D. B. Stephenson, F. J. Doblas-Reyes (*) and M. Balmaseda (*) Department of Meteorology, University of Reading and ECMWF (*)
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
Celeste Saulo and Juan Ruiz CIMA (CONICET/UBA) – DCAO (FCEN –UBA)
Toward Probabilistic Seasonal Prediction Nir Krakauer, Hannah Aizenman, Michael Grossberg, Irina Gladkova Department of Civil Engineering and CUNY Remote.
Verification of IRI Forecasts Tony Barnston and Shuhua Li.
Caio A. S. Coelho Is a group of forecasts better than one? Department of Meteorology, University of Reading Introduction Combination in meteorology Bayesian.
1 An overview of the use of reforecasts for improving probabilistic weather forecasts Tom Hamill NOAA / ESRL, Physical Sciences Div.
Forecasting in CPT Simon Mason Seasonal Forecasting Using the Climate Predictability Tool Bangkok, Thailand, 12 – 16 January 2015.
Machine Learning CUNY Graduate Center Lecture 4: Logistic Regression.
- 1 - Bayesian inference of binomial problem Estimating a probability from binomial data –Objective is to estimate unknown proportion (or probability of.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss A more reliable COSMO-LEPS F. Fundel, A. Walser, M. A.
Verification of ensemble systems Chiara Marsigli ARPA-SIMC.
Applying Ensemble Probabilistic Forecasts in Risk-Based Decision Making Hui-Ling Chang 1, Shu-Chih Yang 2, Huiling Yuan 3,4, Pay-Liam Lin 2, and Yu-Chieng.
Two extra components in the Brier Score Decomposition David B. Stephenson, Caio A. S. Coelho (now at CPTEC), Ian.T. Jolliffe University of Reading, U.K.
© Crown copyright Met Office Seasonal forecasting: Not just seasonal averages! Emily Wallace November 2012.
Caio A. S. Coelho, S. Pezzulli, M. Balmaseda (*), F. J. Doblas-Reyes (*) and D. B. Stephenson Forecast calibration and combination: A simple Bayesian approach.
Caio A. S. Coelho, S. Pezzulli, M. Balmaseda (*), F. J. Doblas-Reyes (*) and D. B. Stephenson Bayesian combination of ENSO forecasts Department of Meteorology,
Predicting the performance of climate predictions Chris Ferro (University of Exeter) Tom Fricker, Fredi Otto, Emma Suckling 13th EMS Annual Meeting and.
Outline Historical note about Bayes’ rule Bayesian updating for probability density functions –Salary offer estimate Coin trials example Reading material:
Oliver Schulte Machine Learning 726
Synthesis of Seasonal Prediction Skill
FORECASTING HEATWAVE, DROUGHT, FLOOD and FROST DURATION Bernd Becker
Verifying and interpreting ensemble products
Potential predictability, ensemble forecasts and tercile probabilities
Development and applications of a new genesis potential index
LOGISTIC REGRESSION 1.
Question 1 Given that the globe is warming, why does the DJF outlook favor below-average temperatures in the southeastern U. S.? Climate variability on.
Makarand A. Kulkarni Indian Institute of Technology, Delhi
Anne Leroy Météo France, Nouméa, Nouvelle-Calédonie Matthew C. Wheeler
Forecast Assimilation: A Unified Framework for the
Forecasting Seasonal and
forecasts of rare events
Verification of COSMO-LEPS and coupling with a hydrologic model
Improved skill of ENSO coupled model probability forecasts by Bayesian combination with empirical forecasts Caio A. S. Coelho, S. Pezzulli, M. Balmaseda.
ECMWF Seasonal Forecast Group Meeting: 24 July 2002
the performance of weather forecasts
What is a good ensemble forecast?
Power Regression & Regression estimation of event probabilities (REEP)
Presentation transcript:

Creating probability forecasts of binary events from ensemble predictions and prior information - A comparison of methods Cristina Primo Institute Pierre Simon Laplace (IPSL) Ian Jolliffe, Chris A. T. Ferro and David B. Stephenson Climate Analysis Group Department of Meteorology University of Reading

2 Outline: How to improve probabilistic forecasts of a binary event? - Use prior information: Bayesian methods. - Calibrate the model: Logistic regression. Illustration of methods with an example: - 3-day ahead precipitation in Reading (UK) - 5-month ahead forecast of Dec. Niño 3.4 Index Conclusions

3 Aim: Forecast a binary event at a future time 1 if the event is observed to occur 0 otherwise NOTATION: Number of members that forecast the event. Numerical models provide an ensemble of forecasts for time : ensemble size 1 if the -th member forecast the event at time 0 otherwise

4 How to estimate the probability of the event?

5 (a) Temporal series Do not use ensemble forecasts.

6 How to estimate the probability of the event? (b) Frequentist approach Just use ensemble forecasts (do not use past observations).

7 How to estimate the probability of the event? (c) Bayesian approach Use ensemble forecasts and prior information about past data, expert opinion or a combination between them.

8 How to estimate the probability of the event? (d) Calibration approach Incorporate the relationship between past observations and past ensemble forecasts.

9 Easy to obtain. When and, the forecaster issues probabilities of 0 and 1 (event completely impossible or completely certain to occur). There is no estimate of uncertainty on the predicted probability. The probabilities take only a finite set of discrete values. It is unlikely the forecaster really believes this statement !! 1) Frequentist approach The probability is estimated by the relative frequency: = Probability that the event is observed to occur (unknown).

10 2) Bayesian approach : provides us with a posterior distribution of the predicted probability. Estimate a distribution a priori including the uncertainty in the parameters: Model uncertainty of the ensemble forecasts (likelihood) as a conditional distribution: Obtain a posterior distribution (Bayes’ theorem): Davison A. C.,Cambridge University Press (2003) If the model is perfectly calibrated, the probability that an ensemble member forecast the event is also.

11 2) Beta approach Observations Beta(0.5,0.5) Beta(1,1) Beta(2,2) Beta(10,20) Katz and Ehrendorfer, Weather and Forecasting (2005). Forecasts + choose a Prior distribution likelihood p.d.f. Beta( ,  ) Posterior distribution Bayes´ theorem But both  and  are unknown !!

12 How to choose the prior distribution? 2) Calculate: a central point: a measure of the spread: = weight, = Number of past observations The weight gives different importance to prior belief and model forecasts and is chosen to minimize the logarithmic score. Rajagopalan et al. (2002) method is a particular case, where:  =  = 0. This is equivalent to the frequentist approach. = climatology

13 3) Calibration Technique: Both unknown Predictor (given by the ensemble forecasts). Link Function The parameters of the logistic regression are calculated to maximize the likelihood. Logistic regression If is an explanatory variable calculated from the ensemble forecasts, then:

14 Relative frequencies: Logit transformation of the relative frequencies: Include prior information: Which explanatory variable to use? Roulston and Smith, Mon. Wea. Rev. (2002)

15 1)Relative frequencies 2) Beta approach Rajagopalan et al. (2002) 3) Logistic Regression Summary of methods

16 Example: Daily winter precipitation at Reading (UK) Forecasts: 3-day ahead 50-member forecasts of daily total precipitation from Ensemble Prediction System (EPS) at ECMWF for a grid point near Reading (UK) forecast Period: Dec-Jan-Feb from 1997 to n= 812 daily observations m x n = 50 x 812 = forecasts Binary event: precipitation above a threshold. Observations: total daily precipitation observed at the University of Reading atmospheric observatory.

17 Precipitation in Reading: The model is not perfectly calibrated 0.1 mm (WMO def. of wet day) 2 mm (perc= 75.6%) 10 mm (perc.=97%, extreme event).

18 Threshold=0.1mm Threshold=10mm Threshold=2mm The predicted probability can be expressed as a function of the frequency approach: F is a lineal function in the RLZ method and non linear in the Logistic Regression approach.

19 BRIER SCORE0.1 mm2 mm10 mm Climatology Frequencies Bayesian approach (RLZ) Log. Reg. (Xt = qt) Log. Reg. (Xt = logit qt) Brier Score: All the BS improve the frequencies one (  =0.05).

20 Example 2: Niño-3.4 SST index Observations: Niño-3.4 SST index Forecasts: 5-month ahead 9-member ensemble forecasts of Niño 3.4 SST index from the coupled ECMWF model ( DEMETER). Period: hindcasts for December started in the 1st of August of each year from 1958 to n = 44 observations m x n = 9 x 44 = 396 forecasts (Palmer et al. Bull. Am. Meteorol. Soc., 2004) Binary event: Index above the median.

21 Years Observations and Forecasts Niño 3.4 SST index: Observations Forecasts

22 Niño 3.4 SST index: 90% perc. 75% perc. median Median 75% 90% We calibrate the data when we codify them.

23 Brier Score: BRIER SCORE0.1 mm2 mm10 mm Climatology Frequencies Bayesian approach (RLZ) Log. Reg. (Xt = qt)

24 Conclusions based on this example: - Use of prior information via the Beta distribution gives forecasts that have more skill than the frequentist ones - Calibration using logistic regression gives forecasts that have more skill than the frequentist ones - A combination of Beta technique and calibration improves each technique separately. Work is still necessary to choose the best predictor for the logistic regression and the best way to combine both techniques.

25 END

26