Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Assessing the skill of decadal predictions Reidun Gangstø,

Slides:



Advertisements
Similar presentations
Measuring the performance of climate predictions Chris Ferro, Tom Fricker, David Stephenson Mathematics Research Institute University of Exeter, UK IMA.
Advertisements

EVAL 6970: Meta-Analysis Vote Counting, The Sign Test, Power, Publication Bias, and Outliers Dr. Chris L. S. Coryn Spring 2011.
Model Assessment and Selection
Model assessment and cross-validation - overview
Chapter 2: Lasso for linear models
Details for Today: DATE:3 rd February 2005 BY:Mark Cresswell FOLLOWED BY:Assignment 2 briefing Evaluation of Model Performance 69EG3137 – Impacts & Models.
1 Research activities Climate Services & NCCR Christof Appenzeller et al. Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie.
Statistical Postprocessing of Weather Parameters for a High-Resolution Limited-Area Model Ulrich Damrath Volker Renner Susanne Theis Andreas Hense.
The loss function, the normal equation,
Forecasting JY Le Boudec 1. Contents 1.What is forecasting ? 2.Linear Regression 3.Avoiding Overfitting 4.Differencing 5.ARMA models 6.Sparse ARMA models.
BA 555 Practical Business Analysis
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz New automatic weather type classifications at MeteoSwiss.
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
A comparison of K-fold and leave-one-out cross-validation of empirical keys Alan D. Mead, IIT
System Evaluation To evaluate the error probability of the designed Pattern Recognition System Resubstitution Method – Apparent Error Overoptimistic Holdout.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Automating Peak-over- Threshold with the Information.
Ensemble Learning (2), Tree and Forest
1 Assessment of the CFSv2 real-time seasonal forecasts for Wanqiu Wang, Mingyue Chen, and Arun Kumar CPC/NCEP/NOAA.
Model Assessment and Selection Florian Markowetz & Rainer Spang Courses in Practical DNA Microarray Analysis.
Creating Empirical Models Constructing a Simple Correlation and Regression-based Forecast Model Christopher Oludhe, Department of Meteorology, University.
2015 AprilUNIVERSITY OF HAIFA, DEPARTMENT OF STATISTICS, SEMINAR FOR M.A 1 Hastie, Tibshirani and Friedman.The Elements of Statistical Learning (2nd edition,
Climate Forecasting Unit Second Ph’d training talk Prediction of climate extreme events at seasonal and decadal time scale Aida Pintó Biescas.
Barcelona, 2015 Ocean prediction activites at BSC-IC3 Virginie Guemas and the Climate Forecasting Unit 9 February 2015.
The high resolution in seasonal climate predictions with EC-Earth François Massonnet C. Prodhomme, L. Batté, V. Guemas, F. J. Doblas-Reyes Polar Predictability.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz CH2018 – Meeting Communication Aspects Andreas Fischer,
Methodology for producing the revised back series of population estimates for Julie Jefferies Population and Demography Division Office for.
Guidance on Intensity Guidance Kieran Bhatia, David Nolan, Mark DeMaria, Andrea Schumacher IHC Presentation This project is supported by the.
EUROBRISA Workshop – Beyond seasonal forecastingBarcelona, 14 December 2010 INSTITUT CATALÀ DE CIÈNCIES DEL CLIMA Beyond seasonal forecasting F. J. Doblas-Reyes,
 2003, G.Tecuci, Learning Agents Laboratory 1 Learning Agents Laboratory Computer Science Department George Mason University Prof. Gheorghe Tecuci 5.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Pollen emission in COSMO-ART COSMO User Workshop
Introduction to Machine Learning Supervised Learning 姓名 : 李政軒.
Model dependence and an idea for post- processing multi-model ensembles Craig H. Bishop Naval Research Laboratory, Monterey, CA, USA Gab Abramowitz Climate.
Manu Chandran. Outline Background and motivation Over view of techniques Cross validation Bootstrap method Setting up the problem Comparing AIC,BIC,Crossvalidation,Bootstrap.
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss European wind storms and reinsurance loss: New estimates.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Using seasonal forecasts to estimate current climate.
Testing old and new AVO methods Chuck Ursenbach CREWES Sponsors Meeting November 21, 2003.
Decadal predictability of winter SST in the Nordic Seas and Barents Sea in three CMIP5 models NACLIM ANNUAL MEETING, OCTOBER 14, 2014 BERLIN H. R. Langehaug,
1 Motivation Motivation SST analysis products at NCDC SST analysis products at NCDC  Extended Reconstruction SST (ERSST) v.3b  Daily Optimum Interpolation.
. Outline  Evaluation of different model-error schemes in the WRF mesoscale ensemble: stochastic, multi-physics and combinations thereof  Where is.
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz First Experience with KENDA at MeteoSwiss Daniel Leuenberger,
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Robust Estimators.
Data Mining Practical Machine Learning Tools and Techniques By I. H. Witten, E. Frank and M. A. Hall Chapter 5: Credibility: Evaluating What’s Been Learned.
18 September 2009: On the value of reforecasts for the TIGGE database 1/27 On the value of reforecasts for the TIGGE database Renate Hagedorn European.
Classification Ensemble Methods 1
Exploring the Possibility to Forecast Annual Mean Temperature with IPCC and AMIP Runs Peitao Peng Arun Kumar CPC/NCEP/NWS/NOAA Acknowledgements: Bhaskar.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz The new multi-layer snow model Guy de Morsier 1, Jean-Marie.
WCRP Extremes Workshop Sept 2010 Detecting human influence on extreme daily temperature at regional scales Photo: F. Zwiers (Long-tailed Jaeger)
Machine Learning 5. Parametric Methods.
Validation methods.
One-year re-forecast ensembles with CCSM3.0 using initial states for 1 January and 1 July in Model: CCSM3 is a coupled climate model with state-of-the-art.
Chris Ferro Climate Analysis Group Department of Meteorology University of Reading Extremes in a Varied Climate 1.Significance of distributional changes.
Impact of TAO observations on Impact of TAO observations on Operational Analysis for Tropical Pacific Yan Xue Climate Prediction Center NCEP Ocean Climate.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz 1 METEO-Cert - Acceptance Procedure for Automatic Weather.
S 1 CT1/CT3 Meeting April 2013 Hamburg WP 3.1 Suitability of the ocean observing system components for initialization Wonsun Park GEOMAR.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Weather type dependant fuzzy verification of precipitation.
1 Malaquias Peña and Huug van den Dool Consolidation methods for SST monthly forecasts for MME Acknowledgments: Suru Saha retrieved and organized the data,
Cooperative Research Programs (CoRP) Satellite Climate Studies Branch (SCSB) 1 1 Reconstruction of Near-Global Precipitation Variations Based on Gauges.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
Rodney Nielsen Many of these slides were adapted from: I. H. Witten, E. Frank and M. A. Hall Data Science Credibility: Evaluating What’s Been Learned Predicting.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Ensemble activites and plans at MeteoSwiss André Walser.
1/39 Seasonal Prediction of Asian Monsoon: Predictability Issues and Limitations Arun Kumar Climate Prediction Center
National Oceanic and Atmospheric Administration’s National Weather Service Colorado Basin River Forecast Center Salt Lake City, Utah 11 The Hydrologic.
Improved Historical Reconstructions of SST and Marine Precipitation Variations Thomas M. Smith1 Richard W. Reynolds2 Phillip A. Arkin3 Viva Banzon2 1.
Predictability of Indian monsoon rainfall variability
10701 / Machine Learning Today: - Cross validation,
Instrumental Surface Temperature Record
Measuring the performance of climate predictions
Presentation transcript:

Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Assessing the skill of decadal predictions Reidun Gangstø, Andreas P. Weigel, Mark A. Liniger EMS Annual Meeting, Berlin, 13 September 2011

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Outline The ENSEMBLES decadal predictions Impact of drift correction on skill Is there any skill apart from the trend? Impact of cross-validation on skill Evaluating skill with the Jackknife bias corrector Summary and conclusions

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Global average T2 temperature (ENSEMBLES decadal predictions vs ERA-40/Interim re-analysis data) ECMWFUKMO IFM-GEOMARCERFACS T2 (°C) Hindcast year T2 (°C)

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Problem: sample size too small (8-9) to obtain robust bias estimates for each year separately Lead-time (year) T2 temperature (°C) Example of drift evolution with lead-time Crosses: CONV solid lines: FIT Drift correction methods: Subtracting the lead- time dependent bias (CONV) Fitting a 3rd degree polynomial fit to the lead-time dependent bias (FIT) The drift correction is done in a leave-one-out cross-validation mode

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Global mean T2, after drift correction ECMWFUKMO IFM-GEOMARCERFACS T2 (°C) Hindcast year T2 (°C) Multi-model

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Correlation after drift correction (in cross-validation) Correlation, FIT (T2 mean over years 1-5) Lead-time (year) Mean of grid point-wise correlation Correlation Latitude Longitude

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Removing the model trend 1-5 y 6-10 y All lead-times 1-10 y T2 temperature (°C) Year 1-5 y 6-10 y 1-5 y 6-10 y

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Removing the observed trend 1-5 y 6-10 y 1-5 y 6-10 y y 6-10 y

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Correlation after detrending (drift correction with CONV, in cross-validation) Why is the skill predominantly negative??? Lead-time (year) Correlation Correlation, model trend removed (yrs 1-5) Latitude Longitude Mean of grid point-wise correlation

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Cross-validation 1960 Predict 1960 Determine bias 1965 Determine bias Predict 1965 Determine bias 1970 Determine biasPredict 1970 Determine bias 1975 Determine biasPredict 1975 Determine bias 1980 Determine biasPredict 1980 Determine bias … then correlate with observations 1960, 1965, 1970, …

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Prescribed correlation: 0 Number of experiments: 10’000 Var.fcst / Var.obsv 1:12 Correlation as measured Drift-correction (method: CONV) in cross-validation Toy model: bias from cross-validation

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Not bias corrected Forecasts Obsv. NO CORRELATION Illustration of cross-validation bias, example: CONV drift correction

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Not bias corrected Bias corrected Forecasts Obsv. NO CORRELATION Illustration of cross-validation bias example: CONV drift correction

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Not bias corrected Bias corrected Forecasts Obsv. NO CORRELATION Illustration of cross-validation bias example: CONV drift correction

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Not bias corrected Bias corrected Forecasts Obsv. NO CORRELATION Illustration of cross-validation bias example: CONV drift correction

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Not bias corrected Bias corrected Forecasts Obsv. NO CORRELATION NEGATIVE CORRELATION Illustration of cross-validation bias example: CONV drift correction

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Consequences for verification Estimates of actual prediction skill of decadal forecasts problematic because: Issues of data situation in hindcasts (e.g. ocean data before 1980s) small sample size induces bias in cross-validation procedure It may be better to look at potential predictability, i.e. the skill we would have with an infinite number of training data, and assuming that there are no limitations in data quality

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Jackknifing as a pragmatic solution Empirical approach frequently used to quantify sample size related biases Related to bootstrapping The idea is that the estimator is computed from the full sample, then recomputed n times, leaving a different observation out each time Reference: B. Efron (1982). The Jackknife, the Bootstrap and other resampling plans. J.W. Arrowsmith, Ltd., Bristol, England, 92 pp.

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Prescribed correlation: 0 Number of experiments: 10’000 Var.fcst / Var.obsv 1:12 Correlation as measured Drift-correction (method: CONV) in cross-validation Jackknife estimate Toy model: bias from cross-validation

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Local correlation after drift correction, with the Jackknife bias corrector (JK) applied Correlation with JK (T2 mean over years 1-5) Lead-time (year) Correlation Correlation, CONV, with CV Latitude Longitude Mean of grid point-wise correlation

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Local correlation after detrending, with the Jackknife bias corrector applied Correlation with JK, model trend removed (yrs 1-5) Lead-time (year) Correlation Correlation CONV, with CV Latitude Longitude Mean of grid point-wise correlation

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Difference in correlation between the detrending methods Uncertainties related to the choice of detrending method are of the same order of magnitude as remaining fluctuations

Assessing the skill of decadal predictions | Reidun Gangstø EMS Annual Meeting, Berlin | 13 September /23 Summary and conclusions Predicted near-surface temperature from the ENSEMBLES decadal model forecasts are compared to ERA-40/Interim re-analysis data Drift correction: Reduction of noise by fitting suitable polynomial through annual bias estimates Verification: Unbiased estimate of forecasts problematic due to small sample sizes It may be more useful to focus on potential predictability (e.g. Jackknife method) Trend: By far most of the skill is related to reproduction of linear trend Skill of predicting remaining (interannual) fluctuations close to zero Exact quantification difficult due to uncertainties in detrending methods