Two Consolidation Projects: Towards an International MME: CFS+EUROSIP(UKMO,ECMWF,METF) 11 slides Towards a National MME: CFS and GFDL 18 slides.

Slides:



Advertisements
Similar presentations
LRF Training, Belgrade 13 th - 16 th November 2013 © ECMWF Sources of predictability and error in ECMWF long range forecasts Tim Stockdale European Centre.
Advertisements

ECMWF long range forecast systems
Weekly TC forecasts in the Southern Hemisphere Anne Leroy (Météo France) Matthew Wheeler (CAWCR/BOM) John McBride (CAWCR/BOM) funded by the Indian Ocean.
Verification of NCEP SFM seasonal climate prediction during Jae-Kyung E. Schemm Climate Prediction Center NCEP/NWS/NOAA.
Initialization Issues of Coupled Ocean-atmosphere Prediction System Climate and Environment System Research Center Seoul National University, Korea In-Sik.
Details for Today: DATE:3 rd February 2005 BY:Mark Cresswell FOLLOWED BY:Assignment 2 briefing Evaluation of Model Performance 69EG3137 – Impacts & Models.
Assessment of CFSv2 hindcast (seasonal mean) CPC/NCEP/NOAA Jan 2011.
Validation of the NCEP CFS forecasts Suranjana Saha Environmental Modeling Center NCEP/NWS/NOAA/DOC.
The NCEP operational Climate Forecast System : configuration, products, and plan for the future Hua-Lu Pan Environmental Modeling Center NCEP.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Seamless precipitation prediction skill in a global model: Actual versus potential skill Matthew Wheeler 1, Hongyan Zhu 1, Adam Sobel 2, and Debra Hudson.
© Crown copyright Met Office Andrew Colman presentation to EuroBrisa Workshop July Met Office combined statistical and dynamical forecasts for.
Ensemble Post-Processing and it’s Potential Benefits for the Operational Forecaster Michael Erickson and Brian A. Colle School of Marine and Atmospheric.
A Regression Model for Ensemble Forecasts David Unger Climate Prediction Center.
1 Assessment of the CFSv2 real-time seasonal forecasts for Wanqiu Wang, Mingyue Chen, and Arun Kumar CPC/NCEP/NOAA.
Warm Season Precipitation Predictions over North America with the Eta Regional Climate Model Model Sensitivity to Initial Land States and Choice of Domain.
Multi-Model Ensembling for Seasonal-to-Interannual Prediction: From Simple to Complex Lisa Goddard and Simon Mason International Research Institute for.
Caio A. S. Coelho Supervisors: D. B. Stephenson, F. J. Doblas-Reyes (*) Thanks to CAG, S. Pezzulli and M. Balmaseda.
Exploring sample size issues for 6-10 day forecasts using ECMWF’s reforecast data set Model: 2005 version of ECMWF model; T255 resolution. Initial Conditions:
1 How Does NCEP/CPC Make Operational Monthly and Seasonal Forecasts? Huug van den Dool (CPC) CPC, June 23, 2011/ Oct 2011/ Feb 15, 2012 / UoMDMay,2,2012/
Intraseasonal TC prediction in the southern hemisphere Matthew Wheeler and John McBride Centre for Australia Weather and Climate Research A partnership.
The La Niña Influence on Central Alabama Rainfall Patterns.
EUROBRISA Workshop – Beyond seasonal forecastingBarcelona, 14 December 2010 INSTITUT CATALÀ DE CIÈNCIES DEL CLIMA Beyond seasonal forecasting F. J. Doblas-Reyes,
FORECAST SST TROP. PACIFIC (multi-models, dynamical and statistical) TROP. ATL, INDIAN (statistical) EXTRATROPICAL (damped persistence)
Caio A. S. Coelho, D. B. Stephenson, F. J. Doblas-Reyes (*) and M. Balmaseda (*) Department of Meteorology, University of Reading and ECMWF (*)
Model dependence and an idea for post- processing multi-model ensembles Craig H. Bishop Naval Research Laboratory, Monterey, CA, USA Gab Abramowitz Climate.
Page 1© Crown copyright 2006 Matt Huddleston With thanks to: Frederic Vitart (ECMWF), Ruth McDonald & Met Office Seasonal forecasting team 14 th March.
Verification of IRI Forecasts Tony Barnston and Shuhua Li.
1 Climate Test Bed Seminar Series 24 June 2009 Bias Correction & Forecast Skill of NCEP GFS Ensemble Week 1 & Week 2 Precipitation & Soil Moisture Forecasts.
1 Motivation Motivation SST analysis products at NCDC SST analysis products at NCDC  Extended Reconstruction SST (ERSST) v.3b  Daily Optimum Interpolation.
Course Evaluation Closes June 8th.
CTB Science Plan For Multi Model Ensembles (MME) Suru Saha Environmental Modeling Centre NCEP/NWS/NOAA.
Development of Precipitation Outlooks for the Global Tropics Keyed to the MJO Cycle Jon Gottschalck 1, Qin Zhang 1, Michelle L’Heureux 1, Peitao Peng 1,
Statistical Post Processing - Using Reforecast to Improve GEFS Forecast Yuejian Zhu Hong Guan and Bo Cui ECM/NCEP/NWS Dec. 3 rd 2013 Acknowledgements:
CTB computer resources / CFSRR project Hua-Lu Pan.
Recent and planed NCEP climate modeling activities Hua-Lu Pan EMC/NCEP.
“Comparison of model data based ENSO composites and the actual prediction by these models for winter 2015/16.” Model composites (method etc) 6 slides Comparison.
Exploring the Possibility to Forecast Annual Mean Temperature with IPCC and AMIP Runs Peitao Peng Arun Kumar CPC/NCEP/NWS/NOAA Acknowledgements: Bhaskar.
One-year re-forecast ensembles with CCSM3.0 using initial states for 1 January and 1 July in Model: CCSM3 is a coupled climate model with state-of-the-art.
1 Malaquias Peña and Huug van den Dool Consolidation of Multi Method Forecasts Application to monthly predictions of Pacific SST NCEP Climate Meeting,
Verification of ensemble precipitation forecasts using the TIGGE dataset Laurence J. Wilson Environment Canada Anna Ghelli ECMWF GIFS-TIGGE Meeting, Feb.
Huug van den Dool / Dave Unger Consolidation of Multi-Method Seasonal Forecasts at CPC. Part I.
1 How Does NCEP/CPC Make Operational Monthly and Seasonal Forecasts? Huug van den Dool (CPC) ESSIC, February, 23, 2011.
Meteorology 485 Long Range Forecasting Friday, February 13, 2004.
Multi Model Ensembles CTB Transition Project Team Report Suranjana Saha, EMC (chair) Huug van den Dool, CPC Arun Kumar, CPC February 2007.
Huug van den Dool and Suranjana Saha Prediction Skill and Predictability in CFS.
Judith Curry James Belanger Mark Jelinek Violeta Toma Peter Webster 1
Huug van den Dool and Steve Lord International Multi Model Ensemble.
1 Summary of CFS ENSO Forecast September 2010 update Mingyue Chen, Wanqiu Wang and Arun Kumar Climate Prediction Center 1.Latest forecast of Nino3.4 index.
1 An Assessment of the CFS real-time forecasts for Wanqiu Wang, Mingyue Chen, and Arun Kumar CPC/NCEP/NOAA.
1 Malaquias Peña and Huug van den Dool Consolidation methods for SST monthly forecasts for MME Acknowledgments: Suru Saha retrieved and organized the data,
Cooperative Research Programs (CoRP) Satellite Climate Studies Branch (SCSB) 1 1 Reconstruction of Near-Global Precipitation Variations Based on Gauges.
Verification of Daily CFS forecasts Huug van den Dool & Suranjana Saha CFS was designed as ‘seasonal’ system Hindcasts , 15 ‘members’ per month.
Predictability of Monthly Mean Temperature and Precipitation: Role of Initial Conditions Mingyue Chen, Wanqiu Wang, and Arun Kumar Climate Prediction Center/NCEP/NOAA.
1 A review of CFS forecast skill for Wanqiu Wang, Arun Kumar and Yan Xue CPC/NCEP/NOAA.
Climate Mission Outcome A predictive understanding of the global climate system on time scales of weeks to decades with quantified uncertainties sufficient.
1 Summary of CFS ENSO Forecast December 2010 update Mingyue Chen, Wanqiu Wang and Arun Kumar Climate Prediction Center 1.Latest forecast of Nino3.4 index.
1 Summary of CFS ENSO Forecast August 2010 update Mingyue Chen, Wanqiu Wang and Arun Kumar Climate Prediction Center 1.Latest forecast of Nino3.4 index.
1/39 Seasonal Prediction of Asian Monsoon: Predictability Issues and Limitations Arun Kumar Climate Prediction Center
IRI Climate Forecasting System
IRI Multi-model Probability Forecasts
Makarand A. Kulkarni Indian Institute of Technology, Delhi
Anne Leroy Météo France, Nouméa, Nouvelle-Calédonie Matthew C. Wheeler
Progress in Seasonal Forecasting at NCEP
The Importance of Reforecasts at CPC
IRI forecast April 2010 SASCOF-1
GloSea4: the Met Office Seasonal Forecasting System
Forecast system development activities
Power Regression & Regression estimation of event probabilities (REEP)
Ryan Kang, Wee Leng Tan, Thea Turkington, Raizan Rahmat
Presentation transcript:

Two Consolidation Projects: Towards an International MME: CFS+EUROSIP(UKMO,ECMWF,METF) 11 slides Towards a National MME: CFS and GFDL 18 slides

Does the NCEP CFS add to the skill of the European DEMETER-3 to produce a viable International Multi Model Ensemble (IMME) ? Huug van den Dool Climate Prediction Center, NCEP/NWS/NOAA Suranjana Saha and Åke Johansson Environmental Modeling Center, NCEP/NWS/NOAA August 2007

DATA and DEFINITIONS USED DEMETER-3 (DEM3) = ECMWF + METFR + UKMO CFS IMME = DEM3 + CFS 1981 – Initial condition months : Feb, May, Aug and Nov Leads 1-5 Monthly means

DATA/Definitions USED (cont) Deterministic : Anomaly Correlation Probabilistic : Brier Score (BS) and Rank Probability Score (RPS) Ensemble Mean and PDF T2m and Prate Europe and United States “ NO (fancy) consolidation, equal weights, NO Cross-validation”

DATA/Definitions USED (cont) Verification Data : T2m : CPC Monthly Analysis of the CAMS + Global Historical Climate Network (Fan and Van den Dool 2007) Prate : CMAP (Xie-Arkin 1997)

Number of times IMME improves upon DEM-3 : out of 20 cases (4 IC’s x 5 leads): RegionEUROPE USA VariableT2mPrateT2mPrate Anomaly Correlation 914 Brier Score RPS “The bottom line”

Frequency of being the best model in 20 cases in terms of Anomaly Correlation of the Ensemble Mean “Another bottom line” CFSECMWFMETFRUKMO T2mUSA4556 T2mEUROPE3565 PrateUSA7336 PrateEUROPE11005

Frequency of being the best model in 20 cases in terms of Brier Score of the PDF “ Another bottom line” CFSECMWFMETFRUKMO T2mUSA11215 T2mEUROPE10313 PrateUSA17201 PrateEUROPE18011

Frequency of being the best model in 20 cases in terms of Ranked Probability Score (RPS) of the PDF “ Another bottom line” CFSECMWFMETFRUKMO T2mUSA9416 T2mEUROPE9343 PrateUSA19001 PrateEUROPE18001

CONCLUSIONS Overall, NCEP CFS contributes to the skill of IMME (relative to DEM3) for equal weights. This is especially so in terms of the probabilistic Brier Score and for Precipitation

CONCLUSIONS (Cont) In comparison to ECMWF, METFR and UKMO, the CFS as an individual model does: well in deterministic scoring (AC) for Prate and very well in probability scoring (BS) for Prate and T2m over both USA and EUROPEAN domains

CONCLUSIONS (Cont) The relative weakness of the CFS is in the deterministic scoring (AC) for T2m (which is near average of the other models) over both EUROPE and USA Skill (if any) over EUROPE or USA is very modest for any model, or any combination of models The Brier Score shows rare improvements over climatological probabilities in this study The AC for the ensemble mean gives a more “positive” impression about skill than the Brier Score

Study of the performance of GFDL seasonal forecasts in a Multi Model Ensemble at NCEP Huug van den Dool Climate Prediction Center/NCEP/NWS/NOAA Suranjana Saha Environmental Modeling Center/NCEP/NWS/NOAA

Data Used 4 initial conditions: April 1, May 1, Oct 1 and Nov 1 10 member one-year forecasts (leads 0 thru 11) Period (25 years) GFDL has a fully coupled model CM2.1 (IPCC version)

Verification Data Used Focus on monthly mean 2m-temperature and precipitation over the continental US Verification of 2m-temperature against GHCN+CAMS (land only) Verification of precipitation against CMAP ( land and ocean) Area: valid grid points (2.5x2.5) within 25N-50N, 125W-65W box over the US

Comparison to the NCEP Climate Forecast System (CFS) GFDL members start a few days before and on the first of the month. CFS members are clustered around the 11 th and 21 st of the previous month and the 1 st of the initial month. In an NCEP operational setting, the GFDL model would be run everyday (similar to the CFS). Therefore, the calibration of the operational forecast would be obtained from an interpolation of two sets of forecasts, a month apart (one of which would be a month old), thus resulting in a possible degradation of skill.

VERIFICATION OF US PRATE ANOMALY CORRELATION

CFS US PRATE ANOMALY CORRELATION There are 32 ENTRIES: 8 leads for 4 initial months initial month apr may oct nov lead Worst mean-sd mean mean+sd best NO CROSS VALIDATION Some skill in ENSO months

CFS US PRATE ANOMALY CORRELATION There are 32 ENTRIES: 8 leads for 4 initial months initial month apr may oct nov lead Worst mean-sd mean mean+sd best CROSS VALIDATION CV3RE CV brings all numbers down

GFDL US PRATE ANOMALY CORRELATION There are 32 ENTRIES: 8 leads for 4 initial months initial month apr may oct nov lead Worst mean-sd mean mean+sd best NO CROSS VALIDATION Weak skill in ENSO months

MME2 US PRATE ANOMALY CORRELATION There are 32 ENTRIES: 8 leads for 4 initial months initial month apr may oct nov lead Worst mean-sd mean mean+sd best NO CROSS VALIDATION

CFSGFDLMME CFSGFDLMME NO CV CV3RE US PRATE (AC) BEST out of 32 cases (4 IC’s x 8 leads): MEAN AC

 CFS alone is slightly better than GFDL alone  MME2 is slightly better than CFS alone  MME2 is better than GFDL alone  Numerically, differences are minuscule, and the existence of any skill is debatable US PRATE (summary)

Adding GFDL to CFS for MME2 degrades scores GFDL has ENSOs, maybe even too strong in 1983 and 1998, but the precipitation anomalies are weak at the equator and are pushed away from the equator, mainly into the southern hemisphere. PRATE OVER NINO 3.4 AREA (summary) CFS MME2 GFDL (NO CV) (CV3RE)

VERIFICATION OF US SURFACE TEMPERATURE ANOMALY CORRELATION

CFSGFDLMME CFSGFDLMME NO CV CV3RE US 2m TEMPERATURE (AC) BEST out of 32 cases (4 IC’s x 8 leads): MEAN AC

 CFS alone is not better than GFDL alone  MME2 is slightly better than CFS alone  MME2 is not better than GFDL alone  Numerically, differences are minuscule, and the existence of any skill is debatable US 2m TEMPERATURE (summary)

Effect of OCN (Optimal Climate Normals) filtering on AC scores for all 32 cases (NO-CV) 9 year running mean is removed RAW OCN-filtered GFDL CFS GFDL loses its advantage over the CFS when the trend is removed TREND ANALYSIS OF US 2m TEMP

CONCLUSIONS (1) Skill of both, CFS and GFDL, is extremely low for both 2m temperature (T2M) and precipitation (PRATE) over the US, and this skill wilts further upon cross validation (CV3RE) GFDL makes no contribution to the skill of MME2 for PRATE over the US GFDL makes no contribution to the skill of MME2 for PRATE over the tropical Pacific (Nino 3.4 area) GFDL has a small edge over the CFS and contributes to MME2 for T2M over the US

CONCLUSIONS (2) The inconsistency between performance in PRATE and T2M is explained by inclusion of historical CO2 etc, i.e. GFDL does a better job on the decadal temperature trends. This is explained by the drop in the skill when the trend is removed. The empirical tool, OCN (Optimal Climate Normals), is routinely used by CPC to incorporate decadal trends in the consolidation of the official seasonal forecasts for US T2M. Its performance is better than any of their dynamical tools.

From Delsole(2007) Surprisingly, none of the regression models proposed here can consistently beat the skill of a simple multi-model mean “Under suitable assumptions, both the Bayesian estimate and the constrained least squares solution reduce to standard ridge regression”.

Kharin and Zwiers(2002): Several methods of combining individual forecasts from a group of climate models to produce an ensemble forecast are considered In the extratropics, the regression-improved ensemble mean performs best. The “superensemble” forecast that is obtained by optimally weighting the individual ensemble members does not perform as well as either the simple ensemble mean or the regression-improved ensemble mean. The sample size evidently is too small to estimate reliably the relatively large number of optimal weights required for the superensemble approach.

Finally Huug van den Dool, 2007 There is essentially not enough hindcast data for these fancy consolidation methods to work (21-25 years is nothing !!). (( There may be exceptions )) There is no (or not enough) independent information in model A versus Model B We have to be rigorous in CV procedures!

The rest is EXTRA

Classic +Delsole limit +CPC limit

Appendix: Consolidation Techniques A technique to linearly combine any set of models Example: Con3 = a*A + b*B + c*C, where A, B and C are forecasts and a, b, and c coefficients. The coefficients ideally depend on skill and co-linearity among the models, as determined from many hindcasts Because of near instability of the matrix problem, NCEP applies ‘ridging’ to the covariance matrix, and tries to pool as much data as possible (areas, leads..). To arrive at a skill estimate, we perform a 3 year-out cross validation (CV3), namely the year in consideration and two more years chosen at random (to reduce CV pathological problems)

BRIER SCORE FOR 3-CLASS SYSTEM 1. Calculate tercile boundaries from observations ( for longer leads) at each gridpoint. 2. Assign departures from model’s own climatology (based on 21 years, all members) to one of the three classes: Below (B), Normal (N) and Above (A), and find the fraction of forecasts (F) among all participating ensemble members for these classes denoted by FB, FN and FA respectively, such that FB+ FN+FA=1. 3. Denoting Observations as O, we calculate a Brier Score (BS) as : BS={(FB-OB)**2 +(FN-ON)**2 + (FA-OA)**2}/3, aggregated over all years and all grid points. {{For example, when the observation is in the B class, we have (1,0,0) for (OB, ON, OA) etc.}} 4. BS for random deterministic prediction: BS for ‘always climatology’ (1/3 rd,1/3 rd,1/3 rd ) : RPS: The same as Brier Score, but for cumulative distribution (no- skill=0.148)

Anomaly correlation does not asymptote to 100 at fcst time=0 Interpolation of initial conditions from Reanalysis 2 may not be correct or accurate

CROSS-VALIDATION Anomaly Pattern correlation over the tropical Pacific. Average for all leads and initial months. Empty bar: Full (dependent), filled bar: 3-yr out cross-validated.

Peña and Van den Dool (2008) Consolidation of Multi Method Forecasts by Ridge Regression: Application to Pacific Sea Surface Temperature  Strategies to increase the ratio of the effective sample size of the training data to the number of coefficients to be fitted are proposed and tested. These strategies include: i) objective selection of a smaller subset of models, ii) pooling of information from neighboring gridpoints, and iii) consolidating all ensemble members rather than each model’s ensemble average. In all variations of the ridge regression consolidation methods tested, increased effective sample size produces more stable weights and more skillful predictions on independent data. In the western tropical Pacific, most consolidation methods outperform the simple equal weight ensemble average; in other regions they have similar skill as measured by both the anomaly correlation and the relative operating curve. The main obstacle to progress is a short period of data and a lack of independent information among models. CV3RE