Metrics for Model Skill Assessment Model Error time series (model-data misfit): –ME(i) = model - data Total Root-Mean-Square Error: RMS_Total RMS_Total.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

Page 1© Crown copyright 2004 Skill scores for GEMS-aerosol Olivier Boucher GEMS - Kick-off meeting July 2005.
AP Statistics Section 3.2 C Coefficient of Determination
NOTATION & ASSUMPTIONS 2 Y i =  1 +  2 X 2i +  3 X 3i + U i Zero mean value of U i No serial correlation Homoscedasticity Zero covariance between U.
Marine Data Management Mihael M. de Souza. Area of Interest (AOI) 35º S 45º S 160º E175º E.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
CFSv2 monthly mean forecast skill December, 2010.
Assessment of CFSv2 hindcast (seasonal mean) CPC/NCEP/NOAA Jan 2011.
An Assessment of CMAQ with TEOM Measurements over the Eastern US Michael Ku, Chris Hogrefe, Kevin Civerolo, and Gopal Sistla PM Model Performance Workshop,
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
9. SIMPLE LINEAR REGESSION AND CORRELATION
The South Atlantic Bight Cape Hatteras Cape Canaveral.
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
What does researcher want of statistics?. 1.How variable it is? 2.Does “my pet thing” work? 3.Why do the things differ? 4.Why does it fail from time to.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Math 227 Elementary Statistics Math 227 Elementary Statistics Sullivan, 4 th ed.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Chapter 2 Data Patterns and Choice of Forecasting Techniques
1 Assessment of the CFSv2 real-time seasonal forecasts for Wanqiu Wang, Mingyue Chen, and Arun Kumar CPC/NCEP/NOAA.
Update on hydrodynamic model comparisons Marjy Friedrichs and Carl Friedrichs Aaron Bever (post-doc) Leslie Bland (summer undergraduate student)
National Aeronautics and Space Administration ABSTRACT Using version 1.3 of the Aquarius dataset, the spatial distribution and seasonal variability of.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
Oceanography 569 Oceanographic Data Analysis Laboratory Kathie Kelly Applied Physics Laboratory 515 Ben Hall IR Bldg class web site: faculty.washington.edu/kellyapl/classes/ocean569_.
ABSTRACT In situ and modeled water-column primary production (PPeu) were determined from seasonally IMECOCAL surveys and satellite data off Baja.
Oceanography 569 Oceanographic Data Analysis Laboratory Kathie Kelly Applied Physics Laboratory 515 Ben Hall IR Bldg class web site: faculty.washington.edu/kellyapl/classes/ocean569_.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Bivariate Distributions Overview. I. Exploring Data Describing patterns and departures from patterns (20%-30%) Exploring analysis of data makes use of.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Linear Regression Least Squares Method: the Meaning of r 2.
Verification methods - towards a user oriented verification WG5.
Statistics Numerical Representation of Data Part 2 – Measure of Variation.
SEASONAL COMMON PLOT SCORES A DRIANO R ASPANTI P ERFORMANCE DIAGRAM BY M.S T ESINI Sibiu - Cosmo General Meeting 2-5 September 2013.
Carr, Friedrichs et al. PPARR3, OCRT Meeting, April 2006 A study of marine primary productivity models, with an ocean color bias Primary Productivity Round.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Statistical Tools for Solar Resource Forecasting Vivek Vijay IIT Jodhpur Date: 16/12/2013.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
2nd GODAE Observing System Evaluation Workshop - June Ocean state estimates from the observations Contributions and complementarities of Argo,
Chapter 16 Data Analysis: Testing for Associations.
Chapter 3 Correlation.  Association between scores on two variables –e.g., age and coordination skills in children, price and quality.
Empirically Based Characteristics of Effect Sizes used in ANOVA J. Jackson Barnette, PhD Community and Behavioral Health College of Public Health University.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Progresses in IMaRS Caiyun Zhang Sept. 28, SST validation over Florida Keys 2. Potential application of ocean color remote sensing on deriving.
2.6 Scatter Diagrams. Scatter Diagrams A relation is a correspondence between two sets of data X is the independent variable Y is the dependent variable.
V This project is supported by the NASA Interdisciplinary Science Program Map of the U.S. East Coast showing the Mid-Atlantic and South Atlantic Bights.
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
GODAE OceanView-GSOP-CLIVAR workshop June Monitoring the Ocean State from the Observations Stéphanie Guinehut Sandrine Mulet Marie-Hélène.
Mathematical Studies for the IB Diploma © Hodder Education Pearson’s product–moment correlation coefficient.
Applied Quantitative Analysis and Practices LECTURE#07 By Dr. Osman Sadiq Paracha.
V. Rouillard  Introduction to measurement and statistical analysis CURVE FITTING In graphical form, drawing a line (curve) of best fit through.
1 An Assessment of the CFS real-time forecasts for Wanqiu Wang, Mingyue Chen, and Arun Kumar CPC/NCEP/NOAA.
Dealing with location in the valuation of office rents in London Multilevel and semi-parametric modelling Aniel Anand, 1 st July 2015.
Oceanography 569 Oceanographic Data Analysis Laboratory Kathie Kelly Applied Physics Laboratory 515 Ben Hall IR Bldg class web site: faculty.washington.edu/kellyapl/classes/ocean569_.
Verification methods - towards a user oriented verification The verification group.
Standard Deviation Variance and Range. Standard Deviation:  Typical distance of observations from their mean  A numerical summary that measures the.
S D.. In probability and statistics, the standard deviation is the most common measure of statistical dispersion. Simply put, standard deviation measures.
Analyze Of VAriance. Application fields ◦ Comparing means for more than two independent samples = examining relationship between categorical->metric variables.
RTOFS Monitoring and Evaluation Metrics Avichal Mehra MMAB/EMC/NCEP/NWS.
NSO8055 Okeanograafiline prognoos
J.C.A. CEPEDA-MORALES1 and G. GAXIOLA-CASTRO2 RESULTS AND DISCUSSION
CHAPTER 3 Describing Relationships
Making Sense of Statistics:
Presentation transcript:

Metrics for Model Skill Assessment Model Error time series (model-data misfit): –ME(i) = model - data Total Root-Mean-Square Error: RMS_Total RMS_Total 2 = RMS_Bias 2 + RMS_Variability 2 –RMS_Bias = difference between means –RMS_Variability (centered pattern RMS) = mean [difference of deviations from mean] RMS_Variability: Correlation, Amplitude --> Taylor diagram –Amplitude of deviations –Correlation of deviations

Taylor Plot Graphical relationship between time series based on four statistics: 1) Overall Mean Bias 2) Seasonal Variance Standard Deviation 3) Timing/Phase Correlation Coefficient 4) Root Mean Square Error Centered RMS Distance (RMS_V)

Taylor Diagram Example

old Run 751 new Run 801

Target Diagram as Skill Assessment Tool RMS_T 2 = RMS_B 2 + RMS_V 2 model-data misfit = variability in data model-data misfit = error in data SAB SST climatology

SST

Chlorophyll

Summary Taylor & Target diagrams are two complimentary ways of assessing model skill - Taylor: Correlation of variability Amplitude of variability (Bias) - Target: Total RMS Relative bias and variability components

SST Correlation: ~0.9 always satellite, in situ, 2004, climatology Amplitude of variability: good especially for satellite 2004 comparisons underestimate in FL, GA overestimate everywhere north of SC Bias : low underestimate in SAB in climatology, better using 2004 RMS_bias ≈ RMS_variability MLD Correlation: always positive Higher in MAB (.8) than SAB (.5) Higher in outer SAB (>.6) than inner SAB (<.4) Amplitude of variability: overestimate variability Except for MAB Outer shelf Bias : generally low typically overestimate (FL inner, DE outer) occasionally underestimate (FL, GA outer, MAB outer) Summary (cont.)

Surface chlorophyll - much greater challenge! Correlation: -0.6 to 0.9 (same for Clim and 2004) lower off NC, SC, NY Higher off FL, DE, NJ Amplitude of variability: so-so (worse for 2004 in SAB) underestimate in SAB overestimate in MAB Bias : large negative bias everywhere underestimate in GA, SC (benthic production?) underestimate on inner MAB shelf RMS_bias >> RMS_variability Little correlation between where MLD/SST is modeled well (poorly) and where chlorophyll is modeled well (poorly) Summary (cont.)

Use these Taylor/Target diagrams to compare runs With/without tides With/without DOM Plot other quantities: kPAR, productivity, oxygen, salinity Examine other regions: Gulf of Maine Gulf Stream/Sargasso Use these for the OCRT meeting? Use these for the Oceanography article? Future Work