1 The Venzke et al. * Optimal Detection Analysis Jeff Knight * Venzke, S., M. R. Allen, R. T. Sutton and D. P. Rowell, The Atmospheric Response over the.

Slides:



Advertisements
Similar presentations
Self-Organizing Maps Projection of p dimensional observations to a two (or one) dimensional grid space Constraint version of K-means clustering –Prototypes.
Advertisements

Component Analysis (Review)
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Generalised Inverses Modal Analysis and Modal Testing S. Ziaei Rad.
Climate change detection and attribution methods Exploratory Workshop DADA, Buenos Aires, Oct 2012 Francis Zwiers, Pacific Climate Impacts Consortium,
Tensors and Component Analysis Musawir Ali. Tensor: Generalization of an n-dimensional array Vector: order-1 tensor Matrix: order-2 tensor Order-3 tensor.
Face Recognition Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL
PCA + SVD.
1er. Escuela Red ProTIC - Tandil, de Abril, 2006 Principal component analysis (PCA) is a technique that is useful for the compression and classification.
Initialization Issues of Coupled Ocean-atmosphere Prediction System Climate and Environment System Research Center Seoul National University, Korea In-Sik.
The Predictive Skill and the Most Predictable Pattern in the Tropical Atlantic: The Effect of ENSO Zeng-Zhen Hu 1 Bohua Huang 1,2 1 Center for Ocean-Land-Atmosphere.
An introduction to Principal Component Analysis (PCA)
Principal Component Analysis
Principal Component Analysis
Unsupervised Learning - PCA The neural approach->PCA; SVD; kernel PCA Hertz chapter 8 Presentation based on Touretzky + various additions.
Spike-triggering stimulus features stimulus X(t) multidimensional decision function spike output Y(t) x1x1 x2x2 x3x3 f1f1 f2f2 f3f3 Functional models of.
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Lecture 11 Vector Spaces and Singular Value Decomposition.
ICA Alphan Altinok. Outline  PCA  ICA  Foundation  Ambiguities  Algorithms  Examples  Papers.
What is EOF analysis? EOF = Empirical Orthogonal Function Method of finding structures (or patterns) that explain maximum variance in (e.g.) 2D (space-time)
Statistical Methods for long-range forecast By Syunji Takahashi Climate Prediction Division JMA.
Summarized by Soo-Jin Kim
Chapter 2 Dimensionality Reduction. Linear Methods
Linear Algebra Review 1 CS479/679 Pattern Recognition Dr. George Bebis.
Extensions of PCA and Related Tools
CSE554AlignmentSlide 1 CSE 554 Lecture 5: Alignment Fall 2011.
Atlantic Multidecadal Variability and Its Climate Impacts in CMIP3 Models and Observations Mingfang Ting With Yochanan Kushnir, Richard Seager, Cuihua.
Linear Regression Andy Jacobson July 2006 Statistical Anecdotes: Do hospitals make you sick? Student’s story Etymology of “regression”
Canonical Correlation Analysis and Related Techniques Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Classification Course web page: vision.cis.udel.edu/~cv May 12, 2003  Lecture 33.
N– variate Gaussian. Some important characteristics: 1)The pdf of n jointly Gaussian R.V.’s is completely described by means, variances and covariances.
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
Paper review EOF: the medium is the message 報告人:沈茂霖 (Mao-Lin Shen) 2015/11/10 Seminar report.
Principal Components: A Mathematical Introduction Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia University.
BioSS reading group Adam Butler, 21 June 2006 Allen & Stott (2003) Estimating signal amplitudes in optimal fingerprinting, part I: theory. Climate dynamics,
Chapter 28 Cononical Correction Regression Analysis used for Temperature Retrieval.
Introduction to Linear Algebra Mark Goldman Emily Mackevicius.
Local Predictability of the Performance of an Ensemble Forecast System Liz Satterfield and Istvan Szunyogh Texas A&M University, College Station, TX Third.
Northwest European High Summer Climate Variability, the West African Monsoon and the Summer North Atlantic Oscillation Jim Hurrell, NCAR, & Chris Folland,
Principal Component Analysis (PCA)
WCRP Extremes Workshop Sept 2010 Detecting human influence on extreme daily temperature at regional scales Photo: F. Zwiers (Long-tailed Jaeger)
Multivariate time series analysis Bijan Pesaran Center for Neural Science New York University.
Intro. ANN & Fuzzy Systems Lecture 16. Classification (II): Practical Considerations.
Chapter 13 Discrete Image Transforms
Central limit theorem revisited Throw a dice twelve times- the distribution of values is not Gaussian Dice Value Number Of Occurrences.
Central limit theorem - go to web applet. Correlation maps vs. regression maps PNA is a time series of fluctuations in 500 mb heights PNA = 0.25 *
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Reduced echelon form Matrix equations Null space Range Determinant Invertibility Similar matrices Eigenvalues Eigenvectors Diagonabilty Power.
Unsupervised Learning II Feature Extraction
Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.
Data Science Dimensionality Reduction WFH: Section 7.3 Rodney Nielsen Many of these slides were adapted from: I. H. Witten, E. Frank and M. A. Hall.
CSE 554 Lecture 8: Alignment
Localization for Anisotropic Sensor Networks
STATISTICAL ORBIT DETERMINATION Kalman (sequential) filter
LECTURE 11: Advanced Discriminant Analysis
LECTURE 10: DISCRIMINANT ANALYSIS
9.3 Filtered delay embeddings
Machine Learning Dimensionality Reduction
Feature Space Based Watermarking in Multi-Images
Principal Component Analysis
Singular Value Decomposition North Atlantic SST
SVD: Physical Interpretation and Applications
SVD, PCA, AND THE NFL By: Andrew Zachary.
Introduction to Connectivity Analyses
Atlantic Ocean Forcing of North American and European Summer Climate
Maths for Signals and Systems Linear Algebra in Engineering Lectures 13 – 14, Tuesday 8th November 2016 DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR)
LECTURE 09: DISCRIMINANT ANALYSIS
Principal Component Analysis
Presentation transcript:

1 The Venzke et al. * Optimal Detection Analysis Jeff Knight * Venzke, S., M. R. Allen, R. T. Sutton and D. P. Rowell, The Atmospheric Response over the North Atlantic to Decadal Changes in Sea Surface Temperature, J. Clim, 12, , 1999.

2 Ensembles Initial-value ensembles of climate model calculations allow estimation of the climate response to forcings, as multiple realisations of the chaotic noise inherent in the climate system are available. The independence of these noise components in each simulation allows it to be ‘averaged out’ by the formation of an ensemble mean.

3 Ensemble means For data X from calculation k: X k = X M + X Nk, where X M is the ensemble mean and X Nk is the noise in k. For large enough ensemble size n, the forced signal in X is X F = E(X M )  X M.

4 EOF analysis A common way to analyse the forced model response is principal component analysis (PCA, or EOF analysis) of the ensemble mean. PCA aims to find the covarying spatial patterns that have maximum variance in the data. In essence we can treat the mean X M as a set of time series for each point, X M (i, j) ; i = 1, l ; j = 1, m i.e. the arrangement of points irrelevant.

5 SVD analysis In general, an l×m matrix can be decomposed X l,m = E l,m S m,m P T l,m, where the columns of E are the eof patterns of X, S is a diagonal matrix of the variance coefficients (square eigenvalues) and P contains the associated principal component time series. A Singular Value Decomposition.

6 The trouble with this… EOF analysis of means of realistic size climate ensembles will only give accurate estimates of the EOFs of X F if the noise is not spatially coherent. But this is not the case. Even worse, it is likely that the patterns of noise are in fact quite similar to the signal (e.g. NAO). So the ensemble mean EOFs are heavily biased away from the forced response towards the noise.

7 The way forward Ensemble averaging throws out a lot of the information about the spatial characteristics of the noise. The aim of the Venzke technique is to estimate the properties of the noise so better account of its influence can be made in estimating the forced EOFs.

8 Technique We can identify the noise in each experiment: X Nk = X k - X M, Rewrite the noise X N as the concatenation of each run X Nk : X N = {X N1 (i, j), X N2 (i, j), …, X Nn (i, j)}, i = 1, l ; j = 1, m, making a matrix of size l  nm.

9 Noise EOFs Next, compute the EOFs of the noise by SVD: X N = E N S N P N T, so the columns of E N are the EOF patterns of the noise. These patterns can be used to define a new BASIS for the ensemble mean, i.e. we seek to express the ensemble mean in terms of these EOF patterns. We choose the  best defined patterns to do this.

10 Prewhitening Function If we define a matrix function, F = n ½ E N (  ) (S N (  ) ) -1, and its (pseudo-) inverse, F (-1) = n -½ E N (  ) S N (  ), we notice that multiplying by F projects onto the noise EOFs and scales the result by the inverse singular values, or the variance.

11 Prewhitening So the transform X M = F T X M converts the ensemble mean into a set of time series of projections of the mean on the noise EOFs, such that each EOF has the same weight in the result. In other words, we have transformed the data into a basis made of these EOFs.

12 EOFs of transformed mean As we now have a mean in which the signals lie amongst unit-variance noise EOFs, we can isolate them by further EOF analysis: X M = ESP T, so that E’ contains the EOFs under this new basis. But beware! The weighting at each point in the EOF relates to the strength of each noise pattern, not the spatial weight.

13 Back to physical space To get back to a physical basis from the noise EOF basis, we need to invert the transform based on noise EOFs, Ê = F (-1) E, which also rescales, giving us back an l×  matrix i.e. the truncated number of optimal EOF patterns of l points each. Projection on the mean gives the principal components.

14 To summarise Concatenate the noise estimates for each realisation, X Nk Derive joint noise EOF patterns, E N Project these patterns onto the ensemble mean, scaling the EOFs by their variance Find the signal by deriving EOFs in this basis Transform these EOFs back to physical space

15 An Example Courtesy of Rowan Sutton We create a synthetic signal on a 20×20 grid (l = 400) by taking a simple harmonic pattern and modulating through m = 50 times with a sine of period 20.

16 Signal Pattern

17 Noise Patterns Generate random noise by associating a random time series with each 2D harmonic pattern H i,j, for i,j = 1,5, and weighting the variance by e -(i+j). Repeat this for n = 6 ensemble members.

18 Noise Patterns

19 Synthetic Data

20 Ensemble Mean EOFs

21 Noise EOFs

22 EOFs in the noise basis

23 Optimised 1st EOF

24 Conclusions Venzke et al. find that Principal Component Analysis of the mean of realistic ensembles is heavily contaminated by the noise and so provides a poor estimate of the EOFs of the forced response. By using information about the spatial characteristics of the noise in the ensemble, these estimates can be improved.