What is EOF analysis? EOF = Empirical Orthogonal Function Method of finding structures (or patterns) that explain maximum variance in (e.g.) 2D (space-time)

Slides:



Advertisements
Similar presentations
Eigen Decomposition and Singular Value Decomposition
Advertisements

Principal Component Analysis (PCA) or Empirical Orthogonal Functions (EOFs) Arnaud Czaja (SPAT Data analysis lecture Nov. 2011)
Eigen Decomposition and Singular Value Decomposition
Noise & Data Reduction. Paired Sample t Test Data Transformation - Overview From Covariance Matrix to PCA and Dimension Reduction Fourier Analysis - Spectrum.
Exploratory Factor Analysis
PCA + SVD.
Environmental Data Analysis with MatLab Lecture 16: Orthogonal Functions.
Statistical tools in Climatology René Garreaud
Lecture 7: Principal component analysis (PCA)
Principal Component Analysis CMPUT 466/551 Nilanjan Ray.
Principal Component Analysis
Principal Component Analysis
3D Geometry for Computer Graphics
Principal component analysis (PCA)
The Hidden Message Some useful techniques for data analysis Chihway Chang, Feb 18’ 2009.
Principle Component Analysis What is it? Why use it? –Filter on your data –Gain insight on important processes The PCA Machinery –How to do it –Examples.
TFIDF-space  An obvious way to combine TF-IDF: the coordinate of document in axis is given by  General form of consists of three parts: Local weight.
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Microarray analysis Algorithms in Computational Biology Spring 2006 Written by Itai Sharon.
Ordinary least squares regression (OLS)
Correlation. The sample covariance matrix: where.
Statistical Methods for long-range forecast By Syunji Takahashi Climate Prediction Division JMA.
Measuring Functional Integration: Connectivity Analyses
Principle Component Analysis Presented by: Sabbir Ahmed Roll: FH-227.
Chapter 2 Dimensionality Reduction. Linear Methods
Presented By Wanchen Lu 2/25/2013
Next. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University.
Recap of PCA: what it does, how to do it Details of PCA presentation of results terminology scaling truncation of PCs interpretation of PCs Rotation of.
Extensions of PCA and Related Tools
The Multiple Correlation Coefficient. has (p +1)-variate Normal distribution with mean vector and Covariance matrix We are interested if the variable.
Eigen Decomposition Based on the slides by Mani Thomas Modified and extended by Longin Jan Latecki.
1 The Venzke et al. * Optimal Detection Analysis Jeff Knight * Venzke, S., M. R. Allen, R. T. Sutton and D. P. Rowell, The Atmospheric Response over the.
Canonical Correlation Analysis and Related Techniques Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia.
N– variate Gaussian. Some important characteristics: 1)The pdf of n jointly Gaussian R.V.’s is completely described by means, variances and covariances.
Interpreting Principal Components Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia University L i n.
Slicing and dicing the 153-year record of monthly sea level at San Francisco, California using singular spectrum analysis Larry Breaker Moss Landing Marine.
Laboratory in Oceanography: Data and Methods MAR599, Spring 2009 Anne-Marie E.G. Brunner-Suzuki Empirical Orthogonal Functions.
Central limit theorem revisited
Rotation matrices 1 Constructing rotation matricesEigenvectors and eigenvalues 0 x y.
Christina Bonfanti University of Miami- RSMAS MPO 524.
EIGENSYSTEMS, SVD, PCA Big Data Seminar, Dedi Gadot, December 14 th, 2014.
Principal Component Analysis Zelin Jia Shengbin Lin 10/20/2015.
Feature Extraction 主講人:虞台文. Content Principal Component Analysis (PCA) PCA Calculation — for Fewer-Sample Case Factor Analysis Fisher’s Linear Discriminant.
3 “Products” of Principle Component Analysis
Feature Extraction 主講人:虞台文.
Oceanography 569 Oceanographic Data Analysis Laboratory Kathie Kelly Applied Physics Laboratory 515 Ben Hall IR Bldg class web site: faculty.washington.edu/kellyapl/classes/ocean569_.
5. The Harmonic Oscillator Consider a general problem in 1D Particles tend to be near their minimum Taylor expand V(x) near its minimum Recall V’(x 0 )
Central limit theorem revisited Throw a dice twelve times- the distribution of values is not Gaussian Dice Value Number Of Occurrences.
Principal Components Analysis ( PCA)
Central limit theorem - go to web applet. Correlation maps vs. regression maps PNA is a time series of fluctuations in 500 mb heights PNA = 0.25 *
EMPIRICAL ORTHOGONAL FUNCTIONS 2 different modes SabrinaKrista Gisselle Lauren.
Canonical Correlation Analysis (CCA). CCA This is it! The mother of all linear statistical analysis When ? We want to find a structural relation between.
CS246 Linear Algebra Review. A Brief Review of Linear Algebra Vector and a list of numbers Addition Scalar multiplication Dot product Dot product as a.
PRINCIPAL COMPONENT ANALYSIS(PCA) EOFs and Principle Components; Selection Rules LECTURE 8 Supplementary Readings: Wilks, chapters 9.
CSE 554 Lecture 8: Alignment
Slicing and dicing the 153-year record of monthly
EMPIRICAL ORTHOGONAL FUNCTIONS
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Principal Component Analysis
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
ALL the following plots are subject to the filtering :
Eigen Decomposition Based on the slides by Mani Thomas
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Principal Component Analysis
Geology 491 Spectral Analysis
Canonical Correlation Analysis and Related Techniques
Eigen Decomposition Based on the slides by Mani Thomas
Presentation transcript:

What is EOF analysis? EOF = Empirical Orthogonal Function Method of finding structures (or patterns) that explain maximum variance in (e.g.) 2D (space-time) dataset Mathematically EOFs are eigenvectors of the covariance matrix of a dataset

Any (space-time) dataset can be represented as a matrix: X = M = X ij N Math

Define X T X T = N = X ji M Math

And covariance matrix C=XX T C = M N = M N M M

Math EOFs (e i ) are the eigenvectors of C C e i = e i

Math Principal components: Fourier coefficients of the corresponding EOFs in the time expansion of the dataset PC i (t) = (X T, e i ) Too easy, huh?

Math Why does 1st EOF maximize explained variance? Answer: by construction. (e T X,X t e) = ||e T X|| =  max (e T,e) = 1 Or: (e T,Ce) =, ( C = XX T ), Ce = e This maximizes on the eigenvector corresponding to the greatest eigenvalue. Amazing!

What do EOFs and PCs mean? EOF – a coherent orthogonal spatial pattern. First EOF explains most variance in a physical field PC – time behavior of the corresponding EOF (=spatial pattern) Stunning! Let ’ s EOF everything!

EOF interpretation Direction of maximum variance

Example 1. El-Nino.

Example 2. Arctic Oscillation.

Tropospheric Winter Trends Cohen et al, 2012, ERL

Northern Hemisphere Land Temperatures Data: CRU temperature Alexeev et al, 2012, Clim Change; Cohen et al, 2012, ERL

Major modes in the Northern Hemisphere

Influence of Cosmic Rays on Earth's Climate (H.Svensmark)

Why EOFs are not physical modes? Your equations: dx/dt + Ax = f Physical modes: eigenvectors of A. (Solve Ay = y) Physical modes are not orthogonal (generally speaking)

Why EOFs are not physical modes? Your equations: dx/dt + Ax = f EOFs – eigenvectors of a matrix derived from A A T EOFs: orthogonal by construction

Other methods SVD = Singular Value Decomposition, aka MCA = Maximum Correlation Analysis Method is looking for correlated spatial patterns in two different fields

Math Correlation matrix C XY =XY T C XY = M N = M N L L

Math SVD vectors of C: in U (X-field) and V (Y-field) matrices C XY = U  V T

Other methods CCA = Canonical Correlation Analysis: SVD over space of Fourier coefficients of EOFs

Other methods POP = Principal Oscillation Pattern Analysis FDT over space of Fourier coefficients of EOFs (FDT = Fluctuation-Dissipation Theorem)

POP = Principal Oscillating Patterns x n+1 = C x n +  (C = ‘step forward’ operator) Assume = 0 = C + We can approximate C from: C = C 0 C -1 1 Where C 0 =, C 1 =

Other methods Varimax, Quartimax, rotated EOF analysis EOF modifications

Other methods MTM = Multi-Taper Method Combination of EOF and Wavelet analyses

Other methods SSA = Singular Spectrum Analysis MSSA = Multi-channel SSA MTM-SVD EEOF = Extended EOF FDEOF = Frequency Domain EOF CEOF = Complex EOF

When is EOF analysis useful? Analysis of repeating pronounced patterns over long time series Image/data compression Filtering Not so fast….

When EOF use is inappropriate? Short time series, lots of missing and/or inconsistent data Absence of a prominent signal Presence of a dominant trend in the data (e.g. seasonal cycle is dangerous!)

Why do people get so excited about EOFs? EOFs can be applied to any dataset Simplicity of the analysis is very appealing. Everyone does EOFs. Patterns are often tempting to analyze (because of method ’ s simplicity)

Do not overdo it with EOFs! “ New ” patterns sometimes turn out to be not so new. Artificial (mechanistic) data de-trending can lead to surprises (example: removal of seasonal cycle does not remove changing seasonal variability in most of the fields)

Do scientists have problems interpreting EOF results? Saying “ I performed EOF analysis on my data ” does not mean you explained any physics EOFs usually do not coincide with eigen- modes of the physical process you are trying to interpret/explain. POP analysis does not give you orthonormal modes, but it might approximate your physical modes

Are results of EOF analysis accurate? Statistical significance is always an issue. If something correlates (even very well) with something else (or appears to be systematically preceding/following), this does not mean one causes the other. They both can be caused by something else.

What are EOF maniacs? People who eof (svd, cca …) everything with everything just for the sake of it

Are there many EOF/SVD/CCA maniacs out there? Yes, there are! (I am one of them)