Presentation is loading. Please wait.

Presentation is loading. Please wait.

PRINCIPAL COMPONENT ANALYSIS(PCA) EOFs and Principle Components; Selection Rules LECTURE 8 Supplementary Readings: Wilks, chapters 9.

Similar presentations


Presentation on theme: "PRINCIPAL COMPONENT ANALYSIS(PCA) EOFs and Principle Components; Selection Rules LECTURE 8 Supplementary Readings: Wilks, chapters 9."— Presentation transcript:

1 PRINCIPAL COMPONENT ANALYSIS(PCA) EOFs and Principle Components; Selection Rules LECTURE 8 Supplementary Readings: Wilks, chapters 9

2 WE’LL START OUT WITH AN EXAMPLE: 20th GLOBAL SURFACE TEMPERATURE RECORD

3 Climatic Research Unit (‘CRU’), University of East Anglia Surface Temperature Changes

4 EOFs for the five leading eigenvectors of the global temperature data from 1902-1980. The gridpoint areal weighting factor used in the PCA procedure has been removed from the EOFs so that relative temperature anomalies can be inferred from the patterns. 12% (88%) 6% (3%) 5% (1%) 4% (1%) 3% (0.5%) EOF #1 EOF #2 EOF #3 EOF #4 EOF #5

5 SURFACE TEMPERATURE RECORD FILTERED BY RETAINING PROJECTION ONTO WITH FIRST FIVE EIGENVECTORS FILTERING THROUGH PCA

6 GLOBAL TEMPERATURE TREND EOF #1 PC #1

7 Multivariate ENSO Index (“MEI”) EOF #2 PC #2 EL NINO/SOUTHERN OSCILLATION (ENSO)

8 NORTH ATLANTIC OSCILLATION EOF #3 PC #3

9 EOF #3 PC #3 NORTH ATLANTIC OSCILLATION

10 TROPICAL ATLANTIC “DIPOLE” EOF #3 PC #3

11 ATLANTIC MULTIDECADAL OSCILLATION EOF #5 PC #5

12 EOF #5 PC #5 ATLANTIC MULTIDECADAL OSCILLATION

13 EOF #5 PC #5 ATLANTIC MULTIDECADAL OSCILLATION

14 PCA as an SVD on the Data Matrix X

15 Recall from our earlier lecture the variance-covariance matrix A in the multivariate regression problem: The eigenvectors of A comprise an orthogonal predictor set (Principal Components Regression)

16 Let us return to the data matrix, (assume it has zero mean) We can write Where U,V are unitary matrices (orthogonal matrices if X is real-valued), U is MxN, S is diagonal NxN, and V is NxN Singular Value Decomposition (SVD) Assume M>N (overdetermined; greater number of “equations” than “unknowns”)

17 We can then write Where U, V are unitary matrices (orthogonal matrices if X is real-valued), U is NxM, S is diagonal MxM, and V is MxM Singular Value Decomposition (SVD) Typically, we are interested in the case N>M. A revised overdetermined problem can be obtained by redefining the problem:

18

19 V is a unitary matrix which diagonalizes XX T ! There is a mathematical equivalence between taking the Singular Value Decomposition (SVD) of X, and finding the eigenvectors of A=XX T Thus, S 2 contains the eigenvalues of XX T

20 U contains as its columns the temporal patterns or Principal Components (“PC”s) corresponding to the M eigenvalues, which are the “right eigenvectors” of the SVD: V contains the as its columns the Spatial Pattern or Empirical Orthogonal Function (“EOF”) corrresponding to the M eigenvalues, which are the “left eigenvectors” of the SVD:

21 We can filter the original data with a subset of M* eigenvectors: FILTERING WITH EIGENVECTORS

22 Standardization & Areal Weighting Gappy Data Frequency domain “Rotation” Selection Rules Some Additional Considerations:

23 How many eigenvectors do we consider significant? Eigenvalue > 1/M Break in slope in eigenvalue spectrum (“Scree” test) or log eigenvalue (“LEV”) spectrum Eigenvalue lies outside expected distribution for M uncorrelated Gaussian time series of length N (Preisendorfer Rule N). This is an example of a Monte Carlo method Rule N’ (take into account serial correlation) There is no uniquely defensible criterion... SELECTION RULES

24 Preisdendorfer Rule N SELECTION RULES

25 Asymptotic results of Preisendorfer Rule N for large sample size (N,M>100 or so) =N/M=N/M SELECTION RULES

26 MATLAB EXAMPLE: NORTH ATLANTIC SEA LEVEL PRESSURE DATA 1899-1999


Download ppt "PRINCIPAL COMPONENT ANALYSIS(PCA) EOFs and Principle Components; Selection Rules LECTURE 8 Supplementary Readings: Wilks, chapters 9."

Similar presentations


Ads by Google