CHAPTER 26 Discriminant Analysis From: McCune, B. & J. B. Grace. 2002. Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.

Tables, Figures, and Equations

Purposes:
1. Summarizing the differences between groups (often used as a follow-up to clustering, to help describe the groups); "descriptive discriminant analysis." With community data, you could use indicator species analysis as a nonparametric alternative.
2. Multivariate testing of whether two or more groups differ significantly from each other. For ecological community data this is better done with MRPP, thus avoiding the assumptions listed below.
3. Determining the dimensionality of group differences.
4. Checking for misclassified items.

Purposes (cont.):
5. Predicting group membership or classifying new cases ("predictive discriminant analysis").
6. Comparing occupied vs. unoccupied habitat to determine the habitat characteristics that allow or prevent a species' existence. DA has been widely used for this purpose in wildlife studies and rare plant studies.

Assumptions:
1. Homogeneous within-group variances.
2. Multivariate normality within groups.
3. Linearity among all pairs of variables.
4. Prior probabilities.

How it works
The "direct" procedure is described below.
1. Calculate the variance/covariance matrix for each group.
2. Calculate the pooled variance/covariance matrix ($S_p$) from the above matrices.
3. Calculate the between-group variance ($S_g$) for each variable.
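Steps 1 to 3 can be sketched in NumPy as follows. This is a minimal illustration, not the authors' code: the function name `pooled_and_between` is hypothetical, and it assumes that the pooled matrix is the within-group sums of squares and cross-products divided by n - g, and that the between-group matrix is divided by g - 1. Its diagonal holds the between-group variance of each variable.

```python
import numpy as np

def pooled_and_between(A, groups):
    """Return (S_p, S_g) for data A (n x p) and a length-n vector of group labels.

    Assumption: S_p pools the group matrices with weights (n_k - 1),
    and S_g uses g - 1 degrees of freedom.
    """
    A = np.asarray(A, dtype=float)
    groups = np.asarray(groups)
    labels = np.unique(groups)
    n, p = A.shape
    g = len(labels)
    grand_mean = A.mean(axis=0)

    within_sscp = np.zeros((p, p))
    between_sscp = np.zeros((p, p))
    for k in labels:
        A_k = A[groups == k]
        mean_k = A_k.mean(axis=0)
        # Step 1: deviations within group k (their SSCP is (n_k - 1) * cov_k)
        dev = A_k - mean_k
        within_sscp += dev.T @ dev
        # Step 3: deviation of the group mean from the grand mean
        d = (mean_k - grand_mean)[:, None]
        between_sscp += len(A_k) * (d @ d.T)

    S_p = within_sscp / (n - g)    # Step 2: pooled variance/covariance matrix
    S_g = between_sscp / (g - 1)   # between-group matrix
    return S_p, S_g

# Small made-up example: two groups of four sample units, two variables
A = [[1, 2], [2, 1], [2, 3], [3, 2], [7, 8], [8, 7], [8, 9], [9, 8]]
groups = [0, 0, 0, 0, 1, 1, 1, 1]
S_p, S_g = pooled_and_between(A, groups)
```

In this made-up example the groups differ only in their means, so `S_p` is small relative to `S_g`.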

4. Maximize the F-ratio:
$$F = \frac{y' S_g y}{y' S_p y}$$
where y is the eigenvector associated with a particular discriminant function. We seek y to maximize F.

5. Maximize this ratio by finding the partial derivatives with a characteristic equation:
$$\left| S_p^{-1} S_g - \lambda I \right| = 0$$
The number of roots is g-1, where g is the number of groups. In other words, the number of functions (axes) derived is one less than the number of groups. The eigenvalues λ, each taken as a share of their sum, thus express the percent of variance among groups explained by those axes.

6. Solve for each eigenvector y (also known as the "canonical variates" or "discriminant functions").
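The eigenanalysis in steps 4 to 6 can be sketched as below. The sketch assumes that the matrix being decomposed is $S_p^{-1} S_g$, which is what maximizing the ratio of between-group to pooled within-group variance leads to. The function name `discriminant_axes` and the example matrices are hypothetical.

```python
import numpy as np

def discriminant_axes(S_p, S_g, n_groups):
    """Eigenvalues, eigenvectors, and percent of among-group variance per axis.

    Assumption: the discriminant functions are eigenvectors of S_p^{-1} S_g.
    """
    S_p = np.asarray(S_p, dtype=float)
    S_g = np.asarray(S_g, dtype=float)
    # solve(S_p, S_g) gives S_p^{-1} S_g without forming the inverse explicitly
    eigvals, eigvecs = np.linalg.eig(np.linalg.solve(S_p, S_g))
    eigvals, eigvecs = eigvals.real, eigvecs.real  # drop numerical noise
    # Keep the g - 1 largest roots (never more than the number of variables)
    m = min(n_groups - 1, S_p.shape[0])
    keep = np.argsort(eigvals)[::-1][:m]
    lam = eigvals[keep]
    Y = eigvecs[:, keep]          # one column per discriminant function
    pct = 100.0 * lam / lam.sum()  # percent of among-group variance per axis
    return lam, Y, pct

# Made-up matrices for two groups and two variables
S_p = np.array([[2.0, 0.0], [0.0, 1.0]])
S_g = np.array([[4.0, 2.0], [2.0, 1.0]])
lam, Y, pct = discriminant_axes(S_p, S_g, n_groups=2)
```

With two groups only one axis is retained, so that axis accounts for all of the among-group variance.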

7. Locate points (sample units) on each axis:
$$X = A Y$$
where
X = scores (coordinates) for n rows (sample units) on m dimensions, where m = g-1;
A = original data matrix of n rows by p columns;
Y = matrix of m eigenvectors with loadings for p variables. Each eigenvector is known as a discriminant function.
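The dimensions in step 7 imply a matrix product of the data (n by p) with the eigenvectors (p by m), giving scores that are n by m. A minimal sketch under that assumption, with a hypothetical function name and made-up numbers:

```python
import numpy as np

def discriminant_scores(A, Y):
    """Project n sample units (rows of A, n x p) onto m axes (columns of Y, p x m)."""
    A = np.asarray(A, dtype=float)
    Y = np.asarray(Y, dtype=float)
    return A @ Y  # n x m matrix of scores

# Four sample units, two variables, one discriminant function (made-up values)
A = [[1, 2], [3, 4], [5, 6], [7, 8]]
Y = [[0.6], [0.8]]
X = discriminant_scores(A, Y)
```

Each row of `X` is the position of one sample unit on the discriminant axis.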

These unstandardized discriminant functions Y can be used as (linear) prediction equations, assigning scores to unclassified items. Standardized discriminant function coefficients are scaled to unit variance. The absolute values of these coefficients indicate the relative importance of the individual variables in contributing to the discriminant function.

8. Classification phase.
a. Derive a classification equation for each group, with one term in the equation for each variable, plus a constant.
b. Insert data values for a given SU to calculate a classification score for each group for that SU.
c. The SU is assigned to the group for which it has the highest score.

The coefficients in the equation are derived from the p × p within-group variance-covariance matrix ($S_p$) and the p × 1 vector of the means for each variable in group k, $M_k$. First, calculate W by dividing each term of $S_p$ by the within-group degrees of freedom. Then:
$$C_k = W^{-1} M_k$$
The constant is derived as:
$$c_{k0} = -\tfrac{1}{2}\, C_k' M_k$$
The constant and the coefficients in $C_k$ define a linear equation of the usual form, one equation for each group k.
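The classification phase can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: it takes the coefficients as $W^{-1} M_k$ and the constant as $-\tfrac{1}{2} C_k' M_k$, builds W from the within-group sums of squares and cross-products divided by n - g, and ignores prior probabilities. The function names and the data are hypothetical.

```python
import numpy as np

def classification_functions(A, groups):
    """Return (labels, C, c0): row k of C holds the coefficients for group k,
    and c0[k] is that group's constant. Priors are not used in this sketch."""
    A = np.asarray(A, dtype=float)
    groups = np.asarray(groups)
    labels = np.unique(groups)
    n, p = A.shape
    sscp = np.zeros((p, p))
    means = []
    for k in labels:
        A_k = A[groups == k]
        M_k = A_k.mean(axis=0)
        dev = A_k - M_k
        sscp += dev.T @ dev
        means.append(M_k)
    W = sscp / (n - len(labels))        # divide by within-group degrees of freedom
    M = np.array(means)                 # g x p matrix of group means
    C = np.linalg.solve(W, M.T).T       # row k is W^{-1} M_k
    c0 = -0.5 * np.sum(C * M, axis=1)   # constant for each group
    return labels, C, c0

def classify(x, labels, C, c0):
    """Score one sample unit for every group and pick the highest score."""
    scores = c0 + C @ np.asarray(x, dtype=float)
    return labels[np.argmax(scores)], scores

A = [[1, 2], [2, 1], [2, 3], [3, 2], [7, 8], [8, 7], [8, 9], [9, 8]]
groups = [0, 0, 0, 0, 1, 1, 1, 1]
labels, C, c0 = classification_functions(A, groups)
assigned, scores = classify([2, 2], labels, C, c0)
```

Here the sample unit at (2, 2) sits at the mean of group 0 and receives its highest score for that group.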

Summary statistics
- Wilks' lambda (Λ). Wilks' lambda is the error sum of squares divided by the sum of the effect sum of squares and the error sum of squares. Thus, it is the proportion of variance among the objects not explained by the discriminant functions. It ranges from zero (perfect separation of groups) to one (no separation of groups). Statistical significance of lambda is tested with a chi-square approximation.
- Chi-square (derived from Wilks' lambda).
- Variance explained.
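A sketch of how Wilks' lambda might be computed is given below. It assumes the usual multivariate form, the determinant of the within-group (error) SSCP matrix divided by the determinant of the total SSCP matrix, and it assumes Bartlett's approximation for the chi-square statistic; the slides do not say which approximation is used. The function name and the data are hypothetical.

```python
import math
import numpy as np

def wilks_lambda(A, groups):
    """Return (lambda, chi-square, degrees of freedom).

    Assumptions: lambda = |W| / |T| with W the within-group SSCP and T the
    total SSCP; chi-square uses Bartlett's approximation.
    """
    A = np.asarray(A, dtype=float)
    groups = np.asarray(groups)
    labels = np.unique(groups)
    n, p = A.shape
    g = len(labels)

    dev_total = A - A.mean(axis=0)
    T = dev_total.T @ dev_total          # total SSCP (effect + error)
    W = np.zeros((p, p))                 # error (within-group) SSCP
    for k in labels:
        A_k = A[groups == k]
        dev = A_k - A_k.mean(axis=0)
        W += dev.T @ dev

    lam = np.linalg.det(W) / np.linalg.det(T)
    chi2 = -(n - 1 - (p + g) / 2.0) * math.log(lam)
    df = p * (g - 1)
    return lam, chi2, df

A = [[1, 2], [2, 1], [2, 3], [3, 2], [7, 8], [8, 7], [8, 9], [9, 8]]
groups = [0, 0, 0, 0, 1, 1, 1, 1]
lam, chi2, df = wilks_lambda(A, groups)
```

For these well-separated made-up groups lambda is close to zero, consistent with the interpretation above.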

Figure Comparison of DA and PCA. Groups are tighter in DA than in PCA because DA maximizes group separation while PCA maximizes the representation of variance among individual points. Groups were superimposed on an ordination of pine species in ecological trait space (after McCune 1988). Pinus resinosa was not assigned to a group, so it does not appear in the DA ordination.

Table Predictions of goshawk nesting sites from DA compared to actual results, in one case using equal prior probabilities, in the other case using prior probabilities based on the occupancy rate of landscape cells. The first value of 0.83 means that 83% of the sites that were predicted by DA to be nesting sites actually were nesting sites.

EQUAL priors:
False positives: No. non-nests predicted nests = p(predicted nest but not nest) × number of non-nests = 0.17 × 93 = 15.8
False negatives: No. nests predicted non-nests = p(predicted not nest but nest) × number of nests = 0.17 × 7 = 1.2
Total number of errors = 15.8 + 1.2 = 17

UNEQUAL priors:
False positives: No. non-nests predicted nests = p(predicted nest but not nest) × number of non-nests = 0.02 × 93 = 1.9
False negatives: No. nests predicted non-nests = p(predicted not nest but nest) × number of nests = 0.52 × 7 = 3.6
Total number of errors = 1.9 + 3.6 = 5.5
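The error counts for the two choices of priors can be checked with a few lines of arithmetic. The rates (0.17, 0.02, 0.52) and the counts (93 non-nests, 7 nests) are taken from the slides above; the function name `expected_errors` is hypothetical.

```python
def expected_errors(p_false_pos, p_false_neg, n_non_nests, n_nests):
    """Expected false positives, false negatives, and their total."""
    false_pos = p_false_pos * n_non_nests   # non-nests predicted to be nests
    false_neg = p_false_neg * n_nests       # nests predicted to be non-nests
    return false_pos, false_neg, false_pos + false_neg

equal = expected_errors(0.17, 0.17, n_non_nests=93, n_nests=7)
unequal = expected_errors(0.02, 0.52, n_non_nests=93, n_nests=7)

for name, (fp, fn, total) in (("equal", equal), ("unequal", unequal)):
    print(f"{name} priors: false positives {fp:.1f}, "
          f"false negatives {fn:.1f}, total {total:.1f}")
```

Unequal priors miss a larger share of the actual nests but produce far fewer total errors, because non-nests greatly outnumber nests.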