D/RS 1013 Discriminant Analysis. Discriminant Analysis Overview n multivariate extension of the one-way ANOVA n looks at differences between 2 or more.

Slides:



Advertisements
Similar presentations
Discriminant Analysis and Classification. Discriminant Analysis as a Type of MANOVA  The good news about DA is that it is a lot like MANOVA; in fact.
Advertisements

Canonical Correlation simple correlation -- y 1 = x 1 multiple correlation -- y 1 = x 1 x 2 x 3 canonical correlation -- y 1 y 2 y 3 = x 1 x 2 x 3 The.
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Chapter 17 Overview of Multivariate Analysis Methods
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Linear Discriminant Function LDF & MANOVA LDF & Multiple Regression Geometric example of LDF & multivariate power Evaluating & reporting LDF results 3.
Discrim Continued Psy 524 Andrew Ainsworth. Types of Discriminant Function Analysis They are the same as the types of multiple regression Direct Discrim.
Multiple Regression Models Advantages of multiple regression Important preliminary analyses Parts of a multiple regression model & interpretation Differences.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
19-1 Chapter Nineteen MULTIVARIATE ANALYSIS: An Overview.
CSE 300: Software Reliability Engineering Topics covered: Software metrics and software reliability Software complexity and software quality.
Topic 3: Regression.
MACHINE LEARNING 6. Multivariate Methods 1. Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Motivating Example  Loan.
An Introduction to Classification Classification vs. Prediction Classification & ANOVA Classification Cutoffs, Errors, etc. Multivariate Classification.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Chapter 14 Inferential Data Analysis
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Simple Linear Regression Analysis
Discriminant Analysis Testing latent variables as predictors of groups.
Business Research Methods William G. Zikmund Chapter 24 Multivariate Analysis.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Regression with 2 IVs Generalization of Regression from 1 to 2 Independent Variables.
Classification (Supervised Clustering) Naomi Altman Nov '06.
CHAPTER 26 Discriminant Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
Some matrix stuff.
Chapter Eighteen Discriminant Analysis Chapter Outline 1) Overview 2) Basic Concept 3) Relation to Regression and ANOVA 4) Discriminant Analysis.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Multivariate Data Analysis CHAPTER seventeen.
1 Multivariate Analysis (Source: W.G Zikmund, B.J Babin, J.C Carr and M. Griffin, Business Research Methods, 8th Edition, U.S, South-Western Cengage Learning,
Business Research Methods William G. Zikmund Chapter 24 Multivariate Analysis.
Discriminant Analysis
Chapter 12 – Discriminant Analysis © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
Regression Analyses. Multiple IVs Single DV (continuous) Generalization of simple linear regression Y’ = b 0 + b 1 X 1 + b 2 X 2 + b 3 X 3...b k X k Where.
ANOVA and Linear Regression ScWk 242 – Week 13 Slides.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Multivariate Data Analysis Chapter 5 – Discrimination Analysis and Logistic Regression.
Multiple Linear Regression Partial Regression Coefficients.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
17-1 COMPLETE BUSINESS STATISTICS by AMIR D. ACZEL & JAYAVEL SOUNDERPANDIAN 6 th edition (SIE)
Multiple Discriminant Analysis
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 12 Making Sense of Advanced Statistical.
Adjusted from slides attributed to Andrew Ainsworth
Linear Discriminant Analysis and Its Variations Abu Minhajuddin CSE 8331 Department of Statistical Science Southern Methodist University April 27, 2002.
Linear Discriminant Analysis (LDA). Goal To classify observations into 2 or more groups based on k discriminant functions (Dependent variable Y is categorical.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Outline of Today’s Discussion 1.Introduction to Discriminant Analysis 2.Assumptions for Discriminant Analysis 3.Discriminant Analysis in SPSS.
Two-Group Discriminant Function Analysis. Overview You wish to predict group membership. There are only two groups. Your predictor variables are continuous.
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
Unit 7 Statistics: Multivariate Analysis of Variance (MANOVA) & Discriminant Functional Analysis (DFA) Chat until class starts.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L9.1 Lecture 9: Discriminant function analysis (DFA) l Rationale.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 18 Multivariate Statistics.
MANOVA Lecture 12 Nuance stuff Psy 524 Andrew Ainsworth.
Factorial BG ANOVA Psy 420 Ainsworth. Topics in Factorial Designs Factorial? Crossing and Nesting Assumptions Analysis Traditional and Regression Approaches.
DISCRIMINANT ANALYSIS. Discriminant Analysis  Discriminant analysis builds a predictive model for group membership. The model is composed of a discriminant.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Stats Methods at IC Lecture 3: Regression.
Chapter 12 – Discriminant Analysis
Chapter 12 Understanding Research Results: Description and Correlation
Multiple Discriminant Analysis and Logistic Regression
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Making Sense of Advanced Statistical Procedures in Research Articles
3 basic analytical tasks in bivariate (or multivariate) analyses:
Discrimination and Classification
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

D/RS 1013 Discriminant Analysis

Discriminant Analysis Overview n multivariate extension of the one-way ANOVA n looks at differences between 2 or more groups n goal is to discriminate between groups n considers several predictor variables simultaneously

Discriminant - Overview n provides a way to describe differences between groups in simple terms. n removes the redundancy among large numbers of variables by combining into a smaller number of Discriminant functions n can classify cases to groups when their group membership is unknown

Overview (cont.) n tests the significance of differences between two or more groups n examines several predictor variables simultaneously n construct linear combination of these variables, forming a single composite variable called a discriminant function n basically MANOVA flipped upside down

Discriminant parallels with MANOVA and Regression n Discriminant works the other way, predicting group membership by some kind of scores n The discriminant function takes the form: n D = d 1 z 1 + d 2 z d p z p

Discriminant functions n D i = d 1 z 1 + d 2 z d p z p –where, D = scores on the discriminant function –d 1 - d p = discriminant function weighting coefficients for each of p predictor variables –z 1 - z p = standardized scores on the original p variables

Unstandardized Functions n even more like regression equation n D i = a + d 1 x 1 + d 2 x d p x p –a= the discriminant function constant –d 1 - d p = discriminant function weighting coefficients for each of p predictor variables –x 1 - x p = raw scores on the original p variables

Forming discriminant functions n discriminant function is formed to maximize the F value associated with the D n F = bg variance on D / wg variance on D n provides a function with the greatest discriminating power.

Functions beyond the first n first function is one of many combinations of the p original predictor variables. n # of useful functions is p (# of original variables) or k-1 (k=# of groups being considered), whichever is smaller. n later functions maximize the separation between groups and are orthogonal with the preceding functions.

First discriminant function (3 gps) Separates group 1 from groups 2 & 3

Second function (3 gps) Separates group 3 from groups 1 & 2

Both functions together Orthogonal = uncorrelated

Confusion Matrix n assign cases to groups based on their discriminant function scores n assignments compared with actual group memberships n confusion matrix gives both overall accuracy of classification and the relative frequencies of various types of misclassification

Confusion matrix: example  our proportion correct is ( )/100=.82  by chance alone we would end up with.50 correct  if we evenly divided our group assignments  between A & B half in each group correct by chance  can consider prior probabilities, if known

Cross Validation n hold back some of the data to test the model that emerges n gives good idea of the kind of predictive accuracy we can expect for another sample n small samples and several variables unlikely to replicate across samples

Classification Functions n weights and constants used to calculate scores for each case n as many scores as there are groups for each case n assign to group that the case has the highest classification function score for

Assumptions n assumes that all predictors follow a multivariate normal distribution n test is robust with respect to normality, in practice, lack of normality doesn't make much of a difference n especially with large n and moderate number of predictors