Canonical Correlation. Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of.

Slides:



Advertisements
Similar presentations
Multiple Analysis of Variance – MANOVA
Advertisements

Canonical Correlation
StatisticalDesign&ModelsValidation. Introduction.
Chapter 17 Overview of Multivariate Analysis Methods
MULTIPLE REGRESSION. OVERVIEW What Makes it Multiple? What Makes it Multiple? Additional Assumptions Additional Assumptions Methods of Entering Variables.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Psych 524 Andrew Ainsworth Data Screening 1. Data check entry One of the first steps to proper data screening is to ensure the data is correct Check out.
1 Canonical Analysis Introduction Assumptions Model representation An output example Conditions Procedural steps Mechanical steps - with the use of attached.
Canonical Correlation: Equations Psy 524 Andrew Ainsworth.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
ANCOVA Psy 420 Andrew Ainsworth. What is ANCOVA?
19-1 Chapter Nineteen MULTIVARIATE ANALYSIS: An Overview.
What Is Multivariate Analysis of Variance (MANOVA)?
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Analysis of Variance & Multivariate Analysis of Variance
Slide 1 Testing Multivariate Assumptions The multivariate statistical techniques which we will cover in this class require one or more the following assumptions.
Discriminant Analysis Testing latent variables as predictors of groups.
Business Research Methods William G. Zikmund Chapter 24 Multivariate Analysis.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Inference for regression - Simple linear regression
Multivariate Statistical Data Analysis with Its Applications
Multivariate Data Analysis Chapter 8 - Canonical Correlation Analysis.
By Hui Bian Office for Faculty Excellence 1. K-group between-subjects MANOVA with SPSS Factorial between-subjects MANOVA with SPSS How to interpret SPSS.
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
بسم الله الرحمن الرحیم.. Multivariate Analysis of Variance.
1 Multivariate Analysis (Source: W.G Zikmund, B.J Babin, J.C Carr and M. Griffin, Business Research Methods, 8th Edition, U.S, South-Western Cengage Learning,
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Business Research Methods William G. Zikmund Chapter 24 Multivariate Analysis.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
CANONICAL CORRELATION. Return to MR  Previously, we’ve dealt with multiple regression, a case where we used multiple independent variables to predict.
 Muhamad Jantan & T. Ramayah School of Management, Universiti Sains Malaysia Data Analysis Using SPSS.
Canonical Correlation Psy 524 Andrew Ainsworth. Matrices Summaries and reconfiguration.
Chapter 16 Data Analysis: Testing for Associations.
Adjusted from slides attributed to Andrew Ainsworth
Multivariate Data Analysis Chapter 1 - Introduction.
Right Hand Side (Independent) Variables Ciaran S. Phibbs.
Examining Relationships in Quantitative Research
ANCOVA. What is Analysis of Covariance? When you think of Ancova, you should think of sequential regression, because really that’s all it is Covariate(s)
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Applied Quantitative Analysis and Practices LECTURE#30 By Dr. Osman Sadiq Paracha.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
D/RS 1013 Data Screening/Cleaning/ Preparation for Analyses.
DTC Quantitative Research Methods Regression I: (Correlation and) Linear Regression Thursday 27 th November 2014.
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
ANCOVA.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
D/RS 1013 Discriminant Analysis. Discriminant Analysis Overview n multivariate extension of the one-way ANOVA n looks at differences between 2 or more.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
MANOVA Lecture 12 Nuance stuff Psy 524 Andrew Ainsworth.
PROFILE ANALYSIS. Profile Analysis Main Point: Repeated measures multivariate analysis One/Several DVs all measured on the same scale.
Topics, Summer 2008 Day 1. Introduction Day 2. Samples and populations Day 3. Evaluating relationships Scatterplots and correlation Day 4. Regression and.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Canonical Correlation Analysis (CCA). CCA This is it! The mother of all linear statistical analysis When ? We want to find a structural relation between.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard)   Week 5 Multiple Regression  
Multiple Regression Prof. Andy Field.
The Correlation Coefficient (r)
Multiple Regression.
Chapter 12: Regression Diagnostics
BIVARIATE REGRESSION AND CORRELATION
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
CH2. Cleaning and Transforming Data
Regression is the Most Used and Most Abused Technique in Statistics
Multiple Regression Chapter 14.
MANOVA Control of experimentwise error rate (problem of multiple tests). Detection of multivariate vs. univariate differences among groups (multivariate.
Checking Assumptions Primary Assumptions Secondary Assumptions
The Correlation Coefficient (r)
Presentation transcript:

Canonical Correlation

Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of multiple dependent variables and multiple independent variables. Whereas multiple regression analysis is used to predict the value of a single (metric) dependent variable from a linear function of a set of independent variables, canonical correlation analysis predicts multiple dependent variables from multiple independent variables. Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of multiple dependent variables and multiple independent variables. Whereas multiple regression analysis is used to predict the value of a single (metric) dependent variable from a linear function of a set of independent variables, canonical correlation analysis predicts multiple dependent variables from multiple independent variables.

Canonical Correlation l Measuring the relationship between two separate sets of variables. l This is also considered multivariate multiple regression (MMR) l Measuring the relationship between two separate sets of variables. l This is also considered multivariate multiple regression (MMR)

Canonical Correlation l Often called Set correlation n Set 1 n Set 2 p doesn’t have to equal q l Number of cases required ≈ 10 per variable in the social sciences where typical reliability is.80, if higher reliability than less subjects per variable are sufficient. l Often called Set correlation n Set 1 n Set 2 p doesn’t have to equal q l Number of cases required ≈ 10 per variable in the social sciences where typical reliability is.80, if higher reliability than less subjects per variable are sufficient.

Canonical Correlation l In general, CanCorr is a method that basically does multiple regression on both sides of the equation l This isn’t really what happens but you can think of this way in general. l In general, CanCorr is a method that basically does multiple regression on both sides of the equation l This isn’t really what happens but you can think of this way in general.

Canonical Correlation l A better way to think about it: n Creating some single variable that represents the Xs and another single variable that represents the Ys. n This could be by merely creating composites (e.g. sum or mean) n Or by creating linear combinations of variables based on shared variance: l A better way to think about it: n Creating some single variable that represents the Xs and another single variable that represents the Ys. n This could be by merely creating composites (e.g. sum or mean) n Or by creating linear combinations of variables based on shared variance:

Canonical Correlation l Make a note that the arrows are coming from the measured variables to the canonical variates.

BackgroundBackground l Canonical Correlation is one of the most general multivariate forms – multiple regression, discriminate function analysis and MANOVA are all special cases of CanCorr l It is essentially a correlational method. l In multiple regression the linear combinations of Xs we use to predict y is really a single canonical variate. l Canonical Correlation is one of the most general multivariate forms – multiple regression, discriminate function analysis and MANOVA are all special cases of CanCorr l It is essentially a correlational method. l In multiple regression the linear combinations of Xs we use to predict y is really a single canonical variate.

QuestionsQuestions l How strongly does a set of variables relate to another set of variables? That is how strong is the canonical correlation? l How strongly does a variables relate to its own canonical correlate? l How strongly does a variable relate to the other set’s canonical variate? l How strongly does a set of variables relate to another set of variables? That is how strong is the canonical correlation? l How strongly does a variables relate to its own canonical correlate? l How strongly does a variable relate to the other set’s canonical variate?

Canonical Correlation In Canonical correlation, you have two sets of two or more interval-level (scale) variables each and you want to see how differences in one set relate to differences in the other set of variables. With canonical correlation, unlike regression, there is no distinction between independent and dependent variables; they are called by SPSS “Set 1” and “Set 2”. One would use canonical correlation when the variables in each set can be grouped together conceptually, but you want to see if there are particular subsets of them that relate to subsets in the other variable set, so you do not want to sum each set to make an overall score.

Canonical Correlation: Conditions and Assumptions ASSUMPTIONS: linearity of relationship (between each variable pair as well as between the variables and the linear composites). Multivariate normality (evaluate univariate normality - because multivariate normality is difficult to assess) Homoscedasticity (evaluate using matrix scatterplot of the canonical variate scores) Multicollinearity (evaluate using matrix scatterplot of the canonical variate scores) CONDITIONS: All variables in canonical correlation must be scale. It is recommended to have at least 10 subjects per variable in order to have adequate power.

AssumptionsAssumptions l Multicollinearity Check Set 1 and Set 2 separately n Run correlations and use the collinearity diagnostics function in regular multiple regression l Outliers – Check for both univariate and multivariate outliers on both set 1 and set 2 separately l Multicollinearity Check Set 1 and Set 2 separately n Run correlations and use the collinearity diagnostics function in regular multiple regression l Outliers – Check for both univariate and multivariate outliers on both set 1 and set 2 separately

AssumptionsAssumptions l Normality n Univariate – univariate normality is not explicitly required for MMR n Multivariate – multivariate normality is required and there is no way to test for except establishing univariate normality on all variables, even though this is still no guarantee. l Normality n Univariate – univariate normality is not explicitly required for MMR n Multivariate – multivariate normality is required and there is no way to test for except establishing univariate normality on all variables, even though this is still no guarantee.

AssumptionsAssumptions l Linearity – linear relationship assumed for all variables in each set and also between sets l Homoscedasticity (that is, when one vari- able exhibits similar amount of variance across the range of values for the other variable) – needs to be checked for all pairs of variables within and between sets. l Linearity – linear relationship assumed for all variables in each set and also between sets l Homoscedasticity (that is, when one vari- able exhibits similar amount of variance across the range of values for the other variable) – needs to be checked for all pairs of variables within and between sets.

Example: Canonical Correlation File: hsbdataNew.sav First, download the files “Canonical correlation.sps”, “Canonical Corr_MANOVA_Syntax1” and “Canonical Corr_Syntax2” to C: drive. Open Canonical correlation.sps and Run it, by selecting the entire contents. In the same worksheet, Open the data file hsbdataNew.sav Open File – New – Syntax Copy the contents of Canonical Corr_MANOVA_Syntax1. Highlight/Select the contents and Run. We get the output for checking the assumptions of Canonical Correlation. Open File – New – Syntax Copy the contents of Canonical Corr_Syntax2. Highlight/Select the contents and Run. We get the output for Canonical Correlation analysis.

Example: Canonical Correlation File: hsbdataNew.sav

Canonical Correlation (Cont.) Canonical Correlation (Cont.)