Presentation is loading. Please wait.

Presentation is loading. Please wait.

Canonical Correlation. Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of.

Similar presentations


Presentation on theme: "Canonical Correlation. Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of."— Presentation transcript:

1 Canonical Correlation

2 Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of multiple dependent variables and multiple independent variables. Whereas multiple regression analysis is used to predict the value of a single (metric) dependent variable from a linear function of a set of independent variables, canonical correlation analysis predicts multiple dependent variables from multiple independent variables. Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of multiple dependent variables and multiple independent variables. Whereas multiple regression analysis is used to predict the value of a single (metric) dependent variable from a linear function of a set of independent variables, canonical correlation analysis predicts multiple dependent variables from multiple independent variables.

3 Canonical Correlation l Measuring the relationship between two separate sets of variables. l This is also considered multivariate multiple regression (MMR) l Measuring the relationship between two separate sets of variables. l This is also considered multivariate multiple regression (MMR)

4 Canonical Correlation l Often called Set correlation n Set 1 n Set 2 p doesn’t have to equal q l Number of cases required ≈ 10 per variable in the social sciences where typical reliability is.80, if higher reliability than less subjects per variable are sufficient. l Often called Set correlation n Set 1 n Set 2 p doesn’t have to equal q l Number of cases required ≈ 10 per variable in the social sciences where typical reliability is.80, if higher reliability than less subjects per variable are sufficient.

5 Canonical Correlation l In general, CanCorr is a method that basically does multiple regression on both sides of the equation l This isn’t really what happens but you can think of this way in general. l In general, CanCorr is a method that basically does multiple regression on both sides of the equation l This isn’t really what happens but you can think of this way in general.

6 Canonical Correlation l A better way to think about it: n Creating some single variable that represents the Xs and another single variable that represents the Ys. n This could be by merely creating composites (e.g. sum or mean) n Or by creating linear combinations of variables based on shared variance: l A better way to think about it: n Creating some single variable that represents the Xs and another single variable that represents the Ys. n This could be by merely creating composites (e.g. sum or mean) n Or by creating linear combinations of variables based on shared variance:

7 Canonical Correlation l Make a note that the arrows are coming from the measured variables to the canonical variates.

8 BackgroundBackground l Canonical Correlation is one of the most general multivariate forms – multiple regression, discriminate function analysis and MANOVA are all special cases of CanCorr l It is essentially a correlational method. l In multiple regression the linear combinations of Xs we use to predict y is really a single canonical variate. l Canonical Correlation is one of the most general multivariate forms – multiple regression, discriminate function analysis and MANOVA are all special cases of CanCorr l It is essentially a correlational method. l In multiple regression the linear combinations of Xs we use to predict y is really a single canonical variate.

9 QuestionsQuestions l How strongly does a set of variables relate to another set of variables? That is how strong is the canonical correlation? l How strongly does a variables relate to its own canonical correlate? l How strongly does a variable relate to the other set’s canonical variate? l How strongly does a set of variables relate to another set of variables? That is how strong is the canonical correlation? l How strongly does a variables relate to its own canonical correlate? l How strongly does a variable relate to the other set’s canonical variate?

10 Canonical Correlation In Canonical correlation, you have two sets of two or more interval-level (scale) variables each and you want to see how differences in one set relate to differences in the other set of variables. With canonical correlation, unlike regression, there is no distinction between independent and dependent variables; they are called by SPSS “Set 1” and “Set 2”. One would use canonical correlation when the variables in each set can be grouped together conceptually, but you want to see if there are particular subsets of them that relate to subsets in the other variable set, so you do not want to sum each set to make an overall score.

11 Canonical Correlation: Conditions and Assumptions ASSUMPTIONS: linearity of relationship (between each variable pair as well as between the variables and the linear composites). Multivariate normality (evaluate univariate normality - because multivariate normality is difficult to assess) Homoscedasticity (evaluate using matrix scatterplot of the canonical variate scores) Multicollinearity (evaluate using matrix scatterplot of the canonical variate scores) CONDITIONS: All variables in canonical correlation must be scale. It is recommended to have at least 10 subjects per variable in order to have adequate power.

12 AssumptionsAssumptions l Multicollinearity Check Set 1 and Set 2 separately n Run correlations and use the collinearity diagnostics function in regular multiple regression l Outliers – Check for both univariate and multivariate outliers on both set 1 and set 2 separately l Multicollinearity Check Set 1 and Set 2 separately n Run correlations and use the collinearity diagnostics function in regular multiple regression l Outliers – Check for both univariate and multivariate outliers on both set 1 and set 2 separately

13 AssumptionsAssumptions l Normality n Univariate – univariate normality is not explicitly required for MMR n Multivariate – multivariate normality is required and there is no way to test for except establishing univariate normality on all variables, even though this is still no guarantee. l Normality n Univariate – univariate normality is not explicitly required for MMR n Multivariate – multivariate normality is required and there is no way to test for except establishing univariate normality on all variables, even though this is still no guarantee.

14 AssumptionsAssumptions l Linearity – linear relationship assumed for all variables in each set and also between sets l Homoscedasticity (that is, when one vari- able exhibits similar amount of variance across the range of values for the other variable) – needs to be checked for all pairs of variables within and between sets. l Linearity – linear relationship assumed for all variables in each set and also between sets l Homoscedasticity (that is, when one vari- able exhibits similar amount of variance across the range of values for the other variable) – needs to be checked for all pairs of variables within and between sets.

15 Example: Canonical Correlation File: hsbdataNew.sav First, download the files “Canonical correlation.sps”, “Canonical Corr_MANOVA_Syntax1” and “Canonical Corr_Syntax2” to C: drive. Open Canonical correlation.sps and Run it, by selecting the entire contents. In the same worksheet, Open the data file hsbdataNew.sav Open File – New – Syntax Copy the contents of Canonical Corr_MANOVA_Syntax1. Highlight/Select the contents and Run. We get the output for checking the assumptions of Canonical Correlation. Open File – New – Syntax Copy the contents of Canonical Corr_Syntax2. Highlight/Select the contents and Run. We get the output for Canonical Correlation analysis.

16 Example: Canonical Correlation File: hsbdataNew.sav

17 Canonical Correlation (Cont.) Canonical Correlation (Cont.)

18

19

20

21


Download ppt "Canonical Correlation. Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of."

Similar presentations


Ads by Google