Presentation is loading. Please wait.

Presentation is loading. Please wait.

Canonical Correlation/Regression. AKA multiple, multiple regression AKA multivariate multiple regression Have two sets of variables (Xs and Ys) Create.

Similar presentations


Presentation on theme: "Canonical Correlation/Regression. AKA multiple, multiple regression AKA multivariate multiple regression Have two sets of variables (Xs and Ys) Create."— Presentation transcript:

1 Canonical Correlation/Regression

2 AKA multiple, multiple regression AKA multivariate multiple regression Have two sets of variables (Xs and Ys) Create a pair of canonical variates – a 1 X 1 + a 2 X 2 +.... + a p X p, and – b 1 Y 1 + b 2 Y 2 +.... + b q Y m Such that the correlation between the canonical variates is as large as possible.

3 Extract a Second Pair From the (residual) variance not captured in the first pair, create a second pair of canonical variates. Continue like this, creating additional pairs of canonical variates, until –You reach a cutoff threshold based on the significance of the correlation –You have created the max number of pairs, which = the smaller of m or p

4 What is a Canonical Variate? A weighted linear combination of variables You can think of it as –Something (a superordinate variable) you have created from several variables, or –An estimate of an construct, a latent variable, a dimension that causes variance in the observed variables.

5 What is a Canonical Correlation? It is the simple correlation between one canonical variate of the X variables and the corresponding canonical variate of the Y variables. We can test the significance of the first canonical correlation all by itself (Roy’s maximum root). And the set of canonical correlations from the n th through the >n th.

6 Eigenvalues For each pair of canonical variates, the eigenvalue is equal to the ratio of the squared canonical correlation (explained variance in the canonical variate) to one minus the squared canonical correlation (unexplained variance in the canonical variate).

7 Eigenvalues An eigenvalue of 1 would be obtained if the squared canonical correlation was.5 – the proportion of variance explained would be equal to the proportion of variance not explained.

8 Eigenvalues An eigenvalue of 1/3 would be obtained if the squared canonical correlation was.25 – the proportion of unexplained variance would be three times the proportion of explained variance. An eigenvalue of 3 would be obtained if the squared canonical correlation was.75 – the proportion of explained variance would be three times the proportion of unexplained variance.

9 Redundancy We measure how much variance each canonical variance extracts from its own set of variables And how much variance each canonical variate explains in the variables of the other canonical variate.

10 R-Squared We also obtain, for each variable, the r 2 for predicting that variable from the first canonical variate of the other set of variables And the R 2 for predicting that variable from the first and second canonical variates of the other set of variables And so on.

11 Patel, Long, McCammon, & Wuensch (1995) Male college students Xs = Personality variables (MMPI) –PD (psychopathically deviant, Scale 4) – social maladjustment and hostility –MF (masculinity/femininity, Scale 5) – in men, low scores = stereotypical masculinity –MA (hypomania, Scale 9) – overactivity, flight of ideas, low frustration tolerance, narcissism, irritability, restlessness, hostility, and difficulty with controlling impulses –Scale K (clinical defensiveness) – low scores = unusually frank.

12 Ys: Homonegativity Variables IAH (Index of Attitudes Towards Homosexuals) –Affective component of “homophobia,” disgust. –High scores – discomfort around homosexuals SBS (self-report behavior scale) –Past negative actions towards male homosexuals –High score – high frequency of such actions.

13 What is This Thing I Have Created or Discovered? Look at the standardized weights used to construct the canonical variate. Even better, look at the loadings –Compute, for each case, a score on the canonical variate. –Correlate those scores with scores on the original variables in its set.

14 The Weights MMPI_1 Femininity-.61 Scale K-.60 Psycho. Dev..43 Hypomania.46 Homoneg_1 SBS.93 IAH.15 Being stereotypically masculine, unusually frank, psycho. deviant, and hypomanic is associated with acting negatively towards gays.

15 The Loadings MMPI_1 = Johnny Pissoff Scale K-.53 Hypomania.53 Femininity-.49 Psycho. Dev..32 Homoneg_1 = Aggressive Homophobia SBS.99 IAH.52 Being unusually frank, hypomanic, stereotypically masculine, and psycho. deviant, is associated with being uncomfortable around and acting negatively towards gays.

16 Weights or Loadings? Like the Beta weights in a multiple regression, the weights for a canonical variate can be deceptive. If two variables within a set are well correlated with each other, one or both weights may be artificially low. I generally prefer to interpret loadings.

17 A Second Pair of Canonical Variates There likely is variance in the variables that was not “captured” by the first pair of canonical variates. We can create a second pair, orthogonal to the first, from that residual variance. The number of pairs of canonical variates we can create is equal to the number of variables in the smaller set.

18 The Second Pair of Weights MMPI_2 Femininity.70 Hypomania.67 Psycho. Dev.-.09 Scale K-.04 Homoneg_2 IAH-1.08 SBS.57 Being unusually feminine and hypomanic is associated with not being uncomfortable around gays but acting negatively towards them anyhow.

19 The Equal Opportunity Bully What are we to make of “not being uncomfortable around gays but acting negatively towards them anyhow.” One student called this “the equal opportunity bully.” He acts negatively towards everybody, gay or straight.

20 The Second Pair of Loadings MMPI_2 = Feminine Hypomania Femininity.76 Hypomania.72 Psycho. Dev..21 Scale K-.08 Psycho. Dev..21 Homoneg_2 = Equal Opportunity Bully IAH-.85 SBS.14 Being unusually feminine and hypomanic is associated with not being uncomfortable around gays.

21 The Canonical Correlations Compute canonical variate scores for each case. Correlate each with its pairmate. Will always be highest for first pair, lower for each subsequent pair. Here, the canonical corrs are.38 and.32. Both were statistically significant.

22 SAS It is now time to jump to the annotated output.annotated output


Download ppt "Canonical Correlation/Regression. AKA multiple, multiple regression AKA multivariate multiple regression Have two sets of variables (Xs and Ys) Create."

Similar presentations


Ads by Google