Download presentation
Presentation is loading. Please wait.
Published byIlene Miles Modified over 9 years ago
1
Multivariate Descriptive Research In the previous lecture, we discussed ways to quantify the relationship between two variables when those variables are continuous. What do we do when one or more of the variables is categorical?
2
Categorical Variables Fortunately, this situation is much easier to deal with because we can use the same techniques that we’ve discussed already. Let’s consider a situation in which we are interested in how one continuous variable varies as a function of a categorical variable. Example: How does mood vary as a function of sex (male vs. female)?
3
In this case, we want to know how the average woman’s score compares to that of the average man’s score. level of a categorical variable
4
ParticipantsMood score Males A4 B3 C4 D3 M = 3.5, SD =.5 Females A5 B4 C5 D4 M = 4.5, SD =.5 First, find the average score for each level of the categorical variable separately. (Also find the SD.) Second, find the difference between the means of each group. This is called a mean difference. (4.5 – 3.5 = 1.0) Third, express this mean difference relative to the SD. This is called a standardized mean difference. 1/.5 = 2 In this example, women score 2 SD higher than the men.
5
ParticipantsMood score Males A4 B3 C4 D3 M = 3.5, SD =.5 Females A5 B4 C5 D3 M = 4.25, SD =.83 Note: If the SD’s for the two groups are different, you can simply average the two SD’s. Here, the two SD’s are.5 and.83. Averaged, these are (.5 +.83)/2 =.66. The standardized mean difference is (4.25 – 3.5)/.66 =.75/.66 = 1.13 Thus, on average, women score 1.13 SD’s higher than men on this mood variable.
6
Cohen’s d If we divide the mean difference by the average SD of the two groups, we obtain a standardized mean difference or Cohen’s d. Pooled standard deviation
7
Bargraph
8
Bargraph: More than two categorical variables
9
Both variables are categorical When two variables are categorical, it is sometimes most useful to express the data as percentages. Example: Let’s assume that depression is a categorical variable, such that some people are depressed and others are not. What is the relationship between biological sex and depression?
10
Depression status SexNot DepressedDepressedrow total Male60060660 Female40300340 column total6403601000
11
Depression status SexNot DepressedDepressedrow total Male.60.06.66 Female.04.30.34 column total.64.361.00 In this table, we’ve expressed each cell as a proportion of the total.
12
Depression status SexNot DepressedDepressedrow total Male.60.06.66 Female.04.30.34 column total.64.361.00.60/.64 =.94.06/.36 =.16 Here, we’ve expressed the association with respect to sex. For example, we can see here that 16% of people who are depressed are male. Moreover, 94% of people who are not depressed are male.
13
Depression status SexNot Depressed Depressedrow total Male.60.06.66.06/.66 =.09 Female.04.30.34.30/.34 =.88 column total.64.361.00 Here, we’ve expressed the association with respect to depression status. For example, we can see here that 9% of men are depressed and 88% of women are depressed.
14
Phi It is possible to quantify the association among these variables using a correlation coefficient when the two variables are binary. This statistic is sometimes referred to as phi. (Phi is +.78 in this example)
15
Variable 1 Variable 201row total 0abn3 1cdn4 Col totaln1n2 Phi = (a*d) – (b*c) / sqrt(n1*n2*n3*n4) Online calculator at: http://www.quantitativeskills.com/sisa/statistics/twoby2.htm
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.