# Contingency tables enable us to compare one characteristic of the sample, e.g. degree of religious fundamentalism, for groups or subsets of cases defined.

## Presentation on theme: "Contingency tables enable us to compare one characteristic of the sample, e.g. degree of religious fundamentalism, for groups or subsets of cases defined."— Presentation transcript:

Contingency tables enable us to compare one characteristic of the sample, e.g. degree of religious fundamentalism, for groups or subsets of cases defined by another categorical variable, e.g. gender. A contingency table, which SPSS calls a cross-tabulated table is shown below: 4/1/2017

Each cell in the table represents a combination of the characteristics associated with the two variables: 29 males were also fundamentalists. 42 females were fundamentalists. While a larger number of females were fundamentalist, we cannot tell if females were more likely to be fundamentalist because the total number of females (146) was different from the total number of males (107). To answer the “more likely” question, we need to compare percentages. 4/1/2017

percentage of the total number of cases
There are three percentages that can be calculated for a contingency table: percentage of the total number of cases percentage of the total in each row percentage of the total in each column Each of the three percentages provide different information and answer a different question. 4/1/2017

The percentage of the total number of cases is computed by dividing the number in each cell (e.g. 29, 42, etc.) by the total number of cases (253). 11.5% of the cases were both male and fundamentalist. 16.6% of the cases were both female and fundamentalist. We have two clues that the table contains total percentages. First, the rows that the percentages are on are labeled “% of Total.” Second, the 100% figure appears ONLY in the grand total cell beneath the table total of 253. 4/1/2017

The percentage of the total for each row is computed by dividing the number in each cell (e.g. 29, 42) by the total for the row (71). 40.8% of the fundamentalists were male. 59.2% of the fundamentalists were female. The label for the percentage tells us that it is computed within the category for fundamentalist. The percentages in each row sums to 100% in the total column for rows (the row margin). 4/1/2017

The percentage of the total for each column is computed by dividing the number in each cell (e.g. 29, 36, and 42) by the total for the column (107). 27.1% of the males were fundamentalists. 33.6% of the males were moderates. The label for the percentage tells us that it is computed within the category for sex. The percentage in each column sums to 100% in the total row for columns (the column margin). 4/1/2017

The three percentages tell us:
the percent that is in both categories (total percentage) the percent of each row that is found in each of the column categories (row percentages) the percent of each column that is found in each of the row categories (column percentages) The row and column percentages are referred to as conditional or contingent percentages. 4/1/2017

The three percentages tell us:
the percent that is in both categories (total percentage) the percent of each row that is found in each of the column categories (row percentages) the percent of each column that is found in each of the row categories (column percentages) The row and column percentages are referred to as conditional or contingent percentages. 4/1/2017

Our real interest is in conditional or contingent percentages because these tell us about the relationship between the variables. The relationship between variables is defined by a distinct role for each: the variable which affected or impacted by the other is the dependent variable the variable which affects or impacts the other is the independent variable We assign the role to the variable. An independent variable in one analysis may be a dependent variable in another analysis. 4/1/2017

A categorical variable has a relationship to another categorical variable if the probability of being in one category of the dependent variable differs depending on the category of the independent variable. For example, if there is a relationship between social class and college attendance, the percentage of upper class persons who attend college will be different from the percentage of middle class persons who attend college. Attending college is the dependent variable and social class is the independent variable. 4/1/2017

Given that we can represent this statistically with either the row or column percentages in a contingency table, my practice is to always put the independent variable in the columns and the dependent variable in the rows, and compute column percentages. This order matches the order for many graphics where the dependent variable is on the vertical axis and the independent variable is on the horizontal axis. 4/1/2017

Based on the column percentages, we can make statements like the following:
Males were most likely to be liberal (39.3%), while females were most likely to be moderate (45.5%). 4/1/2017

Based on the column percentages, we can make statements like the following:
Males were more likely to be liberal (39.3%) compared to females (26.7%). This is not equivalent to the statement that liberals are more likely to be male or female. 4/1/2017

We can also describe a relationship based on a comparison of odds
We can also describe a relationship based on a comparison of odds. First, we compute the odds separately for each category of the independent variable: The odds that a male would be a liberal rather than a fundamentalist are 42 ÷ 29 = 1.45. The odds that a female would be liberal rather than fundamentalist are 39 ÷ 42 = .93. 4/1/2017

We compare the odds by computing the ratio between the two: 1
We compare the odds by computing the ratio between the two: for males ÷ .93 for females = an odds ratio of 1.56. We can now state the relationship between the two variables as: males are 1.56 times more likely to be liberal rather than fundamentalist, than are females. This could also be stated as: being male increases the odds of being liberal rather than fundamentalist by a factor of 1.56 or 56%. (1.56 – 1.0 = .56) and multiplying .56 by 100 to convert it to a percent. 4/1/2017

If the odds ratio were 1.0, then both groups would be equally likely to be liberals rather than fundamentalists. We could have divided the odds for females by the odds for males (.93 ÷ 1.45 = .64) and stated that being female decreased the odds of being liberal versus fundamentalist by a factor of .64, or 36%. (.64 – 1.00 = .36) and multiplying .36 by 100 to convert it to a percent. Explaining decreases in odds is more awkward and less intuitive. 4/1/2017

The introductory statement in the question indicates:
The data set to use (GSS200R) The statistic to use (contingency table) The variable to use in the rows of the table (attitude toward life ) The variable to use in the columns of the table (sex) 4/1/2017

The first statement for us to evaluate concerns the number of valid and missing cases. To answer this question, we produce the contingency table in SPSS. 4/1/2017

To compute a contingency table in SPSS, select the Descriptive Statistics > Crosstabs command from the Analyze menu. 4/1/2017

First, move the row variable life to the Row(s) list box.
Second, move the column variable sex to the Column(s) list box. Third, click on Cells button to specify what should be printed in each cell of the table. 4/1/2017

Second, click on the Continue button to close the dialog box.
First, mark the check boxes for Column and Total percentages. 4/1/2017

After returning to the Crosstabs dialog box, click on the OK button to produce the output.
4/1/2017

The SPSS output provides us with the answer to the question on sample size.
The 'Case Processing Summary' in the SPSS output showed the total number of valid cases to be 186 and the number of missing cases to be 84. 4/1/2017

The 'Case Processing Summary' in the SPSS output showed the total number of valid cases to be 186 and the number of missing cases to be 84. Click on the check box to mark the statement as correct. 4/1/2017

The next statement asks which combination of characteristics was most common. The key word and tell us that the problem is looking for total percentages, i.e. the percentage that has both characteristics. 4/1/2017

The largest total percentage (30
The largest total percentage (30.1%) in the contingency table was in the cell for the column labeled FEMALE and the row labeled ROUTINE. 4/1/2017

The statement that "more survey respondents were female and said that they generally find life pretty routine than any other combination of categories" is correct. The check box for the first statement is marked. Since this precludes the second statement from being marked, its checkbox is left unmarked. 4/1/2017

The next pair of statements asks us to compare the groups, and identify which group was more likely (had a larger proportion) with the specified characteristic. 4/1/2017

The column percent for survey respondents who were female was 54
The column percent for survey respondents who were female was 54.4%, which was larger than the column percent of 44.6 for survey respondents who were male. 4/1/2017

The statement that "compared to survey respondents who were male, those who were female were more likely to have said that they generally find life pretty routine" is correct. The check box for the first statement is marked. Since this precludes the second statement from being marked, its checkbox is left unmarked. 4/1/2017

The next pair of questions gives two options for the most likely response for each group. The first option identifies alternatives for the most likely response. The second option states that both groups have the same most likely responses. The question of which response was most likely for each group requires that we identify the mode for each group. 4/1/2017

The category EXCITING had the largest percentage of cases (49
The category EXCITING had the largest percentage of cases (49.4%), making it the modal category for survey respondents who were male. 4/1/2017

The category ROUTINE had the largest percentage of cases (54
The category ROUTINE had the largest percentage of cases (54.4%), making it the modal category for females. 4/1/2017

The statement that "survey respondents who were male were most likely to have said that they generally find life exciting, while survey respondents who were female were most likely to have said that they generally find life pretty routine" is correct The check box for the first statement is marked. Since this precludes the second statement from being marked, its checkbox is left unmarked. 4/1/2017

The final pair of questions requires us to compute the odds of describing life as routine rather than exciting for each group, and then compute the odds ratio to determine which group was more likely to have said life was routine rather than exciting. 4/1/2017

The odds for survey respondents who were female was computed by dividing the ‘Count’ for ROUTINE by the ‘Count' for EXCITING (56÷44=1.27). 4/1/2017

The odds for survey respondents who were male were computed by dividing the ‘Count' for ROUTINE by the ‘Count' for EXCITING (37÷41=0.90). 4/1/2017

I use Excel to do the calculations that I cannot do easily in SPSS.
First, I calculate the odds for each group (sex). Second, I calculate the odds ratio, once for the ratio of group 1 (females) to group 2 (males), and a second time for the ratio of group 2 (males) to group 1 (females). The odds ratio for females to males is 1.4, which corresponds to a greater likelihood for females. 4/1/2017

The statement that "compared to survey respondents who were male, those who were female were about 1.4 times more likely to have said that they generally find life pretty routine" is correct. The odds for survey respondents who were female was computed by dividing the 'Count' for the ROUTINE row by the 'Count' for the EXCITING row in the FEMALE column (56÷44=1.27). The odds for survey respondents who were male was computed by dividing the 'Count' for the ROUTINE row by the 'Count' for the EXCITING row in the MALE column (37÷41=0.90). The odds ratio for survey respondents who were female to survey respondents who were male is 1.27 to 0.90, or 1.41 to 1. The second statement in the pair is marked. 4/1/2017

Homework problems will describe the likelihood as: Examples:
The homework problems translate some of the decimal fractions for odds and odds ratios from numbers to text. The following table shows the translations used. If the odds are: Homework problems will describe the likelihood as: Examples: 0.95 through 1.05 about equally likely 0.95,0.96,0.97,0.98,0.99,1.00,1.01,1.02,1.03,1.04,1.05 1.95 through 2.05 about twice as likely 1.95,1.96,1.97,1.98,1.99,2.00,2.01,2.02,2.03,2.04,2.05 2.95 through 3.05 about three times as likely 2.95,2.96,2.97,2.98,2.99,3.00,3.01,3.02,3.03,3.04,3.05 3.95 through 4.05 about four times as likely 3.95,3.96,3.97,3.98,3.99,4.00,4.01,4.02,4.03,4.04,4.05 4.95 through 5.05 about five times as likely 4.95,4.96,4.97,4.98,4.99,5.00,5.01,5.02,5.03,5.04,5.05 5.95 through 6.05 about six times as likely 5.95,5.96,5.97,5.98,5.99,6.00,6.01,6.02,6.03,6.04,6.05 6.95 through 7.05 about seven times as likely 6.95,6.96,6.97,6.98,6.99,7.00,7.01,7.02,7.03,7.04,7.05 7.95 through 8.05 about eight times as likely 7.95,7.96,7.97,7.98,7.99,8.00,8.01,8.02,8.03,8.04,8.05 8.95 through 9.05 about nine times as likely 8.95,8.96,8.97,8.98,8.99,9.00,9.01,9.02,9.03,9.04,9.05 9.05 through 10.05 about ten times as likely 9.95,9.96,9.97,9.98,9.99,10.00,10.01,10.02,10.03,10.04,10.05 and so on… 4/1/2017

If the decimal fraction for the odds is:
The homework problems translate some of the decimal fractions for odds and odds ratios from numbers to text. The following table shows the translations used. If the decimal fraction for the odds is: Homework problems will describe the likelihood as: Examples: 0.20 through 0.30 and a quarter times more likely 3.21 three and a quarter times more likely Greater than 0.30 and less than 0.37 and a third times more likely 3.36 three and a third times more likely 0.45 through 0.55 and a half times more likely 3.49 three and a half times more likely Greater than 0.63 and less than 0.70 and two thirds times more likely 3.69 three and two thirds times more likely 0.70 through 0.80 and three quarter times more likely 3.70 three and three quarters times more likely otherwise reported as a number rounded to one decimal place 3.42 3.4 times more likely 4/1/2017

After BlackBoard grades the assignment, it will give you an option to review the results.
For this problem, we received the full 10 points because we marked all of the correct answers and did not mark any of the incorrect answers. Note: this version of BlackBoard does not give partial credit. 4/1/2017