

1 Statistics 303 Chapter 9 Two-Way Tables

2 Relationships Between Two Categorical Variables
–Depending on the situation, one of the variables is the explanatory variable and the other is the response variable.
–In this case, we look at the percentages of one variable for each level of the other variable.
–Examples: Gender and Soda Preference; Country of Origin and Marital Status; Smoking Habits and Socioeconomic Status

3 Relationships Between Two Categorical Variables
–A two-way table can summarize the data for relationships between two categorical variables.
–Example: Gender and Highest Degree Obtained

4 SPSS OUTPUT Example: Percents

5 Review of Two-Way Tables
Two-way tables come about when we are interested in the relationship between two categorical variables.
–One of the variables is the row variable.
–The other is the column variable.
–The combination of a row variable and a column variable is a cell.

6 Review of Two-Way Tables
Example (labels from the slide's figure): row variable, column variable, cells, row totals, column totals, overall total

7 Chi-Squared Test for Independence
To test whether or not there is a relationship between the row variable and the column variable, we use the chi-square statistic (X²), which can be calculated by computer.
–The null hypothesis (H0) is that there is no relationship between the two variables, i.e. the variables are independent.
–The alternative hypothesis (HA) is that there is a relationship, i.e. the variables are not independent.
–For 2x2 tables, we require that all four expected cell counts be 5 or more.
–For tables larger than 2x2, we will use the chi-square approximation whenever the average of the expected counts is 5 or more and the smallest expected count is 1 or more.
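The expected-count conditions above can be checked mechanically. The following is a minimal sketch, assuming the standard formula expected count = row total × column total / overall total (the slides do not spell it out). The function names and both example tables are hypothetical and made up purely for illustration.

```python
def expected_counts(table):
    """Expected count for each cell: row total * column total / overall total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def approximation_ok(table):
    """Apply the rule of thumb from the slide to a table of observed counts."""
    expected = [e for row in expected_counts(table) for e in row]
    if len(table) == 2 and len(table[0]) == 2:
        # 2x2 table: all four expected counts must be 5 or more.
        return min(expected) >= 5
    # Larger table: average expected count >= 5 and smallest >= 1.
    return sum(expected) / len(expected) >= 5 and min(expected) >= 1


# Hypothetical counts, not taken from the slides.
small_table = [[3, 2], [4, 6]]
larger_table = [[20, 15, 5], [10, 25, 25]]

print(approximation_ok(small_table))
print(approximation_ok(larger_table))
```

Note that the check uses expected counts, not observed counts, so a table can contain a small observed cell and still satisfy the conditions.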

8 Chi-Squared Test for Independence
A comparison of the proportion of “successes” in two populations leads to a 2x2 table. We can compare two population proportions either by the chi-square test or by the two-sample z test from section 8.2. For a two-sided alternative, these tests always give exactly the same result: the chi-square statistic is equal to the square of the z statistic, and χ²(1) critical values are equal to the squares of the corresponding N(0,1) critical values.
–Advantage of the z test: we can test either one-sided or two-sided alternatives, whereas the chi-square test always tests the two-sided alternative.
–Advantage of the chi-square test: we can compare more than two populations, whereas the z test compares only two.
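The identity between the two statistics can be verified numerically. This sketch uses the blood-pressure counts from the diabetes example later in these slides and assumes the z test uses the pooled standard error and the chi-square statistic is computed without a continuity correction. The variable names are illustrative only.

```python
import math

# Counts from the diabetes example in these slides
# (patients with controlled blood pressure / group size).
x1, n1 = 475, 1131   # non-diabetic patients
x2, n2 = 807, 4036   # diabetic patients

p1, p2 = x1 / n1, x2 / n2
pooled = (x1 + x2) / (n1 + n2)

# Two-sample z statistic with the pooled standard error (assumed here).
z = (p1 - p2) / math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))

# Chi-square statistic for the same data arranged as a 2x2 table.
table = [[x1, n1 - x1], [x2, n2 - x2]]
row_totals = [sum(r) for r in table]
col_totals = [sum(c) for c in zip(*table)]
n = sum(row_totals)
chi2 = sum(
    (table[i][j] - row_totals[i] * col_totals[j] / n) ** 2
    / (row_totals[i] * col_totals[j] / n)
    for i in range(2)
    for j in range(2)
)

print(round(z, 3), round(z ** 2, 3), round(chi2, 3))
```

The squared z statistic and the chi-square statistic agree up to floating-point rounding, which is why the two-sided z test and the chi-square test reach the same conclusion on a 2x2 table.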

9 Chi-Squared Test for Independence
The chi-square statistic compares the observed cell counts with the expected cell counts: it measures how much the observed cell counts in a two-way table diverge from the expected cell counts. If the expected counts and the observed counts are very different, a large value of X² will result. Large values of X² provide evidence against the null hypothesis.
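For reference, the usual textbook definition of the statistic is sketched below. The slides themselves do not write out the formula, so the notation here (the sum over all cells, n for the overall total) is an assumption based on the standard Pearson chi-square statistic.

```latex
X^2 = \sum_{\text{all cells}}
      \frac{(\text{observed count} - \text{expected count})^2}{\text{expected count}},
\qquad
\text{expected count} = \frac{\text{row total} \times \text{column total}}{n}
```

Each term is non-negative, so X² is zero only when every observed count equals its expected count, and it grows as the table moves away from what independence would predict.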

10 Chi-Square Test
Like the t distributions, the χ² distributions are described by a single parameter, the degrees of freedom (df). The degrees of freedom for the chi-square test are df = (r – 1)(c – 1) = (#rows – 1)(#columns – 1). For a 2x2 table, we have df = (2 – 1)(2 – 1) = 1. The p-value, P(χ² ≥ X²), is determined by looking in Table F. Notice that Table F gives probabilities to the right. Also note that χ² distributions take only positive values and are skewed to the right.
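As an alternative to a printed table, the right-tail probability can be computed directly. The sketch below is limited to df = 1 and df = 2, where closed forms exist using only the Python standard library. The function names are hypothetical, and the inputs 3.841 and 5.991 are the standard 0.05 critical values for df = 1 and df = 2, used here only as a sanity check.

```python
import math


def degrees_of_freedom(n_rows, n_cols):
    """df = (#rows - 1) * (#columns - 1) for a two-way table."""
    return (n_rows - 1) * (n_cols - 1)


def chi2_upper_tail(x2, df):
    """P(chi-square with df degrees of freedom >= x2), for df = 1 or 2 only."""
    if df == 1:
        # A chi-square(1) variable is the square of a standard normal.
        return math.erfc(math.sqrt(x2 / 2))
    if df == 2:
        # A chi-square(2) variable is exponential with mean 2.
        return math.exp(-x2 / 2)
    raise ValueError("this sketch only handles df = 1 or df = 2")


p_df1 = chi2_upper_tail(3.841, degrees_of_freedom(2, 2))
p_df2 = chi2_upper_tail(5.991, degrees_of_freedom(3, 2))
print(round(p_df1, 3), round(p_df2, 3))
```

For other degrees of freedom, a statistics package or the table remains the practical choice.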

11 Analysis in SPSS gives us: The p-value is 0.103. Because this is larger than 0.05, we fail to reject H0: there is not sufficient evidence of a relationship between gender and tomato enjoyment. We are interested in this row:

12 Link between Diabetes and Heart Disease?
Background: contradictory opinions:
1. A diabetic’s risk of dying after a first heart attack is the same as that of someone without diabetes. There is no link between diabetes and heart disease.
vs.
2. Diabetes takes a heavy toll on the body, and diabetes patients often suffer heart attacks and strokes or die from cardiovascular complications at a much younger age.
So we use a hypothesis test based on the latest data to see which conclusion is right. There are a total of 5167 managed-care patients, of whom 1131 are non-diabetics and 4036 are diabetics. Among the non-diabetic patients, 42% had their blood pressure properly controlled (475 of 1131), while among the diabetic patients only 20% had their blood pressure controlled (807 of 4036).

13 Link between Diabetes and Heart Disease? Data

              Controlled  Uncontrolled  Total
Non-diabetes         475           656   1131
Diabetes             807          3229   4036
Total               1282          3885   5167
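The percentages quoted on the previous slide are row percentages of this table: each cell divided by its row total. A small sketch of that calculation follows; the dictionary layout and variable names are illustrative choices, not part of the original slides.

```python
# Observed counts from the table above.
rows = {
    "Non-diabetes": {"Controlled": 475, "Uncontrolled": 656},
    "Diabetes": {"Controlled": 807, "Uncontrolled": 3229},
}

# Row percentages: the distribution of the response (blood pressure control)
# within each level of the explanatory variable (diabetes status).
row_percents = {}
for group, counts in rows.items():
    total = sum(counts.values())
    row_percents[group] = {k: 100 * v / total for k, v in counts.items()}

for group, pct in row_percents.items():
    print(group, {k: round(v, 1) for k, v in pct.items()})
```

Comparing the "Controlled" percentage across the two rows is the descriptive counterpart of the chi-square test that follows.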

14 Link between Diabetes and Heart Disease? Data coding:
Diabetes: 1 = Does not have diabetes, 2 = Has diabetes
Control: 1 = Controlled, 2 = Uncontrolled

15 Link between Diabetes and Heart Disease?

16 Hypothesis test:
1) H0: There is no link between diabetes and heart disease. (There is no relationship between diabetes and heart disease. Diabetes and heart disease are independent.)
2) HA: There is a link between diabetes and heart disease. (There is a relationship between diabetes and heart disease. Diabetes and heart disease are dependent.)
3) Assume a significance level of 0.05.

17 Link between Diabetes and Heart Disease? SPSS Output

18 Link between Diabetes and Heart Disease?
4) The computer gives us a chi-square statistic of 229.268.
5) The computer gives us a p-value of .000 (that is, less than 0.001).
6) Because our p-value is less than alpha, we reject the null hypothesis.
7) There IS sufficient evidence that there is a link between diabetes and heart disease.
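The reported statistic can be reproduced by hand from the observed counts on slide 13. This is a sketch in plain Python, not the SPSS procedure itself; it assumes the Pearson chi-square without continuity correction, and the p-value line relies on a closed form that is valid only for df = 1.

```python
import math

# Observed counts: rows = non-diabetes, diabetes;
# columns = controlled, uncontrolled.
observed = [[475, 656], [807, 3229]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Sum (observed - expected)^2 / expected over all four cells.
chi2 = 0.0
for i, row in enumerate(observed):
    for j, obs in enumerate(row):
        expected = row_totals[i] * col_totals[j] / n
        chi2 += (obs - expected) ** 2 / expected

df = (len(observed) - 1) * (len(observed[0]) - 1)
p_value = math.erfc(math.sqrt(chi2 / 2))  # upper tail, valid for df = 1 only
reject_h0 = p_value < 0.05

print(round(chi2, 3), df, reject_h0)
```

The p-value is far smaller than 0.001, which is why the software output displays it as .000.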

19 Is there a relationship between exposure to R-rated movies and adolescent smoking?
The study attempted to examine the relationship between exposure to R-rated movies and smoking habits among adolescents. Smoking is more prevalent in R-rated movies than in any other movie-rating category. Therefore, the objective of this study was to determine whether an association existed between parental restrictions on movies and adolescent cigarette use.

                      Smoking  Non-smoking  Total
Complete Restriction       14          701    715
Partial Restriction       288         2114   2402
No Restriction            499          928   1427
Total                     801         3743   4544

20 Is there a relationship between exposure to R-rated movies and adolescent smoking?

21 Hypothesis Test
1) H0: There is no relationship between exposure to R-rated movies and tobacco use among adolescents.
2) HA: There is a relationship between exposure to R-rated movies and tobacco use among adolescents.
3) alpha = 0.05

22 SPSS Output
4) The computer gives us a chi-square test statistic of 469.003.
5) The computer output gives us a p-value of 0.000 (that is, less than 0.001).

23 Is there a relationship between exposure to R-rated movies and adolescent smoking?
6) Decision Rule:
–If p-value ≤ alpha, we reject H0.
–If p-value > alpha, we fail to reject H0.
Because our p-value is less than our significance level (alpha), we reject the null hypothesis.
7) Because we rejected H0, we can conclude that there IS significant evidence that a relationship between exposure to R-rated movies and adolescent tobacco use exists.
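The whole test for this 3x2 table, including the decision rule in step 6, can be sketched as follows. The function name is hypothetical, the statistic is the Pearson chi-square computed from the observed counts on slide 19, and the p-value uses a closed form that holds only for df = 2. It is an illustration, not a reproduction of the SPSS output.

```python
import math


def chi_square_statistic(observed):
    """Pearson chi-square: sum of (observed - expected)^2 / expected."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    total = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            total += (obs - expected) ** 2 / expected
    return total


# Rows: complete, partial, no restriction; columns: smoking, non-smoking.
observed = [[14, 701], [288, 2114], [499, 928]]

chi2 = chi_square_statistic(observed)
df = (len(observed) - 1) * (len(observed[0]) - 1)
p_value = math.exp(-chi2 / 2)  # upper tail, valid for df = 2 only

alpha = 0.05
decision = "reject H0" if p_value <= alpha else "fail to reject H0"
print(round(chi2, 3), df, decision)
```

With three rows and two columns, df = 2, so the z test is no longer available and the chi-square test is the natural choice here.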

