Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.

Slides:



Advertisements
Similar presentations
Chapter 18: The Chi-Square Statistic
Advertisements

CHI-SQUARE(X2) DISTRIBUTION
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Basic Statistics The Chi Square Test of Independence.
Hypothesis Testing IV Chi Square.
Statistical Inference for Frequency Data Chapter 16.
Chapter 13: The Chi-Square Test
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Chapter18 Determining and Interpreting Associations Among Variables.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
BCOR 1020 Business Statistics
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Statistics for the Behavioral Sciences (5 th ed.) Gravetter & Wallnau Chapter 17 The Chi-Square Statistic: Tests for Goodness of Fit and Independence University.
Chapter 16 Data Analysis: Frequency Distribution, Hypothesis Testing, and Cross-Tabulation.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Chapter SixteenChapter Sixteen. Figure 16.1 Relationship of Frequency Distribution, Hypothesis Testing and Cross-Tabulation to the Previous Chapters and.
© 2009 Pearson Education, Inc publishing as Prentice Hall 16-1 Chapter 16 Data Analysis: Frequency Distribution, Hypothesis Testing, and Cross-Tabulation.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
1 Chi-Square Heibatollah Baghi, and Mastee Badii.
Frequency Distribution, Cross-Tabulation and Hypothesis Testing 15.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Chapter 16 The Chi-Square Statistic
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
MARKETING RESEARCH CHAPTER
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chi Square Classifying yourself as studious or not. YesNoTotal Are they significantly different? YesNoTotal Read ahead Yes.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Lecture 2 Frequency Distribution, Cross-Tabulation, and Hypothesis Testing.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Leftover Slides from Week Five. Steps in Hypothesis Testing Specify the research hypothesis and corresponding null hypothesis Compute the value of a test.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chapter 12 Chi-Square Tests and Nonparametric Tests.
Ch 13: Chi-square tests Part 2: Nov 29, Chi-sq Test for Independence Deals with 2 nominal variables Create ‘contingency tables’ –Crosses the 2 variables.
Statistics 300: Elementary Statistics Section 11-3.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Determining and Interpreting Associations between Variables Cross-Tabs Chi-Square Correlation.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 9 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Introduction to Marketing Research
Basic Statistics The Chi Square Test of Independence.
Test of independence: Contingency Table
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Chapter Fifteen McGraw-Hill/Irwin
Making Use of Associations Tests
Community &family medicine
Qualitative data – tests of association
The Chi-Square Distribution and Test for Independence
Chapter 10 Analyzing the Association Between Categorical Variables
Making Use of Associations Tests
Chapter Fifteen Frequency Distribution, Cross-Tabulation, and
Chapter 18: The Chi-Square Statistic
Chapter Outline Goodness of Fit test Test of Independence.
Presentation transcript:

Cross Tabulation and Chi-Square Testing

Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes two or more variables simultaneously. Cross-tabulation results in tables that reflect the joint distribution of two or more variables with a limited number of categories or distinct values, e.g., Table Copyright © 2012 Pearson Education, Inc. Chapter

Chapter Copyright © 2012 Pearson Education, Inc. Usage GENDER FemaleMaleRaw Total Light Users14519 Medium Users5510 Heavy Users51116 Column Total2421

Two Variables Cross-Tabulation Since two variables have been cross classified, percentages could be computed either column wise, based on column totals (Table 16.4), or row wise, based on row totals (Table 16.5). The general rule is to compute the percentages in the direction of the independent variable, across the dependent variable. The correct way of calculating percentages is as shown in Table Copyright © 2012 Pearson Education, Inc. Chapter

Usage GENDER FemaleMale Light Users58.4%23.8% Medium Users20.8%23.8% Heavy Users20.8%52.4% Column Total100.0% Chapter Copyright © 2012 Pearson Education, Inc.

Usage GENDER FemaleMaleRaw Total Light Users73.7%26.3%100.0% Medium Users50.0% 100.0% Heavy Users31.2%68.8%100.0% Chapter Copyright © 2012 Pearson Education, Inc.

Statistics Associated with Cross- Tabulation Chi-Square To determine whether a systematic association exists, the probability of obtaining a value of chi-square as large or larger than the one calculated from the cross-tabulation is estimated. An important characteristic of the chi-square statistic is the number of degrees of freedom (df) associated with it. That is, df = (r - 1) x (c -1). The null hypothesis (H 0 ) of no association between the two variables will be rejected only when the calculated value of the test statistic is greater than the critical value of the chi- square distribution with the appropriate degrees of freedom, as shown in Figure Copyright © 2012 Pearson Education, Inc. Chapter

 2 Level of Significance, α Chapter Copyright © 2012 Pearson Education, Inc.

The chi-square statistic (  2 ) is used to test the statistical significance of the observed association in a cross- tabulation. The expected frequency for each cell can be calculated by using a simple formula: where = total number in the row = total number in the column = total sample size Chapter Copyright © 2012 Pearson Education, Inc.

For the data in Table 16.3, for the six cells from left to right and top to bottom = (24 x 19)/45 = 10.1 = (21 x 19)/45 =8.9 = (24 x 10)/45 = 5.3 = (21 x 10)/45 =4.7 = (24 x 16)/45 = 8.5 = (21 x 16)/45 =7.5 Chapter Copyright © 2012 Pearson Education, Inc.

= ( ) 2 + (5 – 8.9) (5 – 5.3) 2 + (5 – 4.7) (5 – 8.5) 2 + (11 – 7.5) = = 6.33 Chapter Copyright © 2012 Pearson Education, Inc.

Statistics Associated with Cross-Tabulation Chi-Square (Cont.)  The chi-square distribution is a skewed distribution whose shape depends solely on the number of degrees of freedom. As the number of degrees of freedom increases, the chi-square distribution becomes more symmetrical.  Table 3 in the Statistical Appendix contains upper-tail areas of the chi-square distribution for different degrees of freedom. For 2 degrees of freedom, the probability of exceeding a chi-square value of is  For the cross-tabulation given in Table 16.3, there are (3-1) x (2-1) = 2 degrees of freedom. The calculated chi-square statistic had a value of Since this is greater than the critical value of 5.991, the null hypothesis of no association is rejected indicating that the association is statistically significant at the 0.05 level. Copyright © 2012 Pearson Education, Inc. Chapter

Statistics Associated with Cross-Tabulation Phi Coefficient  The phi coefficient (  ) is used as a measure of the strength of association in the special case of a table with two rows and two columns (a 2 x 2 table).  The phi coefficient is proportional to the square root of the chi-square statistic:  It takes the value of 0 when there is no association, which would be indicated by a chi-square value of 0 as well. When the variables are perfectly associated, phi assumes the value of 1 and all the observations fall just on the main or minor diagonal. Copyright © 2012 Pearson Education, Inc. Chapter

Statistics Associated with Cross-Tabulation Contingency Coefficient While the phi coefficient is specific to a 2 x 2 table, the contingency coefficient (C) can be used to assess the strength of association in a table of any size. The contingency coefficient varies between 0 and 1. The maximum value of the contingency coefficient depends on the size of the table (number of rows and number of columns). For this reason, it should be used only to compare tables of the same size. Copyright © 2012 Pearson Education, Inc. Chapter

Statistics Associated with Cross- Tabulation Cramer’s V  Cramer's V is a modified version of the phi correlation coefficient, , and is used in tables larger than 2 x 2. or. Chapter Copyright © 2012 Pearson Education, Inc.

Cross-Tabulation in Practice While conducting cross-tabulation analysis in practice, it is useful to proceed along the following steps:  Test the null hypothesis that there is no association between the variables using the chi-square statistic. If you fail to reject the null hypothesis, then there is no relationship.  If H 0 is rejected, then determine the strength of the association using an appropriate statistic (phi-coefficient, contingency coefficient, or Cramer's V), as discussed earlier.  If H 0 is rejected, interpret the pattern of the relationship by computing the percentages in the direction of the independent variable, across the dependent variable. Draw marketing conclusions. Copyright © 2012 Pearson Education, Inc. Chapter

Construct the Cross-Tabulation Data Calculate the Chi-Square Statistic, Test the Null Hypothesis of No Association Reject H 0 ? Interpret the Pattern of Relationship by Calculating Percentages in the Direction of the Independent Variable No Association Determine the Strength of Association Using an Appropriate Statistic NOYES Chapter Copyright © 2012 Pearson Education, Inc.