Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)

Presentation transcript:

Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation) Chapter Sixteen –Basic Probability Theory Concepts –Test of Hypothesis of Independence

Basic Empirical Situation Unit of data. Two nominal scales are measured for each unit. –Example: in an interview study, the sex of the respondent and a variable such as whether or not the subject has a cellular telephone. –Objective is to compare males and females with respect to the fraction that have cellular telephones.

Contingency Table One column for each value of the column variable; C is the number of columns. One row for each value of the row variable; R is the number of rows. R x C contingency table.

Contingency Table Each entry is the OBSERVED COUNT O(i,j) of the number of units having the (i,j) contingency. Column of marginal totals. Row of marginal totals.
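As a rough illustration of observed counts and marginal totals, the sketch below uses the marijuana example from this presentation. The 2 x 2 layout (rows = use at time 4, columns = use at time 3) is an assumption reconstructed from the counts quoted in the PRE slides, and the variable names are illustrative only.

```python
# Observed counts O(i,j).
# Assumed layout: rows = use at time 4 (no, yes), columns = use at time 3 (no, yes).
observed = [
    [120, 9],
    [95, 142],
]

# Column of marginal totals: one total per row.
row_totals = [sum(row) for row in observed]
# Row of marginal totals: one total per column.
col_totals = [sum(col) for col in zip(*observed)]
# Total number of units in the table.
grand_total = sum(row_totals)

print("row totals:", row_totals)
print("column totals:", col_totals)
print("table total:", grand_total)
```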

Basic Hypothesis ASSUME column variable is the independent variable. Hypothesis is independence. That is, the conditional distribution in any column is the same as the conditional distribution in any other column.

Expected Count Basic idea is proportional allocation of observations in a column based on column total. Expected count in the (i,j) contingency: $E(i,j) = (\text{total in row } i) \times (\text{total in column } j) / (\text{total in table})$. Expected count need not be an integer; one expected count for each contingency.
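The expected-count rule can be sketched in a few lines of Python. The function name `expected_counts` is hypothetical, and the example table is an assumption reconstructed from the counts quoted in the PRE slides (rows = use at time 4, columns = use at time 3).

```python
def expected_counts(observed):
    """Return E(i,j) = row i total * column j total / table total."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


# Assumed layout of the marijuana example.
observed = [[120, 9], [95, 142]]
expected = expected_counts(observed)

for row in expected:
    # Expected counts are generally not integers.
    print([round(e, 2) for e in row])
```

The expected counts have the same row and column totals as the observed counts, which is a quick sanity check on any implementation.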

Residual Residual in (i,j) contingency = observed count in (i,j) contingency - expected count in (i,j) contingency. That is, R(i,j)= O(i,j)-E(i,j) One residual for each contingency.

Pearson Chi-squared Component Chi-squared component for the (i,j) contingency = (residual in (i,j) contingency)$^2$ / expected count in (i,j) contingency. That is, $C(i,j) = R(i,j)^2 / E(i,j)$.
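A minimal sketch of the residuals R(i,j) and components C(i,j), assuming the marijuana table is laid out with rows = use at time 4 and columns = use at time 3 (reconstructed from the counts quoted in the PRE slides). The function name is hypothetical.

```python
def residuals_and_components(observed):
    """Return (R, C) where R(i,j) = O(i,j) - E(i,j) and C(i,j) = R(i,j)**2 / E(i,j)."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    residuals, components = [], []
    for i, row in enumerate(observed):
        res_row, comp_row = [], []
        for j, o in enumerate(row):
            e = row_totals[i] * col_totals[j] / n  # expected count E(i,j)
            r = o - e                              # residual R(i,j)
            res_row.append(r)
            comp_row.append(r * r / e)             # component C(i,j)
        residuals.append(res_row)
        components.append(comp_row)
    return residuals, components


observed = [[120, 9], [95, 142]]  # assumed layout
residuals, components = residuals_and_components(observed)
for res_row, comp_row in zip(residuals, components):
    print([round(r, 2) for r in res_row], [round(c, 2) for c in comp_row])
```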

Assessing Pearson Component Rough guides on whether the (i,j) contingency has an excessively large chi-squared component C(i,j), comparing it with a chi-squared distribution on 1 degree of freedom: –the observed significance level of 3.84 is about 0.05 –Of 6.63 is about 0.01 –Of ... is ...
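The observed significance level for a single component can be approximated by treating it as chi-squared with 1 degree of freedom. This is a sketch under that assumption; `p_value_1df` is a hypothetical helper that uses only the standard library.

```python
import math


def p_value_1df(x):
    """Upper-tail probability P(X > x) for a chi-squared variable on 1 df."""
    # A chi-squared variable on 1 df is a squared standard normal Z, so
    # P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(x / 2.0))


for threshold in (3.84, 6.63):
    print(threshold, round(p_value_1df(threshold), 3))
```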

Pearson Chi-Squared Test Sum C(i,j) over all contingencies. Pearson chi-squared test has (R-1)(C-1) degrees of freedom. Under null hypothesis –Expected value of chi-square equals its degrees of freedom. –Variance is twice its degrees of freedom
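The test statistic and its degrees of freedom can be sketched as follows. The table layout is an assumption reconstructed from the counts quoted in the PRE slides (rows = use at time 4, columns = use at time 3), and `pearson_chi_squared` is a hypothetical name.

```python
def pearson_chi_squared(observed):
    """Return (chi-squared statistic, degrees of freedom) for an R x C table."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, o in enumerate(row):
            e = row_totals[i] * col_totals[j] / n  # expected count
            chi2 += (o - e) ** 2 / e               # add component C(i,j)
    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return chi2, df


observed = [[120, 9], [95, 142]]  # assumed layout
chi2, df = pearson_chi_squared(observed)
print(round(chi2, 3), df)
```

For this table the statistic agrees with the value 96.595 used in the phi coefficient slide, which supports the assumed layout.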

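Marijuana Use at Time 4 by Marijuana Use at Time 3 (counts as quoted in the PRE slides) Not using at time 3: 120 not using at time 4, 95 using at time 4; 215 total. Using at time 3: 9 not using at time 4, 142 using at time 4; 151 total. Time 4 totals: 129 not using, 237 using; 366 in the table.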

Contingency Tables Chapter Eighteen –Measures of Association –For nominal variables –For ordinal variables

Measures of Association A measure of association measures the strength of an association –usually a dimensionless number between 0 and 1 in absolute value. –Values near 0 indicate little or no association; values near 1 indicate strong association. The correlation coefficient is a measure of association. The chi-square test statistic is not –it depends on the number of observations.

Measures of Association for Nominal Scale Variables Chi-square based –Phi coefficient –Coefficient of contingency –Cramér's V Proportional reduction in error –Lambda, symmetric –Lambda, asymmetric

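Chi-squared Measure: Phi Coefficient Definition of the phi coefficient: $\phi = \sqrt{\chi^2 / N}$, where $\chi^2$ is the Pearson chi-squared statistic and N is the total number of observations in the table.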

Phi Coefficient Can be greater than one when both variables have more than two categories. N is the total number of observations in the table. For the marijuana at time 3 and time 4 data, the phi coefficient is $(96.595/366)^{0.5} = 0.51$.
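The arithmetic for the marijuana example can be checked directly. This sketch takes the chi-squared value and N from the slide; the function name `phi_coefficient` is illustrative.

```python
import math


def phi_coefficient(chi2, n):
    """Phi = square root of (chi-squared statistic / number of observations)."""
    return math.sqrt(chi2 / n)


phi = phi_coefficient(96.595, 366)
print(round(phi, 2))
```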

Coefficient of Contingency Definition of the coefficient of contingency: $c = \sqrt{\chi^2 / (\chi^2 + N)}$.

Coefficient of Contingency Can never get as large as one. Largest value depends on the number of rows and columns in the table. For the marijuana example, c = 0.46.
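A small sketch of the coefficient of contingency, using the standard definition sqrt(chi2 / (chi2 + N)) together with the chi-squared value (96.595) and N (366) quoted for the marijuana data. The function name is hypothetical.

```python
import math


def contingency_coefficient(chi2, n):
    """Pearson's coefficient of contingency: sqrt(chi2 / (chi2 + n))."""
    return math.sqrt(chi2 / (chi2 + n))


c = contingency_coefficient(96.595, 366)
print(round(c, 2))
```

Because the denominator always exceeds the numerator, the result stays strictly below one for any finite sample.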

Cramér's V Definition of statistic: $V = \sqrt{\chi^2 / (N (k - 1))}$, where k is the smaller of the number of rows and the number of columns.
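Cramér's V can be sketched with the standard formula sqrt(chi2 / (N (k - 1))). The inputs below are the chi-squared value and N quoted for the marijuana data, with a 2 x 2 table assumed; `cramers_v` is an illustrative name.

```python
import math


def cramers_v(chi2, n, n_rows, n_cols):
    """Cramér's V, with k the smaller of the number of rows and columns."""
    k = min(n_rows, n_cols)
    return math.sqrt(chi2 / (n * (k - 1)))


# For a 2 x 2 table k - 1 = 1, so V coincides with the phi coefficient.
v = cramers_v(96.595, 366, 2, 2)
print(round(v, 2))
```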

Interpretation of Chi-squared measures of association An approximate observed level of significance is given for each measure. Use this in the usual way.

Proportional Reduction in Error (PRE) Measures Prediction is the modal category. Predict overall, ignoring the independent variable: –Predict used marijuana at time 4; correct for 237 and wrong for 129. Number misclassified is 129.

Proportional Reduction in Error (PRE) Measures Predict for each condition of the independent variable. –For those not using at time 3, predict no use at time 4: correct 120 of 215 times, misclassify 95 times. –For those using at time 3, predict use at time 4: correct 142 of 151 times, misclassify 9 times.

Proportional Reduction in Error (PRE) Measures Using only totals, number misclassified is 129. Using marijuana use at time 3, number misclassified is 95 + 9 = 104. The lambda measure is $\lambda = (129 - 104)/129 = 0.19$.
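The lambda calculation can be sketched for a table whose columns are the independent variable. The layout below (rows = use at time 4, columns = use at time 3) is an assumption reconstructed from the counts on these slides, and the function name is hypothetical.

```python
def lambda_predict_rows(observed):
    """Lambda for predicting the row variable from the column variable."""
    row_totals = [sum(row) for row in observed]
    n = sum(row_totals)
    # Errors when always predicting the overall modal row category.
    errors_without = n - max(row_totals)
    # Errors when predicting the modal row category within each column.
    errors_with = sum(sum(col) - max(col) for col in zip(*observed))
    return (errors_without - errors_with) / errors_without


observed = [[120, 9], [95, 142]]  # assumed layout
lam = lambda_predict_rows(observed)
print(round(lam, 2))
```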

Lambda PRE Measures There is also a lambda measure using marijuana use at time 4 as the independent variable. –Total: predict no usage at time 3: 151 errors. –Conditional: for no usage at time 4, predict none at time 3, with 9 errors; for usage at time 4, predict use at time 3, with 95 errors; 104 total errors. –Lambda measure is $(151 - 104)/151 = 0.31$.

Lambda PRE Measures There is a symmetric lambda measure, which pools the error reductions from both directions: $[(129 - 104) + (151 - 104)]/(129 + 151) = 0.26$.
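Both asymmetric lambdas and the symmetric version can be computed from one table. This is a sketch assuming the marijuana counts are arranged with rows = use at time 4 and columns = use at time 3; the helper names are illustrative.

```python
def prediction_errors(table):
    """Return (errors ignoring columns, errors using columns) when predicting rows."""
    row_totals = [sum(row) for row in table]
    n = sum(row_totals)
    errors_without = n - max(row_totals)
    errors_with = sum(sum(col) - max(col) for col in zip(*table))
    return errors_without, errors_with


observed = [[120, 9], [95, 142]]                  # assumed layout
transposed = [list(col) for col in zip(*observed)]  # swap the roles of the variables

e1_without, e1_with = prediction_errors(observed)    # predict time 4 from time 3
e2_without, e2_with = prediction_errors(transposed)  # predict time 3 from time 4

lambda_time4 = (e1_without - e1_with) / e1_without
lambda_time3 = (e2_without - e2_with) / e2_without
lambda_symmetric = ((e1_without - e1_with) + (e2_without - e2_with)) / (
    e1_without + e2_without
)

print(round(lambda_time4, 2), round(lambda_time3, 2), round(lambda_symmetric, 2))
```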

Text Example Data Set

Comparing Pairs of Cases Concordant pair of cases: the sign of the difference on variable 1 is the same as the sign of the difference on variable 2. Discordant pair: the signs are opposite. Tied pair: the difference is zero on at least one variable. –Case 1 and Case 2: concordant. –Case 2 and Case 3: discordant. –Case 1 and Case 3: tied. Let P be the number of concordant pairs and Q be the number of discordant pairs.

Measures Based on Concordant and Discordant Pairs –Goodman and Kruskal's gamma: $(P - Q)/(P + Q)$ –Kendall's tau-b –Kendall's tau-c –Somers' d
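Counting concordant and discordant pairs, and then gamma, can be sketched as below. The four cases are hypothetical ordinal scores invented for illustration (they are not the text example data set), and the function names are assumptions.

```python
from itertools import combinations


def concordant_discordant(cases):
    """Count concordant (P) and discordant (Q) pairs; tied pairs are skipped."""
    p = q = 0
    for (x1, y1), (x2, y2) in combinations(cases, 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:      # differences have the same sign
            p += 1
        elif product < 0:    # differences have opposite signs
            q += 1
    return p, q


def goodman_kruskal_gamma(cases):
    """Gamma = (P - Q) / (P + Q); undefined when every pair is tied."""
    p, q = concordant_discordant(cases)
    return (p - q) / (p + q)


# Hypothetical (variable 1, variable 2) scores for four cases.
cases = [(1, 1), (2, 3), (3, 2), (3, 3)]
print(concordant_discordant(cases), goodman_kruskal_gamma(cases))
```

Gamma ignores tied pairs entirely, which is one reason it tends to be larger than tau-b or Somers' d on the same data.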

Choosing a measure Choose a measure interpretable for the purpose in hand! Avoid data dredging (taking the measure that is largest for the data set that you have).

Other measures Correlation based –Pearson's correlation –Spearman's correlation: replace values by ranks. Measures of agreement –Cohen's kappa.

Summary Contingency table methods are crucial to the analysis of market research and social science data. The hypothesis of independence is tested with the Pearson chi-squared statistic. Measures of association describe the strength of the dependence between two variables.