1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ.

Slides:



Advertisements
Similar presentations
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Advertisements

Contingency Table Analysis Mary Whiteside, Ph.D..
Parametric & Nonparametric Models for Tests of Association Models we will consider X 2 Tests for qualitative variables Parametric tests Pearson’s correlation.
Chapter 12 Goodness-of-Fit Tests and Contingency Analysis
Basic Statistics The Chi Square Test of Independence.
Chapter 13: The Chi-Square Test
Sociology 601 Class 13: October 13, 2009 Measures of association for tables (8.4) –Difference of proportions –Ratios of proportions –the odds ratio Measures.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Lecture 3: Chi-Sqaure, correlation and your dissertation proposal Non-parametric data: the Chi-Square test Statistical correlation and regression: parametric.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Chapter18 Determining and Interpreting Associations Among Variables.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 14 Analysis of Categorical Data
Chapter 12 Chi-Square Tests and Nonparametric Tests
CJ 526 Statistical Analysis in Criminal Justice
Chapter Goals After completing this chapter, you should be able to:
PPA 415 – Research Methods in Public Administration Lecture 9 – Bivariate Association.
Chi-square Test of Independence
Statistics for Managers Using Microsoft® Excel 5th Edition
Previous Lecture: Analysis of Variance
Chap 12-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 12-1 Chapter 12 Chi-Square Tests and Nonparametric Tests Basic Business.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Chapter 15 Nonparametric Statistics
Week 11 Chapter 12 – Association between variables measured at the nominal level.
Presentation 12 Chi-Square test.
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Association between Variables Measured at the Nominal Level.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 12-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
Correlation.
CJ 526 Statistical Analysis in Criminal Justice
In the Lab: Working With Crosstab Tables Lab: Association and the Chi-square Test Chapters 7, 8 and 9 1.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
1 Lecture 7 Two-Way Tables Slides available from Statistics & SPSS page of Social Science Statistics Module I Gwilym Pryce.
Non-parametric Measures of Association. Chi-Square Review Did the | organization| split | Type of leadership for organization this year? | Factional Weak.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
1 Lecture 7: Two Way Tables Graduate School Quantitative Research Methods Gwilym Pryce
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Chap 11-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 11 Chi-Square Tests Business Statistics: A First Course 6 th Edition.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 12-1 Chapter 12 Chi-Square Tests and Nonparametric Tests Statistics for Managers using.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Describing Association for Discrete Variables. Discrete variables can have one of two different qualities: 1. ordered categories 2. non-ordered categories.
Chi Square & Correlation
Copyright © 2014 by Nelson Education Limited Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured.
Outline of Today’s Discussion 1.The Chi-Square Test of Independence 2.The Chi-Square Test of Goodness of Fit.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Chapter 12 Chi-Square Tests and Nonparametric Tests.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Copyright © 2012 by Nelson Education Limited. Chapter 12 Association Between Variables Measured at the Ordinal Level 12-1.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 11 Chi-Square Tests.
Chapter 10 Analyzing the Association Between Categorical Variables
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
Chapter 11 Chi-Square Tests.
Chapter 13 Goodness-of-Fit Tests and Contingency Analysis
Chapter 11 Chi-Square Tests.
Presentation transcript:

1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ SYSTEM OVERALL judgrate: RATE JOB DONE: VT'S JUDGES OVERALL proscrat: RATE JOB DONE: VT'S PROSECUTORS

2 Measures of association Survey rated the criminal justice system in Vermont. There is only 3.2 % of the 601 to rate the system to be Excellent.

3 Measures of association There are 27% of the people who rated judges as Excellent rated the System as Excellent as well.

4 Almost 69% of the people who rated prosecutors as Good rated the System as Good as well. Measures of association

5 The word related can have many different meaning. A perfect relationship is one in which all people gave the same ratings to the overall system and a particular component. It is imperfect relationships that van be quantified in many different ways. We need measures of association. Their range, in general, are in absolute value from 0 to 1.

6 Measures of association-Lambda Let be the number of misclassified in situation 1, and let the number of misclassified in situation 2. The measure of association Lambda is defined by Let us see an example in data crimjust.sav.

7 Measures of association-Lambda For Table on p.3 we consider Situation 1: If we predict Good for everyone, the misclassified number is =310. Situation 2: Consider the rule For each category of the independent variable, predict the category of the dependent variable that occurs most frequently. By the use of this rule we have Excellent: 17=10+7; Good: 104= ; Only fair: 78= ; poor: 21=4+17 The total misclassified is 220=

8 Measures of association-Lambda The value of the is from 0 to 1. The case of indicates you make prediction no errors The case of means that the independent variable is of no help in prediction. Two different lambdas: it is not a symmetric measure. Its value depends on which variable you predict from which.

9 Measures of association-Lambda Two different lambdas We have calculated the lambda for predicting judgrate. To calculate the lambda for predicting cjsrate we have ( )/284=0.19. The symmetric lambda is defined by

10 Measures of association-Lambda

11 Measures of association For Ordinal Variable In the past discussion we did not use the order information: Excellent > Good > Just poor > Poor If judges’ ratings increase as overall rating increase, you can say that the two variables have a positive relationship. Similarly we can define a negative relationship.

12 Measures of association For Ordinal Variable Concordant and discordant pairs A pair of cases is discordant if the value of one variable for a case is larger than the value for the other case but the direction is reversed for the second variable. A pair of cases is called concordant if it is not discordant. cjsratejudgrate Case 112 Case 223 Case 332

13 Measures of association For Ordinal Variable Concordant and discordant pairs Let P be the number of concordant pairs and Q be the number of discordant pairs for all distinct pairs of observations. The Goodman and Kruskai’s gamma is defined by Gamma=(P-Q)/(P+Q)

14 Measures of association For Ordinal Variable A positive gamma tells you that there are more like (concordant) pairs of cases than unlike pairs. There is a positive relationship between the two sets of ratings. As judges’ ratings increase, so do ratings of the overall system. If two variables are independent, the value of gamma is 0. However, a gamma of 0, like a lambda of 0, does not necessarily mean independence.

15 Measures of association For Ordinal Variable Kendall’s Tau-b A measure that attempts to normalized P-Q by considering ties on each variable in a pair separately in tau-b. The tau-b is defined by Where is the number of ties involving only the first variable and is the number of ties involving only the second variable. Tau-b can have the value of 1 and -1 only for tables that have the same number of rows and columns.

16 Measures of association For Ordinal Variable Kendall’s Tau-c A measure that attempts to normalized P-Q is tau-c. That is defined by where m is the samller of the number of rows and columns and N is the number of cases Unfortunately, there is no simple proportional reduction of error interpretation of tau-c either.

17 Measures of association For Ordinal Variable The following results of tau-b and tau-c between cjsrate and judgrate. There is no simple interpretation for the values. The tau-b is a commonly used measure of association.

18 Measures of association For Ordinal Variable There are more measures of association Somers’d on p.428 The Cohen’s kappa on pp How can you decide what measure of association to use? No single measure of association is best for all situations.

19 Correlation-based Measures

20

21 Correlation-based Measures Coefficient of correlation When two variables are numerical the Pearson correlation coefficient has been widely used. It measures the strength of the linear relationship between two numerical variables.

22 Correlation-based Measures Definition of Spearman rank correlation

23 An example of rank correlation

24 Correlation-based Measures Spearman correlation coefficient is a nonparametric measure of association.

25 Measure based on the chi-square statistic Since the chi-square test of independence is often used when analyzing crosstabulations, there are a variety of measure of association that are based on the chi-square statistic. The chi-square statistic is not a good measure of association. Several modifications have been proposed.

26 Contingency Table Example Left-Handed vs. Gender Dominant Hand: Left vs. Right Gender: Male vs. Female  2 categories for each variable, so called a 2 x 2 table  Suppose we examine a sample of size 300

27 Contingency Table Example Sample results organized in a contingency table: (continued) Gender Hand Preference LeftRight Female Male Females, 12 were left handed 180 Males, 24 were left handed sample size = n = 300:

28  2 Test for the Difference Between Two Proportions If H 0 is true, then the proportion of left-handed females should be the same as the proportion of left-handed males! The two proportions above should be the same as the proportion of left-handed people overall H 0 : π 1 = π 2 (Proportion of females who are left handed is equal to the proportion of males who are left handed) H 1 : π 1 ≠ π 2 (The two proportions are not the same Hand preference is not independent of gender)

29 The Chi-Square Test Statistic where: f o = observed frequency in a particular cell f e = expected frequency in a particular cell if H 0 is true  2 for the K x L case has (K-1)(L-1) degree of freedom (Assumed: each cell in the contingency table has expected frequency of at least 5) The Chi-square test statistic is:

30 Decision Rule  2U2U Decision Rule: If  2 >  2 U, reject H 0, otherwise, do not reject H 0 The  2 test statistic approximately follows a chi- squared distribution with one degree of freedom 0  Reject H 0 Do not reject H 0

31 Computing the Average Proportion Here: 120 Females, 12 were left handed 180 Males, 24 were left handed i.e., the proportion of left handers overall is 0.12, that is, 12% The average proportion is:

32 Finding Expected Frequencies To obtain the expected frequency for left handed females, multiply the average proportion left handed (p) by the total number of females To obtain the expected frequency for left handed males, multiply the average proportion left handed (p) by the total number of males If the two proportions are equal, then P(Left Handed | Female) = P(Left Handed | Male) =.12 i.e., we would expect (.12)(120) = 14.4 females to be left handed (.12)(180) = 21.6 males to be left handed

33 Observed vs. Expected Frequencies Gender Hand Preference LeftRight Female Observed = 12 Expected = 14.4 Observed = 108 Expected = Male Observed = 24 Expected = 21.6 Observed = 156 Expected =

34 Gender Hand Preference LeftRight Female Observed = 12 Expected = 14.4 Observed = 108 Expected = Male Observed = 24 Expected = 21.6 Observed = 156 Expected = The Chi-Square Test Statistic The test statistic is:

35 Decision Rule Decision Rule: If  2 > 3.841, reject H 0, otherwise, do not reject H 0 Here,  2 = <  2 U = 3.841, so we do not reject H 0 and conclude that there is not sufficient evidence that the two proportions are different at  = 0.05   2 U =  Reject H 0 Do not reject H 0

36 Measure based on the chi-square statistic Since the chi-square test of independence is often used when analyzing crosstabulations, there are a variety of measure of association that are based on the chi-square statistic. The chi-square statistic is not a good measure of association. Several modifications have been proposed.