PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.

Slides:



Advertisements
Similar presentations
Chapter 18: The Chi-Square Statistic
Advertisements

COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Hypothesis Testing IV Chi Square.
Statistical Inference for Frequency Data Chapter 16.
INTRODUCTION TO NON-PARAMETRIC ANALYSES CHI SQUARE ANALYSIS.
Chapter 13: The Chi-Square Test
S519: Evaluation of Information Systems
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Chapter 12 Chi-Square Tests and Nonparametric Tests
Lecture 8 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Chi-square Test of Independence
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Business 205. Review Correlation MS5 Preview Chi-Square.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 17: Chi-Square.
CHI-SQUARE statistic and tests
Lecture 7 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
AM Recitation 2/10/11.
Statistics for the Behavioral Sciences (5 th ed.) Gravetter & Wallnau Chapter 17 The Chi-Square Statistic: Tests for Goodness of Fit and Independence University.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
230 Jeopardy Unit 4 Chi-Square Repeated- Measures ANOVA Factorial Design Factorial ANOVA Correlation $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500.
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Chapter 16 The Chi-Square Statistic
Statistics for the Behavioral Sciences, Sixth Edition by Frederick J. Gravetter and Larry B. Wallnau Copyright © 2004 by Wadsworth Publishing, a division.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
CHI SQUARE TESTS.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
© aSup-2007 CHI SQUARE   1 The CHI SQUARE Statistic Tests for Goodness of Fit and Independence.
Other Chi-Square Tests
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Week 6 Dr. Jenne Meyer.  Article review  Rules of variance  Keep unaccounted variance small (you want to be able to explain why the variance occurs)
Outline of Today’s Discussion 1.The Chi-Square Test of Independence 2.The Chi-Square Test of Goodness of Fit.
Chapter Fifteen Chi-Square and Other Nonparametric Procedures.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Cross Tabs and Chi-Squared Testing for a Relationship Between Nominal/Ordinal Variables.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Chapter 17: Chi Square Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Goodness-of-Fit and Contingency Tables Chapter 11.
CHAPT 18 Resampling and Nonparametric Approaches to Data &
Chi-Square (Association between categorical variables)
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Hypothesis Testing Review
Community &family medicine
Qualitative data – tests of association
Different Scales, Different Measures of Association
Parametric versus Nonparametric (Chi-square)
Chapter 18: The Chi-Square Statistic
Presentation transcript:

PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology

Agenda Correlation & Regression (Continue) see Lecture 11 outline Interpreting Correlations The Point-Biserial Correlation Introduction to Regression Chi-Square (  2 ) Parametric & Nonparametric Tests Chi-Square test for Goodness of Fit The Chi-Square Distribution & df The Chi-Square Test for Independence Measuring Effect Size for  2 Test of Independence Assumptions & Restrictions for Chi-Square Tests

Parametric & Nonparametric Tests Parametric Tests: Test hypotheses about population parameters (e.g. ,  1 –  2,  ) Make assumptions about the shape of the population distribution & population parameters (e.g. ANOVA requires normal population distributions & homogeneity of variance) Require data measured on interval or ratio scales. Nonparametric Tests: Distribution-free tests, make few (if any) assumptions about the population distribution) Nominal or ordinal scale data (mean and variance cannot be calculated) The data for many nonparametric tests are frequencies. NOTE: nonparametric tests are less sensitive (i.e. less powerful) then parametric tests. Always use parametric when you can.

Chi-Square (  2 ) Test for Goodness of Fit Chi-Square (  2 ) Test for Goodness of Fit: Is a hypothesis-testing procedure that uses sample proportions to test hypotheses about the shape or proportions of a population distribution. Examples proportion of male and female psychologists Of two brands of O.J., which is preferred by most Americans. To what extent are ethnic minority groups represented in post-graduate education. Evaluates how well the obtained sample proportions fit the population proportions specified by the null hypothesis.

The Null Hypothesis for Goodness-of-Fit Test The Null Hypothesis Specifies the proportion (or percentage) of the population in each measurement category. e.g. 90% of lawyers are men, 10% are women. H 0 : MenWomen General Types of Null Hypotheses: No Preference among the categories e.g. no preference among 3 brands of soda H 0 : Brand X Brand Y Brand Z No Difference from a Known Population e.g. 60% American oppose the president’s foreign policy, 40% favor A researcher wonder if the same pattern exists among Europeans. H 0 : FavorOppose 90%10% 1/3 40%60%

The Data For the Goodness-of-Fit Test Data for the Goodness-of-Fit Chi-Square: Just count how many individuals in your sample in each category. This count is the observed frequency, f o e.g.observed frequencies for a sample of n = 40 classified into three personality types: Category A Category BCategory C n = 40  f o = n 15196

The Expected Frequencies The expected frequency, f e, for each category: is the frequency value predicted from the null hypothesis and the sample size (n). expected frequency = f e = pn e.g. H 0 : Category A Category B Category C According to this null hypothesis, how would a random sample of n = 40 distribute across the three categories? 25% of 40 =.25(40) = 10 in Category A 50% of 40 =.50(40) = 20 in Category B 25% of 40 =.25(40) = 10 in Category C 25%50%25%

The Chi-Square Statistic The  2 test for goodness of fit: measures how well the data (the observed frequencies) fit the distribution defined by the null hypothesis (the expected frequencies). Chi-square =  2 =  (f o – f e ) 2 f e the numerator of the  2 simply measure how much difference between the data and the null hypothesis for each category this difference is divided by the expected frequency to determine the relative size the obtained discrepancy between f o and f e

The Chi-Square Distribution & df The Typical Chi-Square Distribution: all  2 values are > 0 When H 0 is true, the data (f o ) should be close to the hypothesis (f e ). Chi-square should be small (close to 0). The Chi-Square Distribution is positively skewed. Figure 16.2 Degree of Freedom There is a whole family of chi-square distributions, the exact shape determined by the df df = C – 1, for the goodness-of-fit test C = number of categories Locating the Critical Region Table B.7

The Chi-Square Test for Goodness of Fit, Example Experiment: a psychologist presents an abstract painting with no obvious top or bottom to a sample of n = 50 participants, each participant was asked to hang the painting in whatever orientation looked the best. The Research Question: Are there any preferences among the four possible orientations. The Data Top up Bottom up Left side up Right side up

The Chi-Square Test for Goodness of Fit, Example Step 1: State the hypotheses: H 0 : In the general population, there is no preference for any specific orientation Top up Bottom up Left side upRight side up H 1 : In the general population, one or more of the orientations is preferred over the others. Step 2: Local the critical region: df = C – 1 = 4 – 1 = 3 For df = 3 and  =.05, the critical values for  2 = %

The Chi-Square Test for Goodness of Fit, Example Step 3: Calculate the statistic: compute the expected frequencies (f e ) f e = pn = ¼(50) = 12.5 Observed frequencies Top up Bottom up Left side up Right side up Expected frequencies Top up Bottom up Left side up Right side up  2 =  (f o – f e ) 2 = ( ) 2 + (17 – 12.5) 2 + ( ) 2 + (8 – 12.5) 2 f e = 8.08 Step 4: Make a decision: Since 8.08 is in the critical region, reject H 0 and conclude that the 4 orientations are not equally likely to be preferred

Chi-Square (  2 ) Test for Independence Chi-Square (  2 ) Test for Independence: Test whether or not there is a relationship between two categorical variables. Example: is there a relationship between personality and color preference? Each individual is measured/classified on the two variables The Data: the observed frequencies RedYellowGreenBlue Introvert50 Extrovert

Chi-Square (  2 ) Test for Independence, the Null Hypothesis The General Null Hypothesis: The two variables being measured are independent (i.e. for each individual, the value obtained for one variable is not influenced by the value for the second variable. Two Different Forms of the Null Hypothesis: H 0 version 1 (similar to correlation): The data are viewed as a single sample with each individual measured on two variables. H 0 : For the general population, there is no relationship between color preference and personality. H 0 version 2 (similar to t test/ANOVA): The data are viewed as two (or more) separate samples representing two (or more) separate populations. H 0 : in the population, there is no difference between the distribution of color preferences for introverts and the distribution of color preferences for extroverts (i.e. the distributions have the same proportions). The two versions of H 0 are equivalent

The Observed & Expected Frequencies The observed frequencies, f o : The frequencies in the sample distribution. The expected frequencies, f e : Define an ideal hypothetical distribution in perfect agreement with the null hypothesis.

Calculating the Expected Frequencies, Step 1 H 0 : the frequency distribution of color preference has the same shape (same proportions) for both categories of personality. Step 1: determine the overall distribution of color preference, irrespective of personality type. RedYellowGreenBlue Introvert50 Extrovert Proportion prefer red: 100 out of 200 = 50% Proportion prefer yellow: 20 out of 200 = 10% Proportion prefer green:40 out of 200 = 20% Proportion prefer blue:40 out of 200 = 20%

Calculating the Expected Frequencies, Step 2 Step 2: apply this distribution of color preference to both categories of personality RedYellowGreenBlue Introvert50 Extrovert For the sample of Introvert: 50% choose red:f e = 0.50(50) = 25 10% choose yellow:f e =.10(50) = 5 20% choose green:f e =.20(50) = 10 20% choose blue:f e =.20(50) = 10 For the sample of Extrovert: 50% choose red:f e = 0.50(150) = 75 10% choose yellow:f e =.10(150) = 15 20% choose green:f e =.20(150) = 30 20% choose blue:f e =.20(150) = 30

Calculating the Expected Frequencies The Complete Set of Expected Frequencies Red YellowGreen Blue Introvert 50 Extrovert This is the distribution predicted by the null hypothesis. A Simple Formula for f e : f e = f c f r n f c = frequency total for the column (column total) f r = frequency total for the row (row total) n = total number of individuals in the study

The Chi-Square Statistic The  2 test of independent Chi-square =  2 =  (f o – f e ) 2 f e (same formula as for test of goodness of fit) df = (R – 1)(C – 1) R = the number of rows C = the number of columns

Chi-Square Test for Independent, An Example A researcher is interested in the relationship between academic performance and self-esteem. Sample: n = 150 ten-year-old children The data (observed frequencies) Level of self-esteem High Medium Low Academic High 60 performance Low

The Chi-Square Test for Independence, An Example Step 1: State the hypotheses: Version 1: H 0 : In the general population, there is no relationship between academic performance and self-esteem. H 1 : There is a consistent, predictable relationship between academic performance and self-esteem. Version 2: H 0 : In the general population, the distribution of self-esteem is the same for high and low academic performers. H 1 : The distribution for self-esteem for high academic performers is different form the distribution for low academic performers. Step 2: Local the critical region: df = (R – 1)(C – 1) = (2 – 1)(3 – 1) = 2 For df = 2 and  =.05, the critical values for  2 = 5.99

The Chi-Square Test for Goodness of Fit, Example Step 3: Calculate the statistic: compute the expected frequencies (f e ) f e = f c f r n Level of self-esteem High Medium Low Academic High 60 performance Low n =

The Chi-Square Test for Goodness of Fit, Example Step 3: Calculate the statistic: Calculate the chi-square  2 =  (f o – f e ) 2 f e = ( ) 2 + (32 – 30) 2 + ( ) 2 + (13 – 18) ( ) 2 + (34 – 27) = 8.23 Step 4: Make a decision: Since 8.23 is in the critical region, reject H 0 and conclude that there is a significant relationship between academic performance and self-esteem. Or, the data show a significant difference between the distribution of self-esteem for high academic performers versus low academic performers.

Effect Size for Chi-Square Test for Independence For 2 X 2 Matrix: Phi-Coefficient =  =  is a correlation and measures the strength of the relationship between the two variables. Sometime  is squared to obtain the percentage of variance accounted for, exactly the same as r 2 For Matrix Larger than 2 X 2: Cramer’s V = n = number of individuals in the study df* is the smaller of (C – 1) and (R – 1) Table 16.9: Interpreting Cramer’s V Table 16.9

Assumptions & Restrictions for Chi-Square Tests Independence of Observations One consequences is that each individual can be classified in only one category. The  all the cell frequencies = n Size of Expected Frequencies A chi-square test should not be performed when the expected frequency of any cell is less than 5. Very small f e can distort the chi-square statistic.