Chapter 11(1e), Ch. 10 (2/3e) Hypothesis Testing Using the Chi Square ( χ 2 ) Distribution.

Slides:



Advertisements
Similar presentations
What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
Advertisements

CHI-SQUARE(X2) DISTRIBUTION
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Contingency Tables (cross tabs)  Generally used when variables are nominal and/or ordinal Even here, should have a limited number of variable attributes.
Bivariate Analysis Cross-tabulation and chi-square.
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
CJ 526 Statistical Analysis in Criminal Justice
PPA 415 – Research Methods in Public Administration Lecture 9 – Bivariate Association.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Crosstabs and Chi Squares Computer Applications in Psychology.
PPA 415 – Research Methods in Public Administration Lecture 8 – Chi-square.
PPA 501 – Analytical Methods in Administration Lecture 9 – Bivariate Association.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Chapter 14 in 1e Ch. 12 in 2/3 Can. Ed. Association Between Variables Measured at the Ordinal Level Using the Statistic Gamma and Conducting a Z-test for.
Week 11 Chapter 12 – Association between variables measured at the nominal level.
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
Hypothesis Testing IV (Chi Square)
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
AM Recitation 2/10/11.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
CJ 526 Statistical Analysis in Criminal Justice
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
Week 8 Chapter 8 - Hypothesis Testing I: The One-Sample Case.
Chapter 8 Hypothesis Testing I. Chapter Outline  An Overview of Hypothesis Testing  The Five-Step Model for Hypothesis Testing  One-Tailed and Two-Tailed.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Copyright © 2012 by Nelson Education Limited. Chapter 10 Hypothesis Testing IV: Chi Square 10-1.
Chi-Square. All the tests we’ve learned so far assume that our data is normally distributed z-test t-test We test hypotheses about parameters of these.
Chapter 16 The Chi-Square Statistic
Chapter 11 Hypothesis Testing IV (Chi Square). Chapter Outline  Introduction  Bivariate Tables  The Logic of Chi Square  The Computation of Chi Square.
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
CHI SQUARE TESTS.
Chi-square Test of Independence
Chapter 11, 12, 13, 14 and 16 Association at Nominal and Ordinal Level The Procedure in Steps.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Reasoning in Psychology Using Statistics Psychology
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
Nonparametric Tests of Significance Statistics for Political Science Levin and Fox Chapter Nine Part One.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chapter 13. The Chi Square Test ( ) : is a nonparametric test of significance - used with nominal data -it makes no assumptions about the shape of the.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
ANOVA Knowledge Assessment 1. In what situation should you use ANOVA (the F stat) instead of doing a t test? 2. What information does the F statistic give.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
I. ANOVA revisited & reviewed
Chapter 9: Non-parametric Tests
Hypothesis Testing: One Sample Cases
Hypothesis Testing Review
Community &family medicine
Hypothesis Testing Using the Chi Square (χ2) Distribution
Chapter 14 in 1e Ch. 12 in 2/3 Can. Ed.
PPA 501 – Analytical Methods in Administration
Reasoning in Psychology Using Statistics
Contingency Tables (cross tabs)
Reasoning in Psychology Using Statistics
Reasoning in Psychology Using Statistics
UNIT V CHISQUARE DISTRIBUTION
S.M.JOSHI COLLEGE, HADAPSAR
Chapter 18: The Chi-Square Statistic
Hypothesis Testing - Chi Square
Contingency Tables (cross tabs)
Presentation transcript:

Chapter 11(1e), Ch. 10 (2/3e) Hypothesis Testing Using the Chi Square ( χ 2 ) Distribution

Outline: The basic logic of Chi Square. The terminology used with bivariate tables. The computation of Chi Square with an example problem using the Five Step Model

Basic Logic Chi Square is a test of significance based on bivariate tables. We are looking for significant differences between the actual cell frequencies in a table (f o ) and those that would be expected by random chance (f e ). The data are often presented in a table format. If starting with raw data on two variables, a bivariate table must be created first.

Bivariate Tables: Must have a title. Cells are intersections of columns and rows. Subtotals are called marginals. N is reported at the intersection of row and column marginals.

Tables (cont.) Columns are scores of the independent variable.  There will be as many columns as there are scores on the independent variable. Rows are scores of the dependent variable.  There will be as many rows as there are scores on the dependent variable.

Tables (cont.) There will be as many cells as there are scores on the two variables combined. Each cell reports the number of times each combination of scores occurred.

What your table should look like: Title RowsColumns  Total Row 1cell acell bRow Marginal 1 Row 2cell ccell dRow Marginal 2 TotalColumn Marginal 1 Column Marginal 2 N

The Chi Square Distribution The chi square distribution is asymmetric and its values are always positive (Appendix C). Degrees of freedom are based on the table and are calculated as (rows-1)X(columns-1).

Example: 1e #11.2 (#10.2 in 2/3e) Question: Are the homicide rate and volume of gun sales related for a sample of 25 cities? The bivariate table showing the relationship between homicide rate (columns) and gun sales (rows). This 2x2 table has 4 cells. GUN SALESLowHighTotals High8513 Low4812 Totals1213N = 25 HOMICIDE RATE

Solution Using 5-Step Method Step 1 Make Assumptions and Meet Test Requirements Independent random samples Level of measurement is nominal Note that no assumption is made about the shape of the sampling distribution. When the distribution is normal, a parametric test (Z- or t-test, ANOVA) can be used. The chi square test is non-parametric. It can be used when normality is not assumed.

Step 2 State the Null and Alternate Hypothesis H 0 : The variables are independent  You can also say:H 0 : f o = f e H 1 : The variables are dependent  Or: H 1 : f o ≠ f e

Step 3 Select the Sampling Distribution and Establish the Critical Region Because normality is not assumed and our data are in tabular form, our Sampling Distribution = χ 2 Alpha =.05 df = (r-1)(c-1) = 1 χ 2 (critical) = 3.841

Step 4 Calculate the Test Statistic Formula: χ 2 (obtained) = Method: 1. Find expected frequencies for each cell. 2. Complete computational table to find χ 2 (obtained)

1. Find expected frequencies for each cell. To find f e = Multiply column and row marginals for each cell and divide by N. (13*12)/25 = 156/25 = 6.24 (13*13)/25 = 169/25 = 6.76 (12*12)/25 = 144/25 = 5.76 (12*13)/25 = 156/25 = 6.24

Observed and Expected Frequencies for each cell (Note that totals are unchanged): GUN SALESLowHighTotal High f o = 8 f e = 6.24 f o = 5 f e = Low f o = 4 f e = 5.76 f o = 8 f e = Total1213N = 25 HOMICIDE RATE

2. Complete Computational Table A table like this will help organize the computations: (a) Add values for f o and f e for each cell to table. fofo fefe f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e Total 2525

Computational Table (cont.) (b) Subtract each f e from each f o. The total of this column must be zero. fofo fefe f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e Total 25250

Computational Table (cont.) (c) Square each of these values fofo fefe f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e Total 25250

Computational Table (cont.) (d) Divide each of the squared values by the f e for that cell. (e) The sum of this column is chi square. χ 2 (obtained) = 2.00 fofo fefe f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e Total χ 2 = 2.00

Step 5 Make a Decision and Interpret the Results of the Test χ 2 (critical) = χ 2 (obtained) = 2.00 The test statistic is not in the Critical Region. Fail to reject the H 0. There is no significant relationship between homicide rate and gun sales.

Interpreting Chi Square The chi square test tells us only if the variables are independent or not. It does not tell us the pattern or nature of the relationship. To investigate the pattern, compute % within each column and compare across the columns.

Interpreting Chi Square (cont.) Cities low on homicide rate were high in gun sales and cities high in homicide rate were low in gun sales. As homicide rates increase, gun sales decrease. This relationship is not significant but does have a clear pattern. GUN SALESLowHighTotal High8 (66.7%)5 (38.5%)13 Low4 (33.3%)8 (61.5%)12 Total12 (100%)13 (100%)N = 25 HOMICIDE RATE

The Limits of Chi Square Like all tests of hypothesis, chi square is sensitive to sample size.  As N increases, obtained chi square increases.  With large samples, trivial relationships may be significant. To correct for this, when N>1000, set your alpha =.01. Remember: significance is not the same thing as importance.

Yates’ Correction for Continuity The chi square statistic is sensitive to small cell sizes. Whenever any of your cell sizes are <5, a slightly modified formula using Yates’ correction for continuity to calculate chi square should be calculated. Modified Formula (Note:.5 is deducted from the absolute value of f o – f e before squaring) (obtained) =

Using SPSS to Calculate Chi Square 1. Open data set in SPSS. 2. Go to Analyze>Descriptive Statistics>Crosstabs. 3. Move your dependent variable into the Rows box, and your independent variable into the Columns box. 4. Click Statistics and check box for Chi square. 5. Click Cells and select Column in percentages box. 6. Click Continue and OK. Note: The “rule of thumb” for analyzing your % data is “Percentage Down, Compare Across” When analyzing your %, always compare the categories of your dependent (Row) variable across the columns of your independent (Column) variable.

Practice Question: Try 1e #11.3 (#10.3 in 2/3e) parts a and b. Use the five step model to calculate a full solution to this question.