Chi Square Tests Chapter 17. Nonparametric Statistics A special class of hypothesis tests Used when assumptions for parametric tests are not met –Review:

Slides:



Advertisements
Similar presentations
What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
Advertisements

Chapter 18: The Chi-Square Statistic
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Chi Square Tests Chapter 17.
Random variable Distribution. 200 trials where I flipped the coin 50 times and counted heads no_of_heads in a trial.
Chapter Seventeen HYPOTHESIS TESTING
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.
Bivariate Statistics GTECH 201 Lecture 17. Overview of Today’s Topic Two-Sample Difference of Means Test Matched Pairs (Dependent Sample) Tests Chi-Square.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
1 Categorical Data (Chapter 10) Inference about one population proportion (§10.2). Inference about two population proportions (§10.3). Chi-square goodness-of-fit.
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
COURSE: JUST 3900 Tegrity Presentation Developed By: Ethan Cooper Final Exam Review.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
AM Recitation 2/10/11.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Statistics for the Behavioral Sciences
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Chapter 16 The Chi-Square Statistic
The Logic of Statistical Significance & Making Statistical Decisions
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
CHI SQUARE TESTS.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
© aSup-2007 CHI SQUARE   1 The CHI SQUARE Statistic Tests for Goodness of Fit and Independence.
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
Copyright © 2010 Pearson Education, Inc. Slide
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter Outline Goodness of Fit test Test of Independence.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
1 Hypothesis Testing Goodness-of-fit & Independence Chi-Squared Tests.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chapter 12 The Analysis of Categorical Data and Goodness of Fit Tests.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chi Square Tests Chapter 17. Assumptions for Parametrics >Normal distributions >DV is at least scale >Random selection Sometimes other stuff: homogeneity,
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Cross Tabulation with Chi Square
The Chi-square Statistic
Part Four ANALYSIS AND PRESENTATION OF DATA
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 16: Research with Categorical Data.
Hypothesis Testing Review
Qualitative data – tests of association
What are their purposes? What kinds?
Chapter 18: The Chi-Square Statistic
Presentation transcript:

Chi Square Tests Chapter 17

Nonparametric Statistics A special class of hypothesis tests Used when assumptions for parametric tests are not met –Review: What are the assumptions for parametric tests?

Assumptions for Parametric Tests Dependent variable is a scale variable  interval or ratio –If the dependent variable is ordinal or nominal, it is a non- parametric test Participants are randomly selected –If there is no randomization, it is a non-parametric test The underlying population distribution is normal –If the shape is not normal, it is a non-parametric test

When to Use Nonparametric Tests When the dependent variable is nominal –What are ordinal, nominal, interval, and ratio scales of measurement? Used when either the dependent or independent variable is ordinal Used when the sample size is small Used when underlying population is not normal

Limitations of Nonparametric Tests Cannot easily use confidence intervals or effect sizes Have less statistical power than parametric tests Nominal and ordinal data provide less information More likely to commit type II error –Review: What is type I error? Type II error?

Chi-Square Test for Goodness-of-Fit Nonparametric test when we have one nominal variable –These variables, also called "attribute variables" or "categorical variables," classify observations into a small number of categories. A good rule of thumb is that an individual observation of a nominal variable is usually a word, not a number –Examples of nominal variables include sex (the possible values are male or female), genotype (values are AA, Aa, or aa), or ankle condition (values are normal, sprained, torn ligament, or broken)

Chi-Square Test for Goodness-of-Fit Nonparametric test when we have one nominal variable –Measurement v. Nominal: Imagine recording each observation in a lab notebook. If you record a number (width, height, speed, errors) it’s a measurement, if you record a label it’s nominal (sex, popularity, beauty)

Examples of When to Use Chi-Square The observed counts of numbers of observations in each category are compared with the expected counts, which are calculated using some kind of theoretical expectation, such as a 1:1 sex ratio, or 4:2:1 population density in following example. Example: looking at an area of shore that had 59% of the area covered in sand, 28% mud and 13% rocks (4:2:1); if seagulls were standing in random places, your null hypothesis would be that 59% of the seagulls were standing on sand, 28% on mud and 13% on rocks (4:2:1).

Examples of Chi-Square Does the count of the Observed match the count of the Expected? Mendel crossed peas that were heterozygotes for Smooth/wrinkled, where Smooth is dominant. The expected ratio in the offspring is 3 Smooth: 1 wrinkled. He observed 423 Smooth and 133 wrinkled. The expected frequency of Smooth is calculated by multiplying the sample size (556) by the expected proportion (0.75) to yield 417. The same is done for green to yield 139. The number of degrees of freedom when an extrinsic hypothesis is used is the number of values of the nominal variable minus one. In this case, there are two values (Smooth and wrinkled), so there is one degree of freedom. The result is chi-square=0.35, 1 d.f., P=0.557, indicating that the null hypothesis cannot be rejected; there is no significant difference between the observed and expected frequencies.

Examples of Chi-Square Does the count of the Observed match the count of the Expected? Mannan and Meslow (1984) studied bird foraging behavior in a forest in Oregon. In a managed forest, 54% of the canopy volume was Douglas fir, 40% was ponderosa pine, 5% was grand fir, and 1% was western larch. They made 156 observations of foraging by red-breasted nuthatches; 70 observations (45% of the total) in Douglas fir, 79 (51%) in ponderosa pine, 3 (2%) in grand fir, and 4 (3%) in western larch. The biological null hypothesis is that the birds forage randomly, without regard to what species of tree they're in; the statistical null hypothesis is that the proportions of foraging events are equal to the proportions of canopy volume. The difference in proportions is significant (chi-square=13.593, 3 d.f., P=0.0035).

How the test works The test statistic is calculated by taking an observed number (O), subtracting the expected number (E), then squaring this difference. The larger the deviation from the null hypothesis, the larger the difference between observed and expected is. Squaring the differences makes them all positive. Each difference is divided by the expected number, and these standardized ratios are summed: the more differences between what you would expect and what you get the bigger the number.

Chi-Square Test for Goodness-of-Fit The six steps of hypothesis testing Question: Are the best soccer players born early rather than later in the year ? 1. Identify 2. State the hypotheses 3. Characteristics of the comparison distribution 4. Critical values 5. Calculate 6. Decide

Chi-Square Test for Goodness-of-Fit The six steps of hypothesis testing 1.Identify Pop. Distribution & Assumptions a)Two populations, one distribution that matches expected outcomes and another where distribution matches observed outcomes. E.g., great soccer players are born evenly throughout year, great soccer players born in first half of year. b)Comparison distribution is chi-square c)First assumption, variable of interest is nominal, birth month. Second, independence of observation, that is each observation fits in only one category, no soccer player has two birth months. Third, random selection of pop ( in this case, they are only Germans, and only elite). Fourth, large enough sample size, ideally 5 times the number of cells (in this case N= 56 > 10 (2 x 5).

Chi-Square Test for Goodness-of-Fit State the hypotheses: does the Observed count of elite soccer player Birth Months match the Expected count of elite soccer player Birth Months Null: Match Alternative: No match

Chi-Square Test for Goodness-of-Fit Characteristics of the comparison distribution Only two categories of soccer players

Chi-Square Test for Goodness-of-Fit Critical values

Chi-Square Test for Goodness-of-Fit Calculate

Chi-Square Test for Goodness-of-Fit Calculate

Making a Decision

Evenly divided expected frequencies –Can you think of examples where you would expect evenly divided expected frequencies in the population? A more typical Chi-Square

Chi-square test for independence –Analyzes 2 nominal variables –The six steps of hypothesis testing 1. Identify 2. State the hypotheses 3. Characteristics of the comparison distribution 4. Critical values 5. Calculate 6. Decide

The Cutoff for a Chi-Square Test for Independence

The Decision

Cramer’s V (phi) The effect size for chi-square test for independence

Graphing Chi-Squared Percentages

Relative Risk >We can quantify the size of an effect with chi square through relative risk, also called relative likelihood. >By making a ratio of two conditional proportions, we can say, for example, that one group is three times as likely to show some outcome or, conversely, that the other group is one-third as likely to show that outcome.

Adjusted Standardized Residuals >The difference between the observed frequency and the expected frequency for a cell in a chi-square research design, divided by the standard error; also called adjusted residual.

Formulae

Determining the Cutoff for a Chi-Square Statistic