The Practice of Statistics Third Edition Chapter (13.1) 14.1: Chi-square Test for Goodness of Fit Copyright © 2008 by W. H. Freeman & Company Daniel S.

Slides:



Advertisements
Similar presentations
Lesson Test for Goodness of Fit One-Way Tables.
Advertisements

Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
CHAPTER 23: Two Categorical Variables The Chi-Square Test ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Chapter 11 Inference for Distributions of Categorical Data
Chi Square Procedures Chapter 11.
Inference about the Difference Between the
Chapter 11 Inference for Distributions of Categorical Data
Chapter 13: Inference for Distributions of Categorical Data
Chapter 26: Comparing Counts
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Chi-Square Tests and the F-Distribution
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Testing Distributions Section Starter Elite distance runners are thinner than the rest of us. Skinfold thickness, which indirectly measures.
Goodness-of-Fit Tests and Categorical Data Analysis
AP STATISTICS LESSON 13 – 1 (DAY 1) CHI-SQUARE PROCEDURES TEST FOR GOODNESS OF FIT.
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
13.1 Goodness of Fit Test AP Statistics. Chi-Square Distributions The chi-square distributions are a family of distributions that take on only positive.
Analysis of two-way tables - Formulas and models for two-way tables - Goodness of fit IPS chapters 9.3 and 9.4 © 2006 W.H. Freeman and Company.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
Chapter 11: Inference for Distributions of Categorical Data.
Analysis of two-way tables - Formulas and models for two-way tables - Goodness of fit IPS chapters 9.3 and 9.4 © 2006 W.H. Freeman and Company.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Chapter 26 Chi-Square Testing
Chapter 11 Inference for Tables: Chi-Square Procedures 11.1 Target Goal:I can compute expected counts, conditional distributions, and contributions to.
13.2 Chi-Square Test for Homogeneity & Independence AP Statistics.
+ Chi Square Test Homogeneity or Independence( Association)
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Chapter 14: Chi-Square Procedures – Test for Goodness of Fit.
CHAPTER 23: Two Categorical Variables The Chi-Square Test ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Copyright © 2010 Pearson Education, Inc. Slide
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter Outline Goodness of Fit test Test of Independence.
+ Chapter 11 Inference for Distributions of Categorical Data 11.1Chi-Square Goodness-of-Fit Tests 11.2Inference for Relationships.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 12 The Analysis of Categorical Data and Goodness of Fit Tests.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter Fifteen Chi-Square and Other Nonparametric Procedures.
The Practice of Statistics Third Edition Chapter 14: Inference for Distributions of Categorical Variables: Chi-Square Procedures Copyright © 2008 by W.
Chi Square Test for Goodness of Fit. p ,5,8.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
Chapter 14 Inference for Distribution of Categorical Variables: Chi-Squared Procedures.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chi Square Procedures Chapter 14. Chi-Square Goodness-of-Fit Tests Section 14.1.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
11/12 9. Inference for Two-Way Tables. Cocaine addiction Cocaine produces short-term feelings of physical and mental well being. To maintain the effect,
Test for Goodness of Fit
John Loucks St. Edward’s University . SLIDES . BY.
1) A bicycle safety organization claims that fatal bicycle accidents are uniformly distributed throughout the week. The table shows the day of the week.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 11: Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Lesson 11 - R Chapter 11 Review:
Analyzing the Association Between Categorical Variables
Chapter 13: Inference for Distributions of Categorical Data
Inference for Distributions of Categorical Data
Presentation transcript:

The Practice of Statistics Third Edition Chapter (13.1) 14.1: Chi-square Test for Goodness of Fit Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

When to use the Chi Square, χ 2, Procedure Used when the dependent variable is categorical or ranked data. When the assumptions about the population are not reasonable. For example, populations that are non-normal distributions.

This Chapter will Cover Three Tests based on the Chi-square Distributions. Test if observed counts for a categorical data could come from a certain hypothesized distribution. ( Goodness of Fit). Test whether a single categorical variable has the same distribution in two or more distinct population. ( Inference for Two-Way Tables, Tests for Homogeneity of Populations) Test whether two categorical variables are associated or independent. (Inference for Two- Way Tables, Tests for of Association/ Independence.)

Required Conditions For Goodness of Fit Procedure SRS The observations must be independent and each observation must fit into one and only one cell or category. All individual expected counts are at least one and no more than 20% of the expected counts are less than 5. Please note: We are working with counts – not proportions. There is no mention of normality. Chi-squared procedures do not rely on assumptions about the population from which the sample is selected.

Hypothesizes for Goodness of Fit Test H 0 = the actual population proportions are equal to the hypothesized proportions. H a = the actual proportions are different from the hypothesized proportions.

Chi-square Test Statistic Degree of freedom = k – 1, where k is the number of categories. Use the appropriate chi-square distribution based on degree of freedom, to find the critical value of χ 2 at an α level.

Properties of Chi-square Distributions Total area under the curve is one. Each chi-square distribution except for df = 1 start at the origin, increases to a peak and then approach the x-axis asymptotically form above. Each distribution is skewed to the right. As the number of degrees of freedom increases the distribution becomes for symmetrical and looks like a normal curve.

Example 1 Consider the problem of determining whether the distribution of car sales in the Eastern United States in the current year for Nissans, Mazdas, Toyotas and Hondas is the same as the known distribution of the pervious year, given in the table below: Nissan18% Mazda10% Toyota35% Honda37% From the Motor Vehicle Bureau records, we select a random sample of 1,000 of new car purchases for one of these four types of foreign cars in the current year. The information is displayed below: Frequency Nissan150 Mazda65 Toyota385 Honda400 Is the current year’s sales distribution the same as last year’s sales ?

Example 1 Continued Step 1 – We want to determine if the sales distribution is different from last year’s sales distribution. –Population – this year sales of Nissan, Mazda, Toyota, and Hondas. –Parameter – the proportion of each car sold. –H 0 = The current year’s sales distribution is the same as that of the pervious year’s distribution ( Nissan: 18%, Mazda: 10%, Toyota: 35%, and Honda: 37%). –H a = The current year’s sales distribution is not the same as the previous year.

Example 1 Continued Step 2 Condition –SRS – Random sample taken from the Motor Vehicle Bureau. We do not know if the sample was taken from all state motor vehicle bureau is eastern United States. We will assume we have an SRS. –Expected counts: Nissan: 0.18 x 1000 = 180 Mazda: 0.10 x 1000 = 100 Toyota: 0.35 x 1000 = 350 Honda: 0.37 x 1000 = 370 All expected counts are at least 5 or more. –Independence - observations or counts are independent.

Example 1 Continued Step 3 Calculations Nissan Mazda Toyota Honda Observed Expected Count (O) Count (E) Sum = From Table D using df = 3 and α = 0.05, the critical χ 2 * = 7.81.

Example 1 Continued Step 4 Interpretation Since χ 2 = is to the right of χ 2 *, the P-value is smaller than α = The results are statistically significant to reject H 0. The current sales distribution is not the same as last year’s sales distribution. The test only tells you there is a change. Additional analysis may be required. We need to look at (O –E) 2 /E column to find the major contributor to the Chi-square statistic. In this problem, not as many Mazda were sold in the current year.

Example 2 Are you more likely to have a motor vehicle collision when using a cell phone? A study of 699 drivers who were using a cell phone when they were involved in a collision examined this question. These drivers made 26,798 cell phone calls during a 14 month study period, Each of the 699 collisions was classified in various ways. Here are the counts for each day of the week: Day: Sun Mon Tues Wed Thu Fri Sat Total Num Are the accidents equally likely to occur on any day of the week?

Example 2 Continued Step 1 –Population? –Parameter? –H0?–H0? –Ha?–Ha? Step 1 –Population – all accidents involving cell phones. –Parameter – proportion of accidents for each day of the week. –H 0 : Motor vehicle accidents involving cell phone use are equally likely to occur on each day of the week. –H a : The probabilities of a motor accident involving a cell phone use vary from day to day ( not all the same.)

Example 2 continued Step 2 Conditions –SRS? –Expected counts? –Independent? Step 2 –SRS Assume an SRS. –Expected counts are: Sun 699 x (1/7) = Mon 699 x (1/7) = Tue 699 x (1/7) = Wed 699 x (1/7) = Thu 699 x (1/7) = Fri 699 x (1/7) = Sat 699 x (1/7) = All expected counts are greater than 5. - The observed counts are independent.

Example 2 Continued Step 3 Calculations Use calculator. L1 = Observed counts L2 = Expected counts L3 – (O –E) 2 /E = (L1 – L2) 2 / L2 Sum (L3) Sum = χ 2 2 nd Distr χ 2 cdf( Lower bound, Upper bound, df)

Example 2 Continued Step 4 Interpretation –The P-value is extremely small. At α = 0.05 we would reject H 0. The accidents involving cell phones are not evenly distributed over the days of the week. –Additional analysis: Saturday and Sunday provided the biggest contribution to χ 2 statistic. There were less accidents involving cell phones over the weekends.