Section 10.1 Goodness of Fit. Section 10.1 Objectives Use the chi-square distribution to test whether a frequency distribution fits a claimed distribution.

Slides:



Advertisements
Similar presentations
Testing the Difference Between Means (Large Independent Samples)
Advertisements

Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Chapter 11 Inference for Distributions of Categorical Data
Chapter 10 Chi-Square Tests and the F- Distribution 1 Larson/Farber 4th ed.
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Goodness-of-Fit Test.
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 11 Chi-Square Procedures 11.1 Chi-Square Goodness of Fit.
Introduction to Chi-Square Procedures March 11, 2010.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Hypothesis Testing for Variance and Standard Deviation
BCOR 1020 Business Statistics
Chi-Square Tests and the F-Distribution
Chi-square Goodness of Fit Test
Copyright © Cengage Learning. All rights reserved. 11 Applications of Chi-Square.
Chapter 13: Inference for Tables – Chi-Square Procedures
Hypothesis Testing for the Mean (Small Samples)
SECTION 6.4 Confidence Intervals for Variance and Standard Deviation Larson/Farber 4th ed 1.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
Hypothesis Testing for Proportions 1 Section 7.4.
Section 10.3 Comparing Two Variances Larson/Farber 4th ed1.
13.1 Goodness of Fit Test AP Statistics. Chi-Square Distributions The chi-square distributions are a family of distributions that take on only positive.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Chapter 11: Inference for Distributions of Categorical Data.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. 1.. Section 11-2 Goodness of Fit.
10.1: Multinomial Experiments Multinomial experiment A probability experiment consisting of a fixed number of trials in which there are more than two possible.
Chapter 10 Chi-Square Tests and the F-Distribution
Section 7.4 Hypothesis Testing for Proportions Larson/Farber 4th ed.
Chapter Chi-Square Tests and the F-Distribution 1 of © 2012 Pearson Education, Inc. All rights reserved.
Chapter 11 Inference for Tables: Chi-Square Procedures 11.1 Target Goal:I can compute expected counts, conditional distributions, and contributions to.
Comparing Two Variances
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chapter 11: Inference for Distributions of Categorical Data Section 11.1 Chi-Square Goodness-of-Fit Tests.
Chapter 10 Chi-Square Tests and the F- Distribution 1 Larson/Farber 4th ed.
Chi-Square Test.
Chapter 10 Chi-Square Tests and the F-Distribution
Section 8.3 Testing the Difference Between Means (Dependent Samples)
GOODNESS OF FIT Larson/Farber 4th ed 1 Section 10.1.
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Statistics 300: Elementary Statistics Section 11-2.
Chi-Square Test (χ 2 ) χ – greek symbol “chi”. Chi-Square Test (χ 2 ) When is the Chi-Square Test used? The chi-square test is used to determine whether.
381 Goodness of Fit Tests QSCI 381 – Lecture 40 (Larson and Farber, Sect 10.1)
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
11.1 Chi-Square Tests for Goodness of Fit Objectives SWBAT: STATE appropriate hypotheses and COMPUTE expected counts for a chi- square test for goodness.
Section 7.4 Hypothesis Testing for Proportions © 2012 Pearson Education, Inc. All rights reserved. 1 of 101.
Section 7.4 Hypothesis Testing for Proportions © 2012 Pearson Education, Inc. All rights reserved. 1 of 14.
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Goodness-of-Fit and Contingency Tables Chapter 11.
Chapter 10 Chi-Square Tests and the F-Distribution.
Section 10.1 Goodness of Fit © 2012 Pearson Education, Inc. All rights reserved. 1 of 91.
Check your understanding: p. 684
Chapter 10 Chi-Square Tests and the F-Distribution
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Chapter 7 Hypothesis Testing with One Sample.
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Chapter 7 Hypothesis Testing with One Sample.
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Chapter 8 Hypothesis Testing with Two Samples.
Chapter 10 Chi-Square Tests and the F-Distribution
Section 10-1 – Goodness of Fit
Elementary Statistics: Picturing The World
Chapter 7 Hypothesis Testing with One Sample.
Hypothesis Testing for Proportions
Presentation transcript:

Section 10.1 Goodness of Fit

Section 10.1 Objectives Use the chi-square distribution to test whether a frequency distribution fits a claimed distribution “chi” - kie (rhymes with “eye”) [see page 330] - a distribution - area under curve = 1 - all values of χ 2 are greater than or equal to zero - positively skewed - df = n -1

Multinomial Experiments Multinomial experiment A probability experiment consisting of a fixed number of independent trials in which there are more than two possible outcomes for each trial. The probability for each outcome is fixed and each outcome is classified into categories. Recall that a binomial experiment had only two possible outcomes.

Multinomial Experiments Books on a shelf Fiction60% Non-fiction30% Reference10% 3 categories – type of books If distribution of books is “true” and one had a library of 50 books, how many books of fiction would there be? 60% of 50 is 30 (50*.60= 30) Non-fiction? 50*.30 = 15 Reference? 50*.10 = 5

Multinomial Experiments 3 categories – type of books If distribution of books is “true” and one had a library of 50 books, how many books of fiction would there be? 60% of 50 is 30 (50*.60= 30) Non-fiction? 50*.30 = 15 Reference? 50*.10 = 5 These are Expected Values (given a sample of size n and assuming a given distribution is truth) Expected value = number in sample size * assumed probability E = n*p

Multinomial Experiments A tax preparation company wants to determine the proportions of people who used different methods to prepare their taxes. The company can perform a multinomial experiment. It wants to test a previous survey’s claim concerning the distribution of proportions of people who use different methods to prepare their taxes. It can compare the distribution of proportions obtained in the multinomial experiment with the previous survey’s specified distribution. It can perform a chi-square goodness-of-fit test.

Chi-Square Goodness-of-Fit Test Used to test whether a frequency distribution fits an expected distribution. The null hypothesis states that the frequency distribution fits the specified distribution. The alternative hypothesis states that the frequency distribution does not fit the specified distribution.

Multinomial Experiments Results of a survey of tax preparation methods. Distribution of tax preparation methods Accountant25% By hand20% Computer software35% Friend/family5% Tax preparation service15% Each outcome is classified into categories. The probability for each possible outcome is fixed.

Chi-Square Goodness-of-Fit Test To test the previous survey’s claim, a company can perform a chi-square goodness-of-fit test using the following hypotheses. H 0 : The distribution of tax preparation methods is 25% by accountant, 20% by hand, 35% by computer software, 5% by friend or family, and 15 % by tax preparation service. (claim) H a : The distribution of tax preparation methods differs from the claimed or expected distribution.

Chi-Square Goodness-of-Fit Test To calculate the test statistic for the chi-square goodness-of-fit test, the observed frequencies and the expected frequencies are used. The observed frequency O of a category is the frequency for the category observed in the sample data.

Chi-Square Goodness-of-Fit Test The expected frequency E of a category is the calculated frequency for the category.  Expected frequencies are obtained assuming the specified (or hypothesized) distribution. The expected frequency for the i th category is E i = np i where n is the number of trials (the sample size) and p i is the assumed probability of the i th category.

Example: Finding Observed and Expected Frequencies A tax preparation company randomly selects 300 adults and asks them how they prepare their taxes. The results are shown at the right. Find the observed frequency and the expected frequency for each tax preparation method. Survey results (n = 300) Accountant71 By hand40 Computer software 101 Friend/family35 Tax preparation service 53

Solution: Finding Observed and Expected Frequencies Observed frequency: The number of adults in the survey naming a particular tax preparation method Survey results (n = 300) Accountant71 By hand40 Computer software 101 Friend/family35 Tax preparation service 53 observed frequency

Solution: Finding Observed and Expected Frequencies Expected Frequency: E i = np i Tax preparation method % of people Observed frequency Expected frequency Accountant 25%71300(0.25) = 75 By hand20%40300(0.20) = 60 Computer Software35%101300(0.35) = 105 Friend/family 5%35300(0.05) = 15 Tax preparation service 15%53300(0.15) = 45 n = 300

Chi-Square Goodness-of-Fit Test For the chi-square goodness-of-fit test to be used, the following must be true. 1.The observed frequencies must be obtained by using a random sample. 2.Each expected frequency must be greater than or equal to 5.

Chi-Square Goodness-of-Fit Test If these conditions are satisfied, then the sampling distribution for the goodness-of-fit test is approximated by a chi-square distribution with k – 1 degrees of freedom, where k is the number of categories. The test statistic for the chi-square goodness-of-fit test is where O represents the observed frequency of each category and E represents the expected frequency of each category. The test is always a right-tailed test.

Chi - Square Goodness - of - Fit Test 1.Identify the claim. State the null and alternative hypotheses. 2.Specify the level of significance. 3.Identify the degrees of freedom. 4.Determine the critical value. State H 0 and H a. Identify α. Use Table 6 in Appendix B. d.f. = k – 1 In WordsIn Symbols

Chi - Square Goodness - of - Fit Test If χ 2 is in the rejection region, reject H 0. Otherwise, fail to reject H 0. 5.Determine the rejection region. 6.Calculate the test statistic. 7.Make a decision to reject or fail to reject the null hypothesis. 8.Interpret the decision in the context of the original claim. In WordsIn Symbols

Example: Performing a Goodness of Fit Test Use the tax preparation method data to perform a chi- square goodness-of-fit test to test whether the distributions are different. Use α = Survey results (n = 300) Accountant71 By hand40 Computer software 101 Friend/family35 Tax preparation service 53 Distribution of tax preparation methods Accountant25% By hand20% Computer software 35% Friend/family5% Tax preparation service 15%

Solution: Performing a Goodness of Fit Test H 0 : H a : α = d.f. = Rejection Region Test Statistic: Decision: Conclusion: – 1 = 4 The distribution is 25% by accountant, 20% by hand, 35% by computer software, 5% by friend/ family, and 15% by tax preparation service. (Claim) The distribution of tax preparation methods differs from the claimed or expected distribution.

Solution: Performing a Goodness of Fit Test methodObserv ed (O) Expe cted (E) O-E(O-E) 2 Accountant By hand Computer hardware Friend/famil y Tax prep service So compute “chi-square: = =

Solution: Performing a Goodness of Fit Test H 0 : H a : α = d.f. = Rejection Region Test Statistic: Decision: – 1 = 4 The distribution is 25% by accountant, 20% by hand, 35% by computer software, 5% by friend/ family, and 15% by tax preparation service. (Claim) The distribution of tax preparation methods differs from the claimed or expected distribution. χ 2 ≈ There is enough evidence at the 1% significance level to conclude that the distribution of tax preparation methods differs from the previous survey’s claimed or expected distribution. Reject H 0

Example: Performing a Goodness of Fit Test A researcher claims that the number of different-colored candies in bags of dark chocolate M&M’s is uniformly distributed. To test this claim, you randomly select a bag that contains 500 dark chocolate M&M’s. The results are shown in the table on the next slide. Using α = 0.10, perform a chi-square goodness-of-fit test to test the claimed or expected distribution. What can you conclude?

Example: Performing a Goodness of Fit Test ColorFrequency Brown80 Yellow95 Red88 Blue83 Orange76 Green78 Solution: The claim is that the distribution is uniform, so the expected frequencies of the colors are equal. To find each expected frequency, divide the sample size by the number of colors. E = 500/6 ≈ 83.3 n = 500

Solution: Performing a Goodness of Fit Test H 0 : H a : α = d.f. = Rejection Region Test Statistic: Decision: Conclusion: – 1 = χ2χ Distribution of different-colored candies in bags of dark chocolate M&M’s is uniform. (Claim) Distribution of different-colored candies in bags of dark chocolate M&M’s is not uniform.

Solution: Performing a Goodness of Fit Test Color Observed frequency Expected frequency O – E(O-E) 2 Brown Yello w Red Blue Orang e Green = = 3.016

Solution: Performing a Goodness of Fit Test H 0 : H a : α = d.f. = Rejection Region Test Statistic: Decision: – 1 = χ2χ χ 2 ≈ There is not enough evidence at the 10% level of significance to reject the claim that the distribution is uniform. Distribution of different-colored candies in bags of dark chocolate M&M’s is uniform. (Claim) Distribution of different-colored candies in bags of dark chocolate M&M’s is not uniform. Fail to Reject H 0

Section 10.1 Summary Used the chi-square distribution to test whether a frequency distribution fits a claimed distribution