1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.

Slides:



Advertisements
Similar presentations
© 2011 Pearson Education, Inc
Advertisements

Categorical Data Analysis
CHI-SQUARE(X2) DISTRIBUTION
Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
1 1 Slide © 2003 South-Western /Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Uji Kebaikan Suai (Uji Kecocokan) Pertemuan 23
Inference about the Difference Between the
Statistical Inference for Frequency Data Chapter 16.
1 1 Slide Mátgæði Kafli 11 í Newbold Snjólfur Ólafsson + Slides Prepared by John Loucks © 1999 ITP/South-Western College Publishing.
© 2002 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 10 Tests for Two or More Samples with Categorical.
Discrete (Categorical) Data Analysis
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
1 Pertemuan 09 Pengujian Hipotesis Proporsi dan Data Katagorik Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Chapter 16 Chi Squared Tests.
1 Pertemuan 09 Pengujian Hipotesis 2 Matakuliah: I0272 – Statistik Probabilitas Tahun: 2005 Versi: Revisi.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Chapter 11a: Comparisons Involving Proportions and a Test of Independence Inference about the Difference between the Proportions of Two Populations Hypothesis.
1. State the null and alternative hypotheses. 2. Select a random sample and record observed frequency f i for the i th category ( k categories) Compute.
Goodness of Fit Test for Proportions of Multinomial Population Chi-square distribution Hypotheses test/Goodness of fit test.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
© 2004 Prentice-Hall, Inc.Chap 12-1 Basic Business Statistics (9 th Edition) Chapter 12 Tests for Two or More Samples with Categorical Data.
AM Recitation 2/10/11.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Copyright © 2009 Cengage Learning 15.1 Chapter 16 Chi-Squared Tests.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
© 2000 Prentice-Hall, Inc. Statistics The Chi-Square Test & The Analysis of Contingency Tables Chapter 13.
Introduction to Probability and Statistics Thirteenth Edition Chapter 13 Analysis of Categorical Data.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1/71 Statistics Tests of Goodness of Fit and Independence.
Chapter Outline Goodness of Fit test Test of Independence.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Statistics 300: Elementary Statistics Section 11-2.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 23 日 第六週:配適度與獨立性檢定.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 Pertemuan 24 Uji Kebaikan Suai Matakuliah: I0134 – Metoda Statistika Tahun: 2005 Versi: Revisi.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
1. State the null and alternative hypotheses. 2. Select a random sample and record observed frequency f i for the i th category ( k categories) Compute.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chi-Två Test Kapitel 6. Introduction Two statistical techniques are presented, to analyze nominal data. –A goodness-of-fit test for the multinomial experiment.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Section 10.1 Goodness of Fit © 2012 Pearson Education, Inc. All rights reserved. 1 of 91.
©2006 Thomson/South-Western 1 Chapter 12 – Analysis of Categorical Data Slides prepared by Jeff Heyl Lincoln University ©2006 Thomson/South-Western Concise.
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
St. Edward’s University
Chapter Fifteen McGraw-Hill/Irwin
John Loucks St. Edward’s University . SLIDES . BY.
Statistics for Business and Economics (13e)
Econ 3790: Business and Economics Statistics
Inference on Categorical Data
CHI SQUARE TEST OF INDEPENDENCE
Chapter Outline Goodness of Fit test Test of Independence.
St. Edward’s University
Presentation transcript:

1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population Such a population is a multinomial population. On each trial of a multinomial experiment: One and only one of the outcomes occurs Each trial is assumed to be independent The probabilities of the outcomes remain the same for each trial

2 Hypothesis (Goodness of Fit) Test for Proportions of a Multinomial Population 1. State the null and alternative hypotheses. H 0 : The population follows a multinomial distribution with specified probabilities distribution with specified probabilities for each of the k categories for each of the k categories H 0 : The population follows a multinomial distribution with specified probabilities distribution with specified probabilities for each of the k categories for each of the k categories H a : The population does not follow a multinomial distribution with specified multinomial distribution with specified probabilities for each of the k categories probabilities for each of the k categories H a : The population does not follow a multinomial distribution with specified multinomial distribution with specified probabilities for each of the k categories probabilities for each of the k categories 2. Select a random sample and record the observed frequency, f i, for each of the k categories. frequency, f i, for each of the k categories. 3. Assuming H 0 is true, compute the expected frequency, e i, in each category by multiplying the frequency, e i, in each category by multiplying the category probability by the sample size category probability by the sample size

3 4. Compute the value of the test statistic. Note: The test statistic has a chi-square distribution with k – 1 df provided that the expected frequencies are 5 or more for all categories. Note: The test statistic has a chi-square distribution with k – 1 df provided that the expected frequencies are 5 or more for all categories. f i = observed frequency for category i e i = expected frequency for category i k = number of categories where: Hypothesis Test for Proportions of a Multinomial Population

4 where  is the significance level and there are k - 1 degrees of freedom p -value approach: Critical value approach: Reject H 0 if p -value <  5. Rejection rule: Reject H 0 if Hypothesis Test for Proportions of a Multinomial Population

5 Multinomial Distribution Goodness of Fit Test n Example: Finger Lakes Homes (A) Finger Lakes Homes manufactures four models of prefabricated homes, a two-story colonial, a log cabin, a split-level, and an A-frame. To help in production planning, management would like to determine if previous customer purchases indicate that there is a preference in the style selected. The number of homes sold of each model for 100 The number of homes sold of each model for 100 sales over the past two years is shown below. Model Colonial Log Split-Level A-Frame # Sold

6 n Hypotheses Multinomial Distribution Goodness of Fit Test where: p C = population proportion that purchase a colonial p C = population proportion that purchase a colonial p L = population proportion that purchase a log cabin p L = population proportion that purchase a log cabin p S = population proportion that purchase a split-level p S = population proportion that purchase a split-level p A = population proportion that purchase an A-frame p A = population proportion that purchase an A-frame H 0 : p C = p L = p S = p A =.25 H a : The population proportions are not p C =.25, p L =.25, p S =.25, and p A =.25 p C =.25, p L =.25, p S =.25, and p A =.25

7 n Rejection Rule 22 2 Do Not Reject H 0 Reject H 0 Multinomial Distribution Goodness of Fit Test With  =.05 and k - 1 = = 3 k - 1 = = 3 degrees of freedom degrees of freedom if p -value Reject H 0 if p -value

8 n Expected Frequencies n Test Statistic Multinomial Distribution Goodness of Fit Test e 1 =.25(100) = 25 e 2 =.25(100) = 25 e 3 =.25(100) = 25 e 4 =.25(100) = 25 e 3 =.25(100) = 25 e 4 =.25(100) = 25 = = 10 n Conclusion Using the Critical Value Approach  2 = 10 > We reject, at the.05 level of significance, the assumption that there is no home style preference.

9 As personnel director, you want to test the perception of fairness of four methods of performance evaluation. Of 240 employees, 60 rated Method 1 as fair, 50 rated Method 2 as fair, 70 rated Method 3 as fair and 60 rated Method 4 as fair. At the.05 level of significance, is there a difference in perceptions?  2 Test for k Proportions Example

10 Price Colonial Log Split-Level A-Frame Price Colonial Log Split-Level A-Frame Each home sold by Finger Lakes Homes can be classified according to price and to style. Finger Lakes’ manager would like to determine if the price of the home and the style of the home are independent variables. The number of homes sold for each model and price for the past two years is shown below. For convenience, the price of the home is listed as either $99,000 or less or more than $99,000. > $99, < $99, Contingency Table (Independence) Test n Example: Finger Lakes Homes (B)

11 n Hypotheses Contingency Table (Independence) Test H 0 : Price of the home is independent of the style of the home that is purchased H a : Price of the home is not independent of the style of the home that is purchased 1. Set up the null and alternative hypotheses. 2. Select a random sample and record the observed frequency, f ij, for each cell of the contingency table. frequency, f ij, for each cell of the contingency table. 3. Compute the expected frequency, e ij, for each cell.

12 n Expected Frequencies Contingency Table (Independence) Test Price Colonial Log Split-Level A-Frame Total Price Colonial Log Split-Level A-Frame Total < $99K > $99K Total Total

13 Test of Independence: Contingency Tables 5. Determine the rejection rule. Reject H 0 if p -value <  or 4. Compute the test statistic. where  is the significance level and, with n rows and m columns, there are ( n - 1)( m - 1) degrees of freedom.

14 n Rejection Rule Contingency Table (Independence) Test With  =.05 and (2 - 1)(4 - 1) = 3 d.f., Reject H 0 if p -value = = n Test Statistic Conclusion Using the Critical Value Approach  2 = > 7.8 Conclusion Using the Critical Value Approach  2 = > 7.8 We reject, at the.05 level of significance, the assumption that the price of the home is independent of the style of home that is purchased

15 Caution about the  2 Test The  2 is one of the most widely applied statistical tools and also one of the most abused statistical tool. Be certain the experiment satisfies the assumptions. Be certain the sample is drawn from the correct population. Avoid using when the expected counts are very small.

16 © 2011 Pearson Education, Inc Caution About the  2 Test If the  2 value does not exceed the established critical value of  2, do not accept the hypothesis of independence. You risk a Type II error. Avoid concluding that two classifications are independent, even when  2 is small. If a contingency table  2 value does exceed the critical value, we must be careful to avoid inferring that a causal relationship exists between the classifications. The existence of a causal relationship cannot be established by a contingency table analysis.