Chi-Två Test Kapitel 6. Introduction Two statistical techniques are presented, to analyze nominal data. –A goodness-of-fit test for the multinomial experiment.

Slides:



Advertisements
Similar presentations
CHI-SQUARE(X2) DISTRIBUTION
Advertisements

15- 1 Chapter Fifteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
1 1 Slide © 2003 South-Western /Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 Chapter 12 Inference About One Population Introduction In this chapter we utilize the approach developed before to describe a population.In.
Inference about the Difference Between the
Chapter 10 Chi-Square Tests and the F- Distribution 1 Larson/Farber 4th ed.
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Test of Independence.
Chapter Goals After completing this chapter, you should be able to:
Chapter 16 Chi Squared Tests.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
11-2 Goodness-of-Fit In this section, we consider sample data consisting of observed frequency counts arranged in a single row or column (called a one-way.
BCOR 1020 Business Statistics
Chapter 11a: Comparisons Involving Proportions and a Test of Independence Inference about the Difference between the Proportions of Two Populations Hypothesis.
Chi-Square Tests and the F-Distribution
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.7.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Section 10.1 Goodness of Fit. Section 10.1 Objectives Use the chi-square distribution to test whether a frequency distribution fits a claimed distribution.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Copyright © 2009 Cengage Learning 15.1 Chapter 16 Chi-Squared Tests.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
Chapter 10 Chi-Square Tests and the F-Distribution
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
Hypothesis and Hypothesis Testing
GOODNESS OF FIT Larson/Farber 4th ed 1 Section 10.1.
Other Chi-Square Tests
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Copyright © 2010 Pearson Education, Inc. Slide
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1/71 Statistics Tests of Goodness of Fit and Independence.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter Outline Goodness of Fit test Test of Independence.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
1 Chi-square Test Dr. T. T. Kachwala. Using the Chi-Square Test 2 The following are the two Applications: 1. Chi square as a test of Independence 2.Chi.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Section 10.1 Goodness of Fit © 2012 Pearson Education, Inc. All rights reserved. 1 of 91.
Keller: Stats for Mgmt & Econ, 7th Ed Chi-Squared Tests
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Chapter 11 Chi-Square Tests.
Chapter Fifteen McGraw-Hill/Irwin
Chapter 11 Goodness-of-Fit and Contingency Tables
Econ 3790: Business and Economics Statistics
Chapter 11 Chi-Square Tests.
Inference on Categorical Data
Copyright © Cengage Learning. All rights reserved.
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11 Chi-Square Tests.
Presentation transcript:

Chi-Två Test Kapitel 6

Introduction Two statistical techniques are presented, to analyze nominal data. –A goodness-of-fit test for the multinomial experiment. –A contingency table test of independence. Both tests use the  2 as the sampling distribution of the test statistic.

Chi-squared Goodness-of-Fit Test We test whether there is sufficient evidence to reject a pre-specified set of values for p i. The hypothesis: The test builds on comparing actual frequency and the expected frequency of occurrences in all the cells.

Example –Two competing companies A and B have enjoy dominant position in the market. The companies conducted aggressive advertising campaigns. –Market shares before the campaigns were: Company A = 45% Company B = 40% Other competitors = 15%. The multinomial goodness of fit test - Example

Example 16.1 – continued –To study the effect of the campaign on the market shares, a survey was conducted. The multinomial goodness of fit test - Example –200 customers were asked to indicate their preference regarding the product advertised. –Survey results: 102 customers preferred the company A’s product, 82 customers preferred the company B’s product, 16 customers preferred the competitors product.

The multinomial goodness of fit test - Example Example – continued Can we conclude at 5% significance level that the market shares were affected by the advertising campaigns?

Solution –The population investigated is the brand preferences. –The data are nominal (A, B, or other) –This is a multinomial experiment (three categories). –The question of interest: Are p 1, p 2, and p 3 different after the campaign from their values before the campaign? The multinomial goodness of fit test - Example

The hypotheses are: H 0 : p 1 =.45, p 2 =.40, p 3 =.15 H 1 : At least one p i changed. The expected frequency for each category (cell) if the null hypothesis is true is shown below: 90 = 200(.45) 30 = 200(.15) What actual frequencies did the sample return? The multinomial goodness of fit test - Example 80 = 200(.40)

The statistic is The rejection region is The multinomial goodness of fit test - Example

Example 16.1 – continued

The multinomial goodness of fit test - Example Example – continued Conclusion: Since 8.18 > 5.99, there is sufficient evidence at 5% significance level to reject the null hypothesis. At least one of the probabilities p i is different. Thus, at least two market shares have changed. P valueAlpha Rejection region  2 with 2 degrees of freedom

Required conditions – the rule of five The test statistic used to perform the test is only approximately Chi-squared distributed. For the approximation to apply, the expected cell frequency has to be at least 5 for all the cells (np i  5). If the expected frequency in a cell is less than 5, combine it with other cells.

Chi-squared Test of a Contingency Table This test is used to test whether… –two nominal variables are related? –there are differences between two or more populations of a nominal variable To accomplish the test objectives, we need to classify the data according to two different criteria.

Contingency table  2 test – Example Example –In an effort to better predict the demand for courses offered by a certain MBA program, it was hypothesized that students’ academic background affect their choice of MBA major, thus, their courses selection. –A random sample of last year’s MBA students was selected. The following contingency table summarizes relevant data.

Contingency table  2 test – Example There are two ways to address the problem If each undergraduate degree is considered a population, do these populations differ? If each classification is considered a nominal variable, are these two variables dependent? The observed values

Solution –The hypotheses are: H 0 : The two variables are independent H 1 : The two variables are dependent k is the number of cells in the contingency table. –The test statistic – The rejection region Contingency table  2 test – Example Since e i = np i but p i is unknown, we need to estimate the unknown probability from the data, assuming H 0 is true.

Under the null hypothesis the two variables are independent: P(Accounting and BA) = P(Accounting)*P(BA) UndergraduateMBA Major DegreeAccountingFinanceMarketingProbability BA 6060/152 BENG 3131/152 BBA 3939/152 Other 2222/ Probability61/15244/15247/152 The number of students expected to fall in the cell “Accounting - BA” is e Acct-BA = n(p Acct-BA ) = 152(61/152)(60/152) = [61*60]/152 = = [61/152][60/152] The number of students expected to fall in the cell “Finance - BBA” is e Finance-BBA = np Finance-BBA = 152(44/152)(39/152) = [44*39]/152 = Estimating the expected frequencies

The expected frequencies for a contingency table e ij = (Column j total)(Row i total) Sample size The expected frequency of cell of raw i and column j in the contingency table is calculated by

UndergraduateMBA Major DegreeAccountingFinanceMarketing BA 31 (24.08)13 (17.37)16 (18.55)60 BENG 8 (12.44)16 (8.97) 7 (9.58)31 BBA 12 (15.65)10 (11.29)17 (12.06)39 Other 10 (8.83) 5 (6.39) 7 (6.80) The expected frequency ( ) … ( ) … ( ) 2=2= = Calculation of the  2 statistic Solution – continued

Contingency table  2 test – Example Conclusion: Since  2 = > , there is sufficient evidence to infer at 5% significance level that students’ undergraduate degree and MBA students courses selection are dependent. Solution – continued – The critical value in our example is:

Code : Undergraduate degree 1 = BA 2 = BENG 3 = BBA 4 = OTHERS MBA Major 1 = ACCOUNTING 2 = FINANCE 3 = MARKETING Select the Chi squared / raw data Option from Data Analysis Plus under tools. See Xm16-02Xm16-02 Define a code to specify each nominal value. Input the data in columns one column for each category. Using the computer

Required condition Rule of five –The  2 distribution provides an adequate approximation to the sampling distribution under the condition that e ij >= 5 for all the cells. –When e ij < 5 rows or columns must be added such that the condition is met. 4 (5.1) 7 (6.3) 4 (3.6) 18 (17.9) 23 (22.3) 12 (12.8) Example We combine column 2 and 3