1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.

Slides:



Advertisements
Similar presentations
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Advertisements

Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
1 1 Slide © 2003 South-Western /Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Uji Kebaikan Suai (Uji Kecocokan) Pertemuan 23
Inference about the Difference Between the
Statistical Inference for Frequency Data Chapter 16.
1 1 Slide Mátgæði Kafli 11 í Newbold Snjólfur Ólafsson + Slides Prepared by John Loucks © 1999 ITP/South-Western College Publishing.
© 2002 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 10 Tests for Two or More Samples with Categorical.
Discrete (Categorical) Data Analysis
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
Chapter Goals After completing this chapter, you should be able to:
1 Pertemuan 09 Pengujian Hipotesis Proporsi dan Data Katagorik Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Chapter 16 Chi Squared Tests.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
1 Pertemuan 09 Pengujian Hipotesis 2 Matakuliah: I0272 – Statistik Probabilitas Tahun: 2005 Versi: Revisi.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
BCOR 1020 Business Statistics
Chapter 11a: Comparisons Involving Proportions and a Test of Independence Inference about the Difference between the Proportions of Two Populations Hypothesis.
1. State the null and alternative hypotheses. 2. Select a random sample and record observed frequency f i for the i th category ( k categories) Compute.
Chi-Square Tests and the F-Distribution
Goodness of Fit Test for Proportions of Multinomial Population Chi-square distribution Hypotheses test/Goodness of fit test.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
QMS 6351 Statistics and Research Methods Regression Analysis: Testing for Significance Chapter 14 ( ) Chapter 15 (15.5) Prof. Vera Adamchik.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chi-squared Tests. We want to test the “goodness of fit” of a particular theoretical distribution to an observed distribution. The procedure is: 1. Set.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
© 2000 Prentice-Hall, Inc. Statistics The Chi-Square Test & The Analysis of Contingency Tables Chapter 13.
Introduction to Probability and Statistics Thirteenth Edition Chapter 13 Analysis of Categorical Data.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1/71 Statistics Tests of Goodness of Fit and Independence.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter Outline Goodness of Fit test Test of Independence.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 23 日 第六週:配適度與獨立性檢定.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 Pertemuan 24 Uji Kebaikan Suai Matakuliah: I0134 – Metoda Statistika Tahun: 2005 Versi: Revisi.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
1. State the null and alternative hypotheses. 2. Select a random sample and record observed frequency f i for the i th category ( k categories) Compute.
1 1 Slide © 2011 Cengage Learning Assumptions About the Error Term  1. The error  is a random variable with mean of zero. 2. The variance of , denoted.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chi-Två Test Kapitel 6. Introduction Two statistical techniques are presented, to analyze nominal data. –A goodness-of-fit test for the multinomial experiment.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
St. Edward’s University
John Loucks St. Edward’s University . SLIDES . BY.
Statistics for Business and Economics (13e)
Chapter 11 Inferences About Population Variances
Econ 3790: Business and Economics Statistics
CHI SQUARE TEST OF INDEPENDENCE
Analyzing the Association Between Categorical Variables
Hypothesis Tests for a Standard Deviation
Chapter Outline Goodness of Fit test Test of Independence.
St. Edward’s University
Presentation transcript:

1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach

2 2 Slide IS 310 – Business Statistics Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of Fit Test: Poisson Goodness of Fit Test: Poisson and Normal Distributions and Normal Distributions Test of Independence Test of Independence

3 3 Slide IS 310 – Business Statistics Hypothesis (Goodness of Fit) Test for Proportions of a Multinomial Population 1. Set up the null and alternative hypotheses. 2. Select a random sample and record the observed frequency, f i, for each of the k categories. frequency, f i, for each of the k categories. 3. Assuming H 0 is true, compute the expected frequency, e i, in each category by multiplying the frequency, e i, in each category by multiplying the category probability by the sample size. category probability by the sample size.

4 4 Slide IS 310 – Business Statistics Hypothesis (Goodness of Fit) Test for Proportions of a Multinomial Population 4. Compute the value of the test statistic. Note: The test statistic has a chi-square distribution with k – 1 df provided that the expected frequencies are 5 or more for all categories. Note: The test statistic has a chi-square distribution with k – 1 df provided that the expected frequencies are 5 or more for all categories. f i = observed frequency for category i e i = expected frequency for category i k = number of categories where:

5 5 Slide IS 310 – Business Statistics Hypothesis (Goodness of Fit) Test for Proportions of a Multinomial Population where  is the significance level and there are k - 1 degrees of freedom p -value approach: Critical value approach: Reject H 0 if p -value <  5. Rejection rule: Reject H 0 if

6 6 Slide IS 310 – Business Statistics Multinomial Distribution Goodness of Fit Test n Example: Finger Lakes Homes (A) Finger Lakes Homes manufactures Finger Lakes Homes manufactures four models of prefabricated homes, four models of prefabricated homes, a two-story colonial, a log cabin, a a two-story colonial, a log cabin, a split-level, and an A-frame. To help split-level, and an A-frame. To help in production planning, management in production planning, management would like to determine if previous would like to determine if previous customer purchases indicate that there customer purchases indicate that there is a preference in the style selected. is a preference in the style selected.

7 7 Slide IS 310 – Business Statistics Split- A- Split- A- Model Colonial Log Level Frame # Sold The number of homes sold of each The number of homes sold of each model for 100 sales over the past two years is shown below. Multinomial Distribution Goodness of Fit Test n Example: Finger Lakes Homes (A)

8 8 Slide IS 310 – Business Statistics n Hypotheses Multinomial Distribution Goodness of Fit Test where: p C = population proportion that purchase a colonial p C = population proportion that purchase a colonial p L = population proportion that purchase a log cabin p L = population proportion that purchase a log cabin p S = population proportion that purchase a split-level p S = population proportion that purchase a split-level p A = population proportion that purchase an A-frame p A = population proportion that purchase an A-frame H 0 : p C = p L = p S = p A =.25 H a : The population proportions are not p C =.25, p L =.25, p S =.25, and p A =.25 p C =.25, p L =.25, p S =.25, and p A =.25

9 9 Slide IS 310 – Business Statistics n Rejection Rule 22 2 Do Not Reject H 0 Reject H 0 Multinomial Distribution Goodness of Fit Test With  =.05 and k - 1 = = 3 k - 1 = = 3 degrees of freedom degrees of freedom if p -value Reject H 0 if p -value

10 Slide IS 310 – Business Statistics n Expected Frequencies n Test Statistic Multinomial Distribution Goodness of Fit Test e 1 =.25(100) = 25 e 2 =.25(100) = 25 e 3 =.25(100) = 25 e 4 =.25(100) = 25 e 3 =.25(100) = 25 e 4 =.25(100) = 25 = = 10

11 Slide IS 310 – Business Statistics Multinomial Distribution Goodness of Fit Test n Conclusion Using the p -Value Approach The p -value < . We can reject the null hypothesis. The p -value < . We can reject the null hypothesis. Because  2 = 10 is between and , the Because  2 = 10 is between and , the area in the upper tail of the distribution is between area in the upper tail of the distribution is between.025 and and.01. Area in Upper Tail  2 Value (df = 3) Note: A precise p -value can be found using Note: A precise p -value can be found using Minitab or Excel. Minitab or Excel. Note: A precise p -value can be found using Note: A precise p -value can be found using Minitab or Excel. Minitab or Excel.

12 Slide IS 310 – Business Statistics n Conclusion Using the Critical Value Approach Multinomial Distribution Goodness of Fit Test We reject, at the.05 level of significance, We reject, at the.05 level of significance, the assumption that there is no home style preference.  2 = 10 > 7.815

13 Slide IS 310 – Business Statistics Test of Independence: Contingency Tables 1. Set up the null and alternative hypotheses. 2. Select a random sample and record the observed frequency, f ij, for each cell of the contingency table. frequency, f ij, for each cell of the contingency table. 3. Compute the expected frequency, e ij, for each cell.

14 Slide IS 310 – Business Statistics Test of Independence: Contingency Tables 5. Determine the rejection rule. Reject H 0 if p -value <  or. 4. Compute the test statistic. where  is the significance level and, with n rows and m columns, there are ( n - 1)( m - 1) degrees of freedom.

15 Slide IS 310 – Business Statistics Each home sold by Finger Lakes Each home sold by Finger Lakes Homes can be classified according to price and to style. Finger Lakes’ manager would like to determine if the price of the home and the style of the home are independent variables. Contingency Table (Independence) Test n Example: Finger Lakes Homes (B)

16 Slide IS 310 – Business Statistics Price Colonial Log Split-Level A-Frame Price Colonial Log Split-Level A-Frame The number of homes sold for The number of homes sold for each model and price for the past two years is shown below. For convenience, the price of the home is listed as either $99,000 or less or more than $99,000. > $99, < $99, Contingency Table (Independence) Test n Example: Finger Lakes Homes (B)

17 Slide IS 310 – Business Statistics n Hypotheses Contingency Table (Independence) Test H 0 : Price of the home is independent of the style of the home that is purchased style of the home that is purchased H a : Price of the home is not independent of the style of the home that is purchased style of the home that is purchased

18 Slide IS 310 – Business Statistics n Expected Frequencies Contingency Table (Independence) Test Price Colonial Log Split-Level A-Frame Total Price Colonial Log Split-Level A-Frame Total < $99K > $99K Total Total

19 Slide IS 310 – Business Statistics n Rejection Rule Contingency Table (Independence) Test With  =.05 and (2 - 1)(4 - 1) = 3 d.f., Reject H 0 if p -value = = n Test Statistic

20 Slide IS 310 – Business Statistics n Conclusion Using the p -Value Approach The p -value < . We can reject the null hypothesis. The p -value < . We can reject the null hypothesis. Because  2 = is between and 9.348, the Because  2 = is between and 9.348, the area in the upper tail of the distribution is between area in the upper tail of the distribution is between.05 and and.025. Area in Upper Tail  2 Value (df = 3) Contingency Table (Independence) Test Note: A precise p -value can be found using Note: A precise p -value can be found using Minitab or Excel. Minitab or Excel. Note: A precise p -value can be found using Note: A precise p -value can be found using Minitab or Excel. Minitab or Excel.

21 Slide IS 310 – Business Statistics n Conclusion Using the Critical Value Approach Contingency Table (Independence) Test We reject, at the.05 level of significance, We reject, at the.05 level of significance, the assumption that the price of the home is independent of the style of home that is purchased.  2 = > 7.815

22 Slide IS 310 – Business Statistics Goodness of Fit Test: Poisson Distribution 1. Set up the null and alternative hypotheses. H 0 : Population has a Poisson probability distribution H 0 : Population has a Poisson probability distribution H a : Population does not have a Poisson distribution H a : Population does not have a Poisson distribution 3. Compute the expected frequency of occurrences e i for each value of the Poisson random variable. for each value of the Poisson random variable. 2. Select a random sample and a. Record the observed frequency f i for each value of a. Record the observed frequency f i for each value of the Poisson random variable. the Poisson random variable. b. Compute the mean number of occurrences . b. Compute the mean number of occurrences .

23 Slide IS 310 – Business Statistics Goodness of Fit Test: Poisson Distribution 4. Compute the value of the test statistic. f i = observed frequency for category i e i = expected frequency for category i k = number of categories where:

24 Slide IS 310 – Business Statistics where  is the significance level and there are k - 2 degrees of freedom there are k - 2 degrees of freedom p -value approach: Critical value approach: Reject H 0 if p -value <  5. Rejection rule: Reject H 0 if Goodness of Fit Test: Poisson Distribution

25 Slide IS 310 – Business Statistics n Example: Troy Parking Garage In studying the need for an In studying the need for an additional entrance to a city parking garage, a consultant has recommended an analysis approach that is applicable only in situations where the number of cars entering during a specified time period follows a Poisson distribution. Goodness of Fit Test: Poisson Distribution

26 Slide IS 310 – Business Statistics A random sample of 100 one- A random sample of 100 one- minute time intervals resulted in the customer arrivals listed below. A statistical test must be conducted to see if the assumption of a Poisson distribution is reasonable. Goodness of Fit Test: Poisson Distribution n Example: Troy Parking Garage # Arrivals Frequency

27 Slide IS 310 – Business Statistics n Hypotheses Goodness of Fit Test: Poisson Distribution H a : Number of cars entering the garage during a one-minute interval is not Poisson distributed one-minute interval is not Poisson distributed H 0 : Number of cars entering the garage during a one-minute interval is Poisson distributed a one-minute interval is Poisson distributed

28 Slide IS 310 – Business Statistics n Estimate of Poisson Probability Function Goodness of Fit Test: Poisson Distribution  otal Arrivals = 0(0) + 1(1) + 2(4) (1) = 600 Hence, Estimate of  = 600/100 = 6 Total Time Periods = 100

29 Slide IS 310 – Business Statistics n Expected Frequencies Goodness of Fit Test: Poisson Distribution x f ( x ) nf ( x ) Total x f ( x ) nf ( x )

30 Slide IS 310 – Business Statistics n Observed and Expected Frequencies Goodness of Fit Test: Poisson Distribution i f i e i f i - e i i f i e i f i - e i or 1 or or more

31 Slide IS 310 – Business Statistics n Test Statistic Goodness of Fit Test: Poisson Distribution With  =.05 and k - p - 1 = = 7 d.f. With  =.05 and k - p - 1 = = 7 d.f. (where k = number of categories and p = number of population parameters estimated), Reject H 0 if p -value n Rejection Rule

32 Slide IS 310 – Business Statistics n Conclusion Using the p -Value Approach The p -value > . We cannot reject the null hypothesis. There is no reason to doubt the assumption of a Poisson distribution. The p -value > . We cannot reject the null hypothesis. There is no reason to doubt the assumption of a Poisson distribution. Because  2 = is between and in the Chi-Square Distribution Table, the area in the upper tail Because  2 = is between and in the Chi-Square Distribution Table, the area in the upper tail of the distribution is between.90 and.10. Area in Upper Tail  2 Value (df = 7) Goodness of Fit Test: Poisson Distribution Note: A precise p -value can be found Note: A precise p -value can be found using Minitab or Excel. using Minitab or Excel. Note: A precise p -value can be found Note: A precise p -value can be found using Minitab or Excel. using Minitab or Excel.

33 Slide IS 310 – Business Statistics Goodness of Fit Test: Normal Distribution 1. Set up the null and alternative hypotheses. 3. Compute the expected frequency, e i, for each interval. 2. Select a random sample and a. Compute the mean and standard deviation. a. Compute the mean and standard deviation. b. Define intervals of values so that the expected b. Define intervals of values so that the expected frequency is at least 5 for each interval. frequency is at least 5 for each interval. c. For each interval record the observed frequencies c. For each interval record the observed frequencies

34 Slide IS 310 – Business Statistics 4. Compute the value of the test statistic. Goodness of Fit Test: Normal Distribution 5. Reject H 0 if (where  is the significance level and there are k - 3 degrees of freedom). and there are k - 3 degrees of freedom).

35 Slide IS 310 – Business Statistics Normal Distribution Goodness of Fit Test n Example: IQ Computers IQIQ IQ Computers (one better than HP?) IQ Computers (one better than HP?) manufactures and sells a general purpose microcomputer. As part of a study to evaluate sales personnel, management wants to determine, at a.05 significance level, if the annual sales volume (number of units sold by a salesperson) follows a normal probability distribution.

36 Slide IS 310 – Business Statistics A simple random sample of 30 of the salespeople was taken and their numbers of units sold are below. Normal Distribution Goodness of Fit Test n Example: IQ Computers (mean = 71, standard deviation = 18.54) IQIQ

37 Slide IS 310 – Business Statistics n Hypotheses Normal Distribution Goodness of Fit Test H a : The population of number of units sold does not have a normal distribution with does not have a normal distribution with mean 71 and standard deviation mean 71 and standard deviation H 0 : The population of number of units sold has a normal distribution with mean 71 has a normal distribution with mean 71 and standard deviation and standard deviation

38 Slide IS 310 – Business Statistics n Interval Definition Normal Distribution Goodness of Fit Test To satisfy the requirement of an expected To satisfy the requirement of an expected frequency of at least 5 in each interval we will divide the normal distribution into 30/5 = 6 equal probability intervals.

39 Slide IS 310 – Business Statistics n Interval Definition Areas = 1.00/6 =.1667 Areas = 1.00/6 = .43(18.54) = = (18.54) Normal Distribution Goodness of Fit Test

40 Slide IS 310 – Business Statistics n Observed and Expected Frequencies Normal Distribution Goodness of Fit Test Less than to to to to to to to to More than i f i e i f i - e i i f i e i f i - e i Total

41 Slide IS 310 – Business Statistics n Test Statistic With  =.05 and k - p - 1 = = 3 d.f. With  =.05 and k - p - 1 = = 3 d.f. (where k = number of categories and p = number of population parameters estimated), Reject H 0 if p -value n Rejection Rule Normal Distribution Goodness of Fit Test

42 Slide IS 310 – Business Statistics Normal Distribution Goodness of Fit Test n Conclusion Using the p -Value Approach The p -value > . We cannot reject the null hypothesis. There is little evidence to support rejecting the assumption the population is normally distributed with  = 71 and  = The p -value > . We cannot reject the null hypothesis. There is little evidence to support rejecting the assumption the population is normally distributed with  = 71 and  = Because  2 = is between.584 and in the Chi-Square Distribution Table, the area in the upper tail Because  2 = is between.584 and in the Chi-Square Distribution Table, the area in the upper tail of the distribution is between.90 and.10. Area in Upper Tail  2 Value (df = 3) A precise p -value can be found A precise p -value can be found using Minitab or Excel. using Minitab or Excel. A precise p -value can be found A precise p -value can be found using Minitab or Excel. using Minitab or Excel.

43 Slide IS 310 – Business Statistics End of Chapter 12