Overview and Chi-Square

Presentation transcript:

Sections 11-1 & 11-2: Overview and Chi-Square Goodness-of-Fit Test

Analyzing Categorical Data We focus on analysis of categorical (qualitative or attribute) data that can be separated into different categories (often called cells). Use the $\chi^2$ (chi-square) test statistic (Table A-4). The goodness-of-fit test uses a one-way frequency table (single row or column). page 590 of Elementary Statistics, 10th Edition

Purpose of the Goodness of Fit Test Given data separated into different categories, we will test the hypothesis that the distribution of the data agrees with or “fits” some claimed distribution. The hypothesis test will use the chi-square distribution with the observed frequency counts and the frequency counts that we would expect with the claimed distribution. page 591 of Elementary Statistics, 10th Edition

Multinomial Experiment This is an experiment that meets the following conditions: 1. The number of trials is fixed. 2. The trials are independent. 3. All outcomes of each trial must be classified into exactly one of several different categories. 4. The probabilities for the different categories remain constant for each trial. Note that these conditions are similar to those of a binomial experiment. A binomial experiment has only two categories, whereas a multinomial experiment has more than two categories.

Goodness-of-fit Test A goodness-of-fit test is used to test the hypothesis that an observed frequency distribution fits (or conforms to) some claimed distribution. page 592 of Elementary Statistics, 10th Edition

Goodness-of-Fit Test Notation O represents the observed frequency of an outcome. E represents the expected frequency of an outcome. k represents the number of different categories or outcomes. n represents the total number of trials. page 592 of Elementary Statistics, 10th Edition

Expected Frequencies If all expected frequencies are equal: each expected frequency is the sum of all observed frequencies divided by the number of categories. $E = \frac{n}{k}$ page 593 of Elementary Statistics, 10th Edition

Expected Frequencies If expected frequencies are not all equal: each expected frequency is found by multiplying the sum of all observed frequencies by the probability for the category. $E = np$ page 593 of Elementary Statistics, 10th Edition
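The two rules for expected frequencies can be sketched in a few lines of Python. The function names `expected_equal` and `expected_unequal` are illustrative and not from the textbook. The inputs are the sample sizes and proportions quoted elsewhere in these slides (80 last digits in 10 categories; 784 checks with the Benford's Law proportions).

```python
def expected_equal(n, k):
    """Expected frequency per category when all categories are equally likely: E = n / k."""
    return n / k


def expected_unequal(n, probabilities):
    """Expected frequency for each category from its claimed probability: E = n * p."""
    return [n * p for p in probabilities]


# 80 observations spread evenly over 10 categories (the last-digit example).
equal_e = expected_equal(80, 10)

# 784 observations with the Benford's Law proportions stated in the fraud example.
benford_p = [0.301, 0.176, 0.125, 0.097, 0.079, 0.067, 0.058, 0.051, 0.046]
benford_e = expected_unequal(784, benford_p)

print(equal_e)
print([round(e, 3) for e in benford_e])
```

In both cases every expected frequency is at least 5, so the sample-size requirement for the test is satisfied.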

Goodness-of-Fit Test in Multinomial Experiments: Requirements/Assumptions The data have been randomly selected. The sample data consist of frequency counts for each of the different categories. For each category, the expected frequency is at least 5. (The expected frequency for a category is the frequency that would occur if the data actually have the distribution that is being claimed. There is no requirement that the observed frequency for each category must be at least 5.) page 593 of Elementary Statistics, 10th Edition

Goodness-of-Fit Test in Multinomial Experiments Test Statistic $\chi^2 = \sum \frac{(O - E)^2}{E}$ Critical Values 1. Found in Table A-4 using k - 1 degrees of freedom, where k = number of categories. 2. Goodness-of-fit hypothesis tests are always right-tailed.
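As a minimal sketch of the test statistic and the right-tailed decision rule, the snippet below sums (O - E)^2 / E over the categories and compares the result with a table critical value. The four observed counts are hypothetical, made up only to keep the arithmetic easy to check by hand. The critical value 7.815 is the standard chi-square table entry for a significance level of 0.05 with 3 degrees of freedom.

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical counts for illustration only: 40 trials, 4 equally likely categories.
observed = [12, 8, 10, 10]
expected = [10.0, 10.0, 10.0, 10.0]

stat = chi_square_statistic(observed, expected)
df = len(observed) - 1          # k - 1 degrees of freedom
critical_value = 7.815          # chi-square table, alpha = 0.05, df = 3

# The test is right-tailed: reject H0 only when the statistic is large.
reject_h0 = stat > critical_value
print(stat, df, reject_h0)
```

Here the statistic is (4 + 4 + 0 + 0) / 10 = 0.8, far below the critical value, so these hypothetical counts are consistent with the claimed distribution.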

Goodness-of-Fit Test in Multinomial Experiments A close agreement between observed and expected values will lead to a small value of $\chi^2$ and a large P-value. A large disagreement between observed and expected values will lead to a large value of $\chi^2$ and a small P-value. A significantly large value of $\chi^2$ will cause a rejection of the null hypothesis of no difference between the observed and the expected.
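The link between the size of the statistic and the P-value can be made concrete with a small right-tail calculator. The slides rely on Table A-4 rather than a formula, so the function below is a sketch that is not part of the presentation: it uses the standard closed-form series for the chi-square upper tail with integer degrees of freedom, and the name `chi_square_p_value` is illustrative.

```python
import math


def chi_square_p_value(stat, df):
    """Right-tail area P(X >= stat) for a chi-square variable with integer df >= 1."""
    if stat < 0 or df < 1:
        raise ValueError("stat must be >= 0 and df must be >= 1")
    half = stat / 2.0
    if df % 2 == 0:
        # Even df: exp(-x/2) * sum_{r=0}^{df/2 - 1} (x/2)^r / r!
        term = 1.0
        total = 1.0
        for r in range(1, df // 2):
            term *= half / r
            total += term
        return math.exp(-half) * total
    # Odd df: erfc(sqrt(x/2)) + sqrt(2/pi) * exp(-x/2) * sum of x^(r - 1/2) / (1*3*...*(2r-1))
    root = math.sqrt(stat)
    term = root
    total = 0.0
    for r in range(1, (df - 1) // 2 + 1):
        if r > 1:
            term *= stat / (2 * r - 1)
        total += term
    return math.erfc(root / math.sqrt(2)) + math.sqrt(2 / math.pi) * math.exp(-half) * total


# Small statistic -> large P-value; large statistic -> small P-value.
for x in (1.0, 16.919, 156.5):
    print(x, chi_square_p_value(x, 9))
```

Evaluating the function at a table critical value should return approximately the corresponding significance level, for example about 0.05 at 16.919 with 9 degrees of freedom.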

Relationships Among the $\chi^2$ Test Statistic, P-Value, and Goodness-of-Fit Figure 11-3 page 594 of Elementary Statistics, 10th Edition

Example: Last Digits of Weights When asked, people often provide weights that are somewhat lower than their actual weights. So how can researchers verify that weights were obtained through actual measurements instead of asking subjects? The analysis of the last digits of data is often used to detect data that have been reported instead of measured. page 591 of Elementary Statistics, 10th Edition

Example: Last Digits of Weights Test the claim that the digits in Table 11-2 do not occur with the same frequency. Table 11-2 summarizes the last digit of weights of 80 randomly selected students. page 592 of Elementary Statistics, 10th Edition

Example: Last Digits of Weights Verify that the assumptions for the hypothesis test have been met. 1. Assume a random sample of student weights. 2. The expected count is 8 for every category, which is greater than 5; therefore, the sample is large enough. page 591 of Elementary Statistics, 10th Edition

Example: Last Digit Analysis Test the claim that the digits in Table 11-2 do not occur with the same frequency. $H_0: p_0 = p_1 = \cdots = p_9$ $H_1$: At least one of the probabilities is different from the others. $\alpha = 0.05$; k - 1 = 9 degrees of freedom; $\chi^2$ critical value = 16.919 page 595 of Elementary Statistics, 10th Edition

Example: Last Digit Analysis Test the claim that the digits in Table 11-2 do not occur with the same frequency. If the null hypothesis is true, the 80 digits would be uniformly distributed across the 10 categories, so each expected frequency should be 80/10 = 8.

Example: Last Digit Analysis Test the claim that the digits in Table 11-2 do not occur with the same frequency. page 596 of Elementary Statistics, 10th Edition

Example: Last Digit Analysis Test the claim that the digits in Table 11-2 do not occur with the same frequency. From Table 11-3, the test statistic is $\chi^2 = 156.500$. Since this exceeds the critical value of 16.919, we reject the null hypothesis of equal probabilities. There is sufficient evidence to support the claim that the last digits do not occur with the same relative frequency.
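The calculation behind this conclusion can be sketched as follows. Tables 11-2 and 11-3 are not reproduced in this transcript, so the observed counts below are an assumption: they were chosen because they total 80 and reproduce the stated statistic of 156.500, and they should be checked against the textbook before being relied on. The critical value 16.919 is the one given in the slides.

```python
# ASSUMED counts for last digits 0 through 9 (Table 11-2 is missing from the transcript).
observed = [35, 0, 2, 1, 4, 24, 1, 4, 7, 2]

n = sum(observed)        # total number of weights
k = len(observed)        # number of categories (digits 0-9)
expected = n / k         # equal expected frequency under H0

# Per-category contributions (O - E)^2 / E, the kind of column a worked table would list.
contributions = [(o - expected) ** 2 / expected for o in observed]
stat = sum(contributions)

critical_value = 16.919  # from the slides: alpha = 0.05, df = k - 1 = 9

for digit, (o, c) in enumerate(zip(observed, contributions)):
    print(f"digit {digit}: O = {o:2d}, E = {expected:.0f}, (O-E)^2/E = {c:.3f}")
print(f"chi-square = {stat:.3f}, reject H0: {stat > critical_value}")
```

With these assumed counts, most of the statistic comes from the digits 0 and 5, which is the pattern expected when weights are reported and rounded rather than measured.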

Example: Detecting Fraud Unequal Expected Frequencies In the Chapter Problem, it was noted that statistics can be used to detect fraud. Table 11-1 lists the percentages for leading digits from Benford’s Law. page 597 of Elementary Statistics, 10th Edition

Example: Detecting Fraud Test the claim that there is a significant discrepancy between the leading digits expected from Benford’s Law and the leading digits from the 784 checks. Observed Frequencies and Frequencies Expected with Benford’s Law

Example: Detecting Fraud Test the claim that there is a significant discrepancy between the leading digits expected from Benford’s Law and the leading digits from the 784 checks. $H_0$: $p_1 = 0.301$, $p_2 = 0.176$, $p_3 = 0.125$, $p_4 = 0.097$, $p_5 = 0.079$, $p_6 = 0.067$, $p_7 = 0.058$, $p_8 = 0.051$ and $p_9 = 0.046$ $H_1$: At least one of the proportions is different from the claimed values. $\alpha = 0.01$; k - 1 = 8 degrees of freedom; critical value $\chi^2_{0.01,\,8} = 20.090$ page 597 of Elementary Statistics, 10th Edition

Example: Detecting Fraud Test the claim that there is a significant discrepancy between the leading digits expected from Benford’s Law and the leading digits from the 784 checks. The test statistic is $\chi^2 = 3650.251$. Since this exceeds the critical value of 20.090, we reject the null hypothesis. There is sufficient evidence to support the claim of a significant discrepancy: at least one of the proportions is different from the value expected under Benford’s Law.
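A sketch of the same calculation with unequal expected frequencies follows. The proportions and the critical value 20.090 come from the slides. The observed counts do not appear in this transcript, so the list below is an assumption: the counts were chosen because they total 784 and reproduce the stated statistic of 3650.251, and they should be verified against the textbook table.

```python
# Benford's Law proportions for leading digits 1 through 9 (from the null hypothesis).
benford_p = [0.301, 0.176, 0.125, 0.097, 0.079, 0.067, 0.058, 0.051, 0.046]

# ASSUMED observed counts of leading digits 1 through 9 on the 784 checks.
observed = [0, 15, 0, 76, 479, 183, 8, 23, 0]

n = sum(observed)
expected = [n * p for p in benford_p]   # E = n * p for each category

stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
critical_value = 20.090                 # from the slides: alpha = 0.01, df = 8

print(f"n = {n}, chi-square = {stat:.3f}, reject H0: {stat > critical_value}")
```

Under these assumed counts, the leading digit 5 alone contributes most of the statistic, since far more checks begin with 5 than the roughly 62 that Benford's Law predicts.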

Example: Detecting Fraud Test the claim that there is a significant discrepancy between the leading digits expected from Benford’s Law and the leading digits from the 784 checks. Figure 11-5 page 596 of Elementary Statistics, 10th Edition

Example: Detecting Fraud page 599 of Elementary Statistics, 10th Edition Figure 11-6 Comparison of Observed Frequencies and Frequencies Expected with Benford’s Law