Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.

Slides:



Advertisements
Similar presentations
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 10 The Analysis of Variance.
Advertisements

Tests of Hypotheses Based on a Single Sample
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Goodness-of-Fit Tests.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Tests for Homogeneity.
Inference about the Difference Between the
Copyright © Cengage Learning. All rights reserved. 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Chapter 10 Chi-Square Tests and the F- Distribution 1 Larson/Farber 4th ed.
Discrete (Categorical) Data Analysis
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Categorical Variables Chapter 15.
Chapter 11 Chi-Square Procedures 11.1 Chi-Square Goodness of Fit.
Chapter 16 Chi Squared Tests.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Copyright © Cengage Learning. All rights reserved. 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Simple Linear Regression and Correlation
Chi-Square Tests and the F-Distribution
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Test of Independence.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Section 10.1 Goodness of Fit. Section 10.1 Objectives Use the chi-square distribution to test whether a frequency distribution fits a claimed distribution.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Chapter 16 – Categorical Data Analysis Math 22 Introductory Statistics.
Copyright © 2009 Cengage Learning 15.1 Chapter 16 Chi-Squared Tests.
Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Chapter 26 Chi-Square Testing
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Chapter 10 Chi-Square Tests and the F-Distribution
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
GOODNESS OF FIT Larson/Farber 4th ed 1 Section 10.1.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
© 2000 Prentice-Hall, Inc. Statistics The Chi-Square Test & The Analysis of Contingency Tables Chapter 13.
Copyright © 2010 Pearson Education, Inc. Slide
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter Outline Goodness of Fit test Test of Independence.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Statistics 300: Elementary Statistics Section 11-2.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chapter 11 Chi-Square Procedures 11.1 Chi-Square Goodness of Fit.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Copyright © Cengage Learning. All rights reserved. 14 Goodness-of-Fit Tests and Categorical Data Analysis.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Section 10.1 Goodness of Fit © 2012 Pearson Education, Inc. All rights reserved. 1 of 91.
Chapter 12 Tests with Qualitative Data
Elementary Statistics: Picturing The World
Goodness-of-Fit Tests
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Presentation transcript:

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc Goodness-of-fit Tests When Category Probabilities Are Completely Specified

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Terminology A binomial experiment consists of a sequence of independent trials in which each trial can result in one of two possible outcomes. A multinomial experiment generalizes a binomial experiment by allowing each trial to result in one of k outcomes, where k is an integer greater than 2.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Multinomial Experiment The expected number of trials resulting in category i is E(N i ) = np i. When H 0 :p 1 = p 10,…,p k = p k0 is true, these expected values become E(N 1 ) = np 10,…, E(N k ) = np k0.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Recall: Chi-squared Critical Value Let, called a chi-squared critical value, denote the number of the measurement axis such that of the area under the chi-squared curve with v df lies to the right of

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Notation Illustrated

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Multinomial Experiment Provided that np i 5 for every i, the random variable has approximately a chi-squared distribution with k – 1 df.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Test With Significance Level Null hypothesis H 0 : p 1 = p 10,…,p k = p k0 Alternative hypoth H a : at least one Test statistic value: Rejection region:

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. P-Values for Chi-Squared Tests The P-value for an upper-tailed chi- squared test is the area under the curve to the right of the calculated

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. When the p i ’s Are Functions of Other Parameters Frequently the p i ’s are hypothesized to depend on a smaller number of parameters (m < k). Then a specific hypothesis involving the yields specific p i0 ’s, which are then used in the test.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. When the Underlying Distribution is Continuous Let X denote the variable being sampled. The hypothesized pdf is f 0 (x). Subdivide the measurement scale of X into k intervals [a 0, a 1 ),…,. The cell probabilities specified by H 0 are

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc Goodness of Fit for Composite Hypotheses

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. When Parameters Are Estimated The null hypothesis states that each p i is a function of a small number of parameters with the otherwise unspecified. H a : the hypothesis is not true

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. For general k, the joint distribution of N 1,…,N k is the multinomial distribution with Joint Distribution when H 0 is true this becomes

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Method of Estimation Let n 1,…,n k denote the observed values of N 1,…,N k. Then are those values of the that maximize are the maximum likelihood estimators of

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Theorem Under general “regularity” conditions on and the ’s if are estimated by the method of maximum likelihood as described previously and n is large, has approximately a chi-squared distribution with k – 1 – m df when H 0 is true.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Level Test An approximate level test of H 0 versus H a is then to reject H 0 if. In practice, the test can be used if for every i.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Degrees of Freedom A general rule of thumb for degrees of freedom in a chi-squared test is

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Test Procedure

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Let be the maximum likelihood estimators of based on the full sample X 1,…,X n and let denote the statistic based on these estimators. Then the critical value that specifies a level upper-tailed test satisfies Goodness of Fit for Discrete Distributions

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Goodness of Fit for Continuous Distributions The chi-squared test can be used to test whether the sample comes from a specified family of continuous distributions. Once the cells are chosen (independent of the observations) it is usually difficult to estimate unspecified parameters from the observed cell counts, so mle’s based on the full sample are computed.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Special Test for Normality H 0 : the population distribution is normal H a : the pop. distribution is not normal versus consists of rejecting H 0 when and Compute r for the pairs (x (1),y 1 ),…,(x (n),y n ). The Ryan-Joiner test of

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc Two-Way Contingency Tables

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Data With Counts or Frequencies 1. There are I populations of interest, each corresponding to a different row of the table, and each population is divided into the same J categories. A sample is taken from the ith population, and the counts are entered in the cells in the ith row of the table.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Data With Counts or Frequencies 2. There is a single population of interest, with each individual in the population categorized with respect to two different factors. There are I categories associated with the first factor and J categories associated with the second factor.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. n 11 n 12 n1jn1j n1Jn1J n 21 ni1ni1 n ij nI1nI1 n IJ Two-Way Contingency Table

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Estimated Expected Counts Under H 0 (Homogeneity)

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Null hypothesis: H 0 : p 1j = p 2j =…= p Ij Alternative hypoth.: H a : H 0 is not true Test statistic value: Rejection region: Test for Homogeneity Apply as long as

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Multinomial Experiment Provided that np i 5 for every i, the random variable has approximately a chi-squared distribution with k – 1 df.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Estimated Expected Counts (Independence)

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Null hypothesis: H 0 : Alternative hypoth.: H a : H 0 is not true Test statistic value: Rejection region: Test for Independence Apply as long as