Categorical Data Analysis

Slides:



Advertisements
Similar presentations
1 2 Test for Independence 2 Test for Independence.
Advertisements

© 2011 Pearson Education, Inc
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chi-Square Tests Chapter 12.
Chapter 18: The Chi-Square Statistic
Copyright ©2006 Brooks/Cole A division of Thomson Learning, Inc. Introduction to Probability and Statistics Twelfth Edition Robert J. Beaver Barbara M.
CHI-SQUARE(X2) DISTRIBUTION
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
Inference about the Difference Between the
Discrete (Categorical) Data Analysis
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Chapter 16 Chi Squared Tests.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Chapter 16 – Categorical Data Analysis Math 22 Introductory Statistics.
Copyright © 2009 Cengage Learning 15.1 Chapter 16 Chi-Squared Tests.
Chapter 13: Categorical Data Analysis Statistics.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
Chapter 8 Introduction to Hypothesis Testing ©. Chapter 8 - Chapter Outcomes After studying the material in this chapter, you should be able to: 4 Formulate.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.
© 2000 Prentice-Hall, Inc. Statistics The Chi-Square Test & The Analysis of Contingency Tables Chapter 13.
Copyright © 2010 Pearson Education, Inc. Slide
Introduction to Probability and Statistics Thirteenth Edition Chapter 13 Analysis of Categorical Data.
Chapter Outline Goodness of Fit test Test of Independence.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Statistics 300: Elementary Statistics Section 11-2.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chi-Två Test Kapitel 6. Introduction Two statistical techniques are presented, to analyze nominal data. –A goodness-of-fit test for the multinomial experiment.
Goodness-of-Fit and Contingency Tables Chapter 11.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
©2006 Thomson/South-Western 1 Chapter 12 – Analysis of Categorical Data Slides prepared by Jeff Heyl Lincoln University ©2006 Thomson/South-Western Concise.
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
Chapter 9: Non-parametric Tests
Chi-square test or c2 test
CHAPTER 11 CHI-SQUARE TESTS
Chapter Fifteen McGraw-Hill/Irwin
Active Learning Lecture Slides
Data Analysis for Two-Way Tables
Chapter 11 Goodness-of-Fit and Contingency Tables
Inferences on Two Samples Summary
Chi Square Two-way Tables
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Econ 3790: Business and Economics Statistics
Chapter 11: Inference for Distributions of Categorical Data
Hypothesis Tests for Proportions
Chapter 10 Analyzing the Association Between Categorical Variables
Chapter 13: Categorical Data Analysis
Overview and Chi-Square
Chapter 13 – Applications of the Chi-Square Statistic
Inference for Tables Chi-Square Tests Ch. 11.
Inference on Categorical Data
Analyzing the Association Between Categorical Variables
CHAPTER 11 CHI-SQUARE TESTS
Inference for Tables Chi-Square Tests Ch. 11.
Inference for Two Way Tables
Inference for Two-way Tables
Chapter Outline Goodness of Fit test Test of Independence.
Presentation transcript:

Categorical Data Analysis Chapter 13 Categorical Data Analysis Slides for Optional Sections No Optional Sections

Categorical Data and the Multinomial Distribution Properties of the Multinomial Experiment Experiment has n identical trials There are k possible outcomes to each trial, called classes, categories or cells Probabilities of the k outcomes remain constant from trial to trial Trials are independent Variables of interest are the cell counts, n1, n2…nk, the number of observations that fall into each of the k classes

Testing Category Probabilities: One-Way Table In a multinomial experiment with categorical data from a single qualitative variable, we summarize data in a one-way table.

Testing Category Probabilities: One-Way Table Hypothesis Testing for a One-Way Table Based on the 2 statistic, which allows comparison between the observed distribution of counts and an expected distribution of counts across the k classes Expected distribution = E(nk)=npk, where n is the total number of trials, and pk is the hypothesized probability of being in class k according to H0 The test statistic, 2, is calculated as and the rejection region is determined by the 2 distribution using k-1 df and the desired 

Testing Category Probabilities: One-Way Table Hypothesis Testing for a One-Way Table The null hypothesis is often formulated as a no difference, where H0: p1=p2=p3=…=pk=1/k, but can be formulated with non-equivalent probabilities Alternate hypothesis states that Ha: at least one of the multinomial probabilities does not equal its hypothesized value

Testing Category Probabilities: One-Way Table Hypothesis Testing for a One-Way Table The null hypothesis is often formulated as a no difference, where H0: p1=p2=p3=…=pk=1/k, but can be formulated with non-equivalent probabilities Alternate hypothesis states that Ha: at least one of the multinomial probabilities does not equal its hypothesized value

Testing Category Probabilities: One-Way Table One-Way Tables: an example H0: pLegal=.07, pdecrim=.18, pexistlaw=.65, pnone=.10 Ha: At least 2 proportions differ from proposed plan Rejection region with =.01, df = k-1 = 3 is 11.3449 Since the test statistic falls in the rejection region, we reject H0

Testing Category Probabilities: One-Way Table Conditions Required for a valid 2 Test Multinomial experiment has been conducted Sample size is large, with E(ni) at least 5 for every cell

Testing Category Probabilities: Two-Way (Contingency) Table Used when classifying with two qualitative variables H0: The two classifications are independent Ha: The two classifications are dependent Test Statistic: Rejection region:2>2, where 2 has (r-1)(c-1) df

Testing Category Probabilities: Two-Way (Contingency) Table Conditions Required for a valid 2 Test N observed counts are a random sample from the population of interest Sample size is large, with E(ni) at least 5 for every cell

Testing Category Probabilities: Two-Way (Contingency) Table Sample Statistical package output

A Word of Caution about Chi-Square Tests When an expected cell count is less than 5, 2 probability distribution should not be used If H0 is not rejected, do not accept H0 that the classifications are independent, due to the implications of a Type II error. Do not infer causality when H0 is rejected. Contingency table analysis determines statistical dependence only.