Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.

Slides:



Advertisements
Similar presentations
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Advertisements

CHAPTER 23: Two Categorical Variables The Chi-Square Test ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Chapter 11 Inference for Distributions of Categorical Data
Chapter 11 Inference for Distributions of Categorical Data
Chapter 13: Inference for Distributions of Categorical Data
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
Chapter 26: Comparing Counts
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Chapter 13: Inference for Tables – Chi-Square Procedures
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
1 Desipramine is an antidepressant affecting the brain chemicals that may become unbalanced and cause depression. It was tested for recovery from cocaine.
Chapter 11: Inference for Distributions of Categorical Data.
Chapter 26 Chi-Square Testing
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chapter 11: Inference for Distributions of Categorical Data Section 11.1 Chi-Square Goodness-of-Fit Tests.
CHAPTER 11 SECTION 2 Inference for Relationships.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Chapter 11 The Chi-Square Test of Association/Independence Target Goal: I can perform a chi-square test for association/independence to determine whether.
+ Chi Square Test Homogeneity or Independence( Association)
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
CHAPTER 23: Two Categorical Variables The Chi-Square Test ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Copyright © 2010 Pearson Education, Inc. Slide
Inference for Distributions of Categorical Variables (C26 BVD)
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
+ Chapter 11 Inference for Distributions of Categorical Data 11.1Chi-Square Goodness-of-Fit Tests 11.2Inference for Relationships.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chi Square Procedures Chapter 14. Chi-Square Goodness-of-Fit Tests Section 14.1.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
 Check the Random, Large Sample Size and Independent conditions before performing a chi-square test  Use a chi-square test for homogeneity to determine.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Check your understanding: p. 684
Chi-square test or c2 test
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 25 Comparing Counts.
Data Analysis for Two-Way Tables
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chi Square Two-way Tables
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 11: Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Inference for Relationships
CHAPTER 11 Inference for Distributions of Categorical Data
Lesson 11 - R Chapter 11 Review:
Analyzing the Association Between Categorical Variables
Chapter 26 Comparing Counts.
Chapter 13: Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Inference for Two Way Tables
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Inference for Two-way Tables
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Presentation transcript:

Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n A Chi-square test for homogeneity allows us to examine how the distribution of two categorical variables differ for two or more populations.

Different Chi-square Tests n n Goodness of fit: one variable in one population. n n Homogeneity: one variable in two or more populations (groups) n n Association/independence: 2 variables in one population.

Chi-square Goodness of Fit n In the test, we compare observed counts from the expected counts that would be expected if H o is true. n The more the difference between observed and expected counts, the greater the evidence to reject the null hypothesis. n Chi-square statistic is a measure of how far the observed counts are from the expected counts.

Chi-square Goodness of Fit n

Chi-square Distributions  The chi-square distributions are a family of distributions that take only positive values and are skewed to the right.  A particular chi-square distribution is specified by giving its degree of freedom.

Chi-square Distributions  Chi-square goodness of fit test uses the chi- square distribution with degrees of freedom equal to the number of categories.  Just like t-distributions, we use a table to compute the P-value.  The mean of a particular chi-square distribution = its degrees of freedom.

Chi-square Distributions  For df > 2, the mode (peak) of the chi- square density curve is at df - 2. i.e., when df = 8, the chi-square distribution has a mean of 8 and a mode of 6.

Chi-square Goodness of Fit-Test  All expected counts must be at least 5.  This large sample size condition takes the place of the Normal condition for z and t procedures.  Must also check that Random and Independent conditions are met.

Chi-square Goodness of Fit-Test  Chi-square test statistic compares observed & expected counts. Do not try and perform calculations with observed & expected proportions.  When checking the Large Sample Size – examine the expected counts not observed.  In this test, we test H o that a categorical variable has a specified distribution.

Chi-square Goodness of Fit-Test  If the test finds a statistically significant result, a follow up analysis that compares the observed and expected counts to look for the largest components of the chi-square statistic.

Chi-square Goodness of Fit-Test n

n

Inference for Relationships n Two way tables can describe relationships between 2 categorical variables. n We can test to determine whether the distribution of a categorical variable is the same for each of several populations or treatments. n Then we can test to see if there is an association between the row & column variables in the two way table.

Inference for Relationships n Multiple comparisons: usually have two parts: –An overall test to see if there is good evidence of any differences among the parameters that we want to compare. –A detailed follow-up analysis to decide which of the parameters differ and to estimate how large the differences are.

Inference for Relationships Inference for Relationships n

Chi-square Test for Homogeneity n This test is also known as a chi-square test for homogeneity of proportions. n This test is when you are comparing two or more populations or treatments. n Ho: The true category proportions are the same for all the populations or treatments (homogeneity of populations or treatments).

Chi-square Test for Homogeneity n

Chi-square: Association/Independence n

 Ho: there is no association between 2 categorical variables in the population of interest.  Ha: there is an association between 2 categorical variables in the population of interest

Chi-square: Association/Independence  Ho: 2 categorical variables are independent in the population of interest.  Ha: 2 categorical variables are not independent in the population of interest.’  Conditions: Random, Large Sample size (at least 5), and Independent: when sampling w/o replacement, check that the population is at least 10 times as large as the sample.

Cautions & Limitations o Don’t confuse homogeneity with tests for independence. Tests for homogeneity are used when individuals in each of two or more independent samples are used when individuals in a single sample are classified according to two categorical variables.

Cautions & Limitations o Tests for independence are used when individuals in a single sample are classified according to 2 variables. o As in the hypothesis tests, we can never say that we have strong support for the null hypothesis. If we do not reject Ho for independence, we can’t conclude that the variables are independent.

Cautions & Limitations o Make sure that your assumptions are reasonable. P-values for chi-square tests are only approximate. o In the chi-square test for homogeneity, the assumption of independent samples is particularly important.

Cautions & Limitations o Don’t jump to conclusions about causation. Just as a strong correlation between two numerical values does not mean that there is a cause and effect relationship between them, an association between two categorical variables doesn’t imply a causal relationship.