Chi-square test Chi-square test or  2 test Notes: Page 217, and your own notebook paper 1.Goodness of Fit 2.Independence 3.Homogeneity.

Slides:



Advertisements
Similar presentations
Chi-square test or c2 test
Advertisements

Chi-square test Chi-square test or  2 test. Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate)
 2 test for independence Used with categorical, bivariate data from ONE sample Used to see if the two categorical variables are associated (dependent)
AP Statistics Tuesday, 15 April 2014 OBJECTIVE TSW (1) identify the conditions to use a chi-square test; (2) examine the chi-square test for independence;
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Goodness-of-Fit Tests.
The Analysis of Categorical Data and Goodness of Fit Tests
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
Chapter 11 Inference for Distributions of Categorical Data
AP Statistics Thursday, 24 April 2014 OBJECTIVE TSW review for tomorrow’s Chi-Square Inference test. DUAL CREDIT FINAL: NEXT WEEK –Everyone will take this.
Please turn in your signed syllabus. We will be going to get textbooks shortly after class starts. Homework: Reading Guide – Chapter 2: The Chemical Context.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
Chi-square test Chi-square test or  2 test. Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate)
CHAPTER 11 Inference for Distributions of Categorical Data
Chi-square Goodness of Fit Test
Does your zodiac sign determine how successful you will be? Fortune magazine collected the zodiac signs of 256 heads of the largest 400 companies. Is there.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Chapter 13: Inference for Tables – Chi-Square Procedures
 Involves testing a hypothesis.  There is no single parameter to estimate.  Considers all categories to give an overall idea of whether the observed.
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
 2 test for independence Used with categorical, bivariate data from ONE sample Used to see if the two categorical variables are associated (dependent)
13.1 Goodness of Fit Test AP Statistics. Chi-Square Distributions The chi-square distributions are a family of distributions that take on only positive.
Chapter 11: Inference for Distributions of Categorical Data.
Chapter 11: Inference for Distributions of Categorical Data
Chi-square test Chi-square test or  2 test. crazy What if we are interested in seeing if my “crazy” dice are considered “fair”? What can I do?
Chi-square test or c2 test
Chi-square test Chi-square test or  2 test Notes: Page Goodness of Fit 2.Independence 3.Homogeneity.
13.1 Test for Goodness of Fit.  Perform and analyze a chi-squared test for goodness of fit.
Chapter 26 Chi-Square Testing
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chapter 11: Inference for Distributions of Categorical Data Section 11.1 Chi-Square Goodness-of-Fit Tests.
Section 11.1 Chi-Square Goodness-of-Fit Tests
The Practice of Statistics Third Edition Chapter (13.1) 14.1: Chi-square Test for Goodness of Fit Copyright © 2008 by W. H. Freeman & Company Daniel S.
Warm up On slide.
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
Chi-square test Chi-square test or  2 test. crazy What if we are interested in seeing if my “crazy” dice are considered “fair”? What can I do?
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
AGENDA:. AP STAT Ch. 14.: X 2 Tests Goodness of Fit Homogeniety Independence EQ: What are expected values and how are they used to calculate Chi-Square?
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
+ Chapter 11 Inference for Distributions of Categorical Data 11.1Chi-Square Goodness-of-Fit Tests 11.2Inference for Relationships.
Chapter 12 The Analysis of Categorical Data and Goodness of Fit Tests.
Chi squared. A biologist randomly selects 20 fruit flies. All of them are placed in a tube with vinegar on one side and mustard on the other side. Do.
Chi-square test Chi-square test or  2 test. Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate)
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
Chi-Squared Test of Homogeneity Are different populations the same across some characteristic?
The χ 2 (Chi-Squared) Test. Crazy Dice? You roll a die 60 times and get: 3 ones, 6 twos, 19 threes, 22 fours, 6 fives, and 4 sixes  Is this a fair die?
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
AP Statistics Tuesday, 05 April 2016 OBJECTIVE TSW (1) identify the conditions to use a chi-square test; (2) examine the chi- square test for independence;
Inference for Tables Chi-Square Tests Chi-Square Test Basics Formula for test statistic: Conditions: Data is from a random sample/event. All individual.
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
Chapter 12 Lesson 12.2b Comparing Two Populations or Treatments 12.2: Test for Homogeneity and Independence in a Two-way Table.
Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a goodness-of-fit.
Chi-square test or c2 test
Inference for Tables: Chi-Squares procedures (2 more chapters to go!)
Chi-square test or χ2 test
Chi-squared test or c2 test
Chi-square test or c2 test
Chi-square test or c2 test
Chi-square test or c2 test
Chi-square test or c2 test
The Analysis of Categorical Data and Chi-Square Procedures
Chi-square test or c2 test
The Analysis of Categorical Data and Goodness of Fit Tests
The Analysis of Categorical Data and Goodness of Fit Tests
The Analysis of Categorical Data and Goodness of Fit Tests
The Analysis of Categorical Data and Goodness of Fit Tests
Chi-square test or c2 test
Inference for Distributions of Categorical Data
Presentation transcript:

Chi-square test Chi-square test or  2 test Notes: Page 217, and your own notebook paper 1.Goodness of Fit 2.Independence 3.Homogeneity

Chi-square test countsUsed to test the counts of categorical data ThreeThree types –Goodness of fit (univariate) –Independence (bivariate) –Homogeneity (univariate with two samples)

 2 distribution – df=3 df=5 df=10

 2 distributions Different df have different curves Skewed right normal curveAs df increases, curve shifts toward right & becomes more like a normal curve

 2 assumptions SRSSRS – reasonably random sample countsHave counts of categorical data & we expect each category to happen at least once Sample sizeSample size – to ensure that the sample size is large enough we should expect at least five in each category. ***Be sure to list expected counts!! Combine these together: All expected counts are at least 5.

 2 formula

 2 Goodness of fit test Uses univariate data Want to see how well the observed counts “fit” what we expect the counts to be  2 cdf function p-valuesUse  2 cdf function on the calculator to find p-values Based on df df = number of categories - 1

Hypotheses – written in words H 0 : proportions are equal H a : at least one proportion is not the same Be sure to write in context!

Does your zodiac sign determine how successful you will be? Fortune magazine collected the zodiac signs of 256 heads of the largest 400 companies. Is there sufficient evidence to claim that successful people are more likely to be born under some signs than others? Aries 23Libra18Leo20 Taurus20Scorpio21Virgo19 Gemini18Sagittarius19Aquarius24 Cancer23Capricorn22Pisces29 How many would you expect in each sign if there were no difference between them? How many degrees of freedom? I would expect CEOs to be equally born under all signs. So 256/12 = df = #categories - 1 Since there are 12 signs, df = 12 – 1 = 11

Does your zodiac sign determine how successful you will be? Fortune magazine collected the zodiac signs of 256 heads of the largest 400 companies. Is there sufficient evidence to claim that successful people are more likely to be born under some signs than others? Aries 23Libra18Leo20 Taurus20Scorpio21Virgo19 Gemini18Sagittarius19Aquarius24 Cancer23Capricorn22Pisces29 Calculator: Enter the data into L1

Assumptions: Have a random sample of CEO’s All expected counts are greater than 5. (I expect CEO’s to be born in each sign.) H 0 : The proportions of CEO’s born under each sign are the same. H a : At least one of the proportions of CEO’s born under each sign is different. P-value =  2 cdf(5.094, 10^99, 11) =.9265  =.05 Since p-value > , I fail to reject H 0. There is not sufficient evidence to suggest that the CEOs are born under some signs more than others. Newer Calculators: L1: Enter observed data L2: Enter expected values (256/12) = STAT,CALC, X 2 GOF-Test… Newer Calculators: L1: Enter observed data L2: Enter expected values (256/12) = STAT,CALC, X 2 GOF-Test…

Assumptions: Have a random sample of CEO’s All expected counts are greater than 5. (I expect CEO’s to be born in each sign.) H 0 : The proportions of CEO’s born under each sign are the same. H a : At least one of the proportions of CEO’s born under each sign is different. P-value =  2 cdf(5.094, 10^99, 11) =.9265  =.05 Since p-value > , I fail to reject H 0. There is not sufficient evidence to suggest that the CEOs are born under some signs more than others. OR, if you have an older calculator: L2: (L ) 2 / STAT,CALC, 1-VarStats L2 …gives Σx=5.094 OR, if you have an older calculator: L2: (L ) 2 / STAT,CALC, 1-VarStats L2 …gives Σx=5.094 Calculator for p-value: 2ndVARS, 8:X 2 cdf Calculator for p-value: 2ndVARS, 8:X 2 cdf

A company says its premium mixture of nuts contains 10% Brazil nuts, 20% cashews, 20% almonds, 10% hazelnuts and 40% peanuts. You buy a large can and separate the nuts. Upon weighing them, you find there are 112 g Brazil nuts, 183 g of cashews, 207 g of almonds, 71 g or hazelnuts, and 446 g of peanuts. You wonder whether your mix is significantly different from what the company advertises? Why is the chi-square goodness-of-fit test NOT appropriate here? What might you do instead of weighing the nuts in order to use chi-square? counts Because we do NOT have counts of the type of nuts. count We could count the number of each type of nut and then perform a  2 test.

Offspring of certain fruit flies may have yellow or ebony bodies and normal wings or short wings. Genetic theory predicts that these traits will appear in the ratio 9:3:3:1 (yellow & normal, yellow & short, ebony & normal, ebony & short) A researcher checks 100 such flies and finds the distribution of traits to be 59, 20, 11, and 10, respectively. What are the expected counts? df? Are the results consistent with the theoretical distribution predicted by the genetic model? (see next page) Expected counts: Y & N = Y & S = E & N = E & S = 6.25 We expect 9/16 of the 100 flies to have yellow and normal wings. (Y & N) So, we expect (Y & N) Since there are 4 categories, df = 4 – 1 = 3

Assumptions: Have a random sample of fruit flies All expected counts are greater than 5. Expected counts: Y & N = 56.25, Y & S = 18.75, E & N = 18.75, E & S = 6.25 H 0 : The proportions of fruit flies are the same as the theoretical model. H a : At least one of the proportions of fruit flies is not the same as the theoretical model. P-value =  2 cdf(5.671, 10^99, 3) =.129  =.05 Since p-value > , I fail to reject H 0. There is not sufficient evidence to suggest that the distribution of fruit flies is not the same as the theoretical model. L1: Observed data L2: Expected values L3: (L1 – L2) 2 / L2 Then, use STAT,CALC,1-VarStats of L3 to see Σx=5.671 L1: Observed data L2: Expected values L3: (L1 – L2) 2 / L2 Then, use STAT,CALC,1-VarStats of L3 to see Σx= nd VARS

Homework: Page 222, 223