Two-Sample Proportions Inference. Sampling Distributions for the difference in proportions When tossing pennies, the probability of the coin landing.

Slides:



Advertisements
Similar presentations
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 22 Comparing Two Proportions.
Advertisements

Small differences. Two Proportion z-Interval and z-Tests.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Two Sample Proportions Large Sample Difference of Proportions z Test & Confidence.
Inference on Proportions. What are the steps for performing a confidence interval? 1.Assumptions 2.Calculations 3.Conclusion.
Testing a Claim about a Proportion Assumptions 1.The sample was a simple random sample 2.The conditions for a binomial distribution are satisfied 3.Both.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Two-Sample Inference Procedures with Means. Remember: We will be intereste d in the differen ce of means, so we will use this to find standard error.
SECTION 12.2 Tests About a Population Proportion.
Confidence Interval and Hypothesis Testing for:
Chapter 11: Inference for Distributions
Significance Tests for Proportions Presentation 9.2.
Two-Sample Proportions Inference Notes: Page 211.
Inference for Two Proportions Chapter 22
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Two-Sample Proportions Inference. Sampling Distributions for the difference in proportions When tossing pennies, the probability of the coin landing on.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
10.3 Estimating a Population Proportion
Copyright © 2010 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
CHAPTER 21: Comparing Two Proportions
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Two-Sample Inference Procedures with Means. Two-Sample Procedures with means two treatments two populationsThe goal of these inference procedures is to.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Chapter 11 Comparing Two Populations or Treatments.
8.2 - Estimating a Population Proportion
Confidence Intervals and Hypothesis tests with Proportions.
Comparing Two Proportions
Hypothesis Testing for Proportions
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 11 Comparing Two Populations or Treatments.
Inference for Proportions One Sample. Confidence Intervals One Sample Proportions.
Two-Sample Inference Procedures with Means. Suppose we have a population of adult men with a mean height of 71 inches and standard deviation of 2.6 inches.
STT 315 Ashwini Maurya Acknowledgement: Author is indebted to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit many.
Chapter 23 Inference for One- Sample Means. Steps for doing a confidence interval: 1)State the parameter 2)Conditions 1) The sample should be chosen randomly.
AP Statistics Chapter 22 Notes “Comparing Two Proportions”
Chapter 22 Two-Sample Proportions Inference. Steps for doing a confidence interval: 1)State the parameter 2)Assumptions – 1) The samples are chosen randomly.
Two-Sample Inference Procedures with Means. Remember: We will be interested in the difference of means, so we will use this to find standard error.
Large sample CI for μ Small sample CI for μ Large sample CI for p
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
Copyright © 2010 Pearson Education, Inc. Slide Beware: Lots of hidden slides!
Chapter 20 Testing Hypothesis about proportions
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
Section 10.1 Estimating with Confidence AP Statistics February 11 th, 2011.
12.1 Inference for A Population Proportion.  Calculate and analyze a one proportion z-test in order to generalize about an unknown population proportion.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
Inference on Proportions. Assumptions: SRS Normal distribution np > 10 & n(1-p) > 10 Population is at least 10n.
Suppose we have a population of adult men with a mean height of 71 inches and standard deviation of 2.6 inches. We also have a population of adult women.
Inference for Proportions Section Starter Do dogs who are house pets have higher cholesterol than dogs who live in a research clinic? A.
Chapter 22 Comparing Two Proportions.  Comparisons between two percentages are much more common than questions about isolated percentages.  We often.
Chapter 22 Comparing two proportions Math2200. Are men more intelligent? Gallup poll A random sample of 520 women and 506 men 28% of the men thought men.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Warm Up Exercise Warm Up Write down the four steps to confidence interval. 2.
Objectives (PSLS Chapter 18) Comparing two means (σ unknown)  Two-sample situations  t-distribution for two independent samples  Two-sample t test 
Sampling Distributions of Proportions. Toss a penny 20 times and record the number of heads. Calculate the proportion of heads & mark it on the dot plot.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Two-Sample Inference Procedures with Means. Two independent samples Difference of Means.
Two-Sample Proportions Inference. Conditions TwoindependentTwo independent SRS’s (or randomly assigned treatments) Populations > 10n Both sampling dist.’s.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Two-Sample Proportions Inference
Two-Sample Proportions Inference
One-Sample Inference for Proportions
Two-Sample Proportions Inference
CHAPTER 6 Statistical Inference & Hypothesis Testing
Comparing Two Proportions
Chapter 8: Inference for Proportions
Two-Sample Proportions Inference
Two-Sample Hypothesis Test of Proportions
Two-Sample Proportions Inference
Comparing Two Proportions
Two-Sample Inference Procedures with Means
Presentation transcript:

Two-Sample Proportions Inference

Sampling Distributions for the difference in proportions When tossing pennies, the probability of the coin landing on heads is 0.5. However, when spinning the coin, the probability of the coin landing on heads is 0.4. Let’s investigate. Pairs of students will be given pennies and assigned to either flip or spin the penny

Looking at the sampling distribution of the difference in sample proportions: What is the mean of the difference in sample proportions (flip - spin)? What is the standard deviation of the difference in sample proportions (flip - spin)? Can the sampling distribution of difference in sample proportions (flip - spin) be approximated by a normal distribution? What is the probability that the difference in proportions (flipped – spun) is at least.25? Yes, since n 1 p 1 =12.5, n 1 (1-p 1 )=12.5, n 2 p 2 =10, n 2 (1-p 2 )=15 – so all are at least 5)

Assumptions: TwoindependentTwo, independent SRS’s from populations ( or randomly assigned treatments) Populations at least 10n Normal approximation for both

The National Sleep Foundation asked a random sample of U.S. adults questions about their sleep habits. One of the questions asked about snoring. Of the 995 respondents, 37% of adults reported that they snored at least a few nights a week during the past year. Would you expect that percentage to be the same for all age groups? Split into two age categories, 26% of the 184 people under 30 snored, compared with 39% of the 811 in the older group. Is this difference of 13% real, or due only to natural fluctuations in the sample we’ve chose? and overTotal Snore Don’t snore Total HYPOTHESIS TEST! For the true DIFFERENCE between snoring rates!

Steps: 1)Assumptions 2)Hypothesis statements & define parameters 3)Calculations 4)Conclusion, in context

Assumptions: TwoindependentTwo, independent SRS’s from populations ( or randomly assigned treatments) Populations at least 10n Normal approximation for both

and overTotal Snore48 (26%)318 (39%)366 Don’t snore Total Assumptions: Ages are independent of each other in random sampleAges are independent of each other in random sample 995 is less than 10% of all adults Normal approximation for both

Hypothesis statements: H 0 : p 1 - p 2 = 0 H a : p 1 - p 2 > 0 H a : p 1 - p 2 < 0 H a : p 1 - p 2 ≠ 0 Be sure to define both p 1 & p 2 ! H 0 : p 1 = p 2 H a : p 1 > p 2 H a : p 1 < p 2 H a : p 1 ≠ p 2

Hypothesis statements: H 0 : There is no difference in snoring rates in the two age groups p old – p young = 0 p 1 = p 2 H a : There is a difference in snoring rates in the two age groups p old – p young ≠ 0 p 1 ≠ p 2

Since we assume that the population proportions are equal in the null hypothesis, the variances are equal. Therefore, we combine the variances!

Formula for Hypothesis test: Test statistic:

P-value = 2(area to right of z = 3.33) =.0008

“Since the p-value ) , I reject (fail to reject) the H 0. There is (is not) sufficient evidence to suggest that H a.” Conclusion: Since the p-value is less that my significance level, I reject the null hypothesis of no difference. There is sufficient evidence to suggest that there is a difference in the rate of snoring between older adults and younger adults.

Formula for Hypothesis test: Usually p 1 – p 2 =0

Example - Student Retention A group of college students were asked what they thought the “issue of the day”. Without a pause the class almost to a person said “student retention”. The class then went out and obtained a random sample (questionable) and asked the question, “Do you plan on returning next year?” The responses along with the gender of the person responding are summarized in the following table. Test to see if the proportion of students planning on returning is the same for both genders at the 0.05 level of significance.

Assumptions: The two samples are independently chosen random samples. Sample is less than 10% of college population. The sample sizes are large enough since n 1 p 1 = 211  10, n 1 (1- p 1 ) = 64  10 n 2 p 2 = 141  10, n 2 (1- p 2 ) = 41  10 so an approx normal model can be used. Significance level:  = 0.05 Example - Student Retention

p m = true proportion of males who plan on returning p f = true proportion of females who plan on returning n m = 275 (number of males surveyed) n f = 182 (number of females surveyed) (sample proportion of males who plan on returning) (sample proportion of females who plan on returning) Null hypothesis: H 0 : p m – p f = 0 Alternate hypothesis: H a : p m – p f  0 Example - Student Retention

Calculations : Test statistic:

P-value: The P-value for this test is 2 times the area under the z curve to the left of the computed z = P-value = 2(0.4247) = Conclusion: Since P-value = > 0.05 = , the hypothesis H 0 is not rejected at significance level There is no evidence that the return rate is different for males and females.

Example 4: A forest in Oregon has an infestation of spruce moths. In an effort to control the moth, one area has been regularly sprayed from airplanes. In this area, a random sample of 495 spruce trees showed that 81 had been killed by moths. A second nearby area receives no treatment. In this area, a random sample of 518 spruce trees showed that 92 had been killed by the moth. Do these data indicate that the proportion of spruce trees killed by the moth is different for these areas?

Assumptions: Have 2 independent SRS of spruce trees Both distributions are approximately normal since n 1 p 1 =81, n 1 (1-p 1 )=414, n 2 p 2 =92, n 2 (1-p 2 )=426 and all > 10 Population of spruce trees is at least 10,130. H 0 : p 1 = p 2 where p 1 is the true proportion of trees killed by moths H a : p 1 ≠ p 2 in the treated area p 2 is the true proportion of trees killed by moths in the untreated area P-value =  = 0.05 Since p-value > , I fail to reject H 0. There is not sufficient evidence to suggest that the proportion of spruce trees killed by the moth is different for these areas

and overTotal Snore Don’t snore Total Back to snoring…. What if I wanted to know the true difference in the population proportion of young adults who snore and the proportion of older adults who snore? CONFIDENCE INTERVAL! For the true DIFFERENCE between snoring rates!

What are the steps for performing a confidence interval? 1.) Identify the interval by name or formula (CI for two-sample proportion) 2.) Assumptions TwoindependentTwo, independent SRS’s from populations ( or randomly assigned treatments) Populations at least 10n Normal approximation for both 3.) Calculations 4.) Conclusion (in context of problem)

Formula for confidence interval: Note: use p-hat when p is not known Standard error! Margin of error!

Two-sample Confidence Interval for Proportions Conditions for inference have previously been met We are 95% confidence that the proportion of people who snore is between 5.92% and 20.28% higher for older adults than for younger adults.

Example 1: At Community Hospital, the burn center is experimenting with a new plasma compress treatment. A random sample of 316 patients with minor burns received the plasma compress treatment. Of these patients, it was found that 259 had no visible scars after treatment. Another random sample of 419 patients with minor burns received no plasma compress treatment. For this group, it was found that 94 had no visible scars after treatment. What is the shape & standard error of the sampling distribution of the difference in the proportions of people with no visible scars between the two groups? Since n 1 p 1 =259, n 1 (1-p 1 )=57, n 2 p 2 =94, n 2 (1-p 2 )=325 and all > 5, then the distribution of difference in proportions is approximately normal.

Example 1: At Community Hospital, the burn center is experimenting with a new plasma compress treatment. A random sample of 316 patients with minor burns received the plasma compress treatment. Of these patients, it was found that 259 had no visible scars after treatment. Another random sample of 419 patients with minor burns received no plasma compress treatment. For this group, it was found that 94 had no visible scars after treatment. What is a 95% confidence interval of the difference in proportion of people who had no visible scars between the plasma compress treatment & control group?

Assumptions: Have 2 independent randomly assigned treatment groups Both distributions are approximately normal since n 1 p 1 =259, n 1 (1-p 1 )=57, n 2 p 2 =94, n 2 (1-p 2 )=325 and all > 5 Population of burn patients is at least Since these are all burn patients, we can add = 735. If not the same – you MUST list separately. We are 95% confident that the true proportion of people who had no visible scars between the plasma compress treatment is between 53.7% and 65.4% higher than for the control group.

Example 2: Suppose that researchers want to estimate the difference in proportions of people who are against the death penalty in Texas & in California. If the two sample sizes are the same, what size sample is needed to be within 2% of the true difference at 90% confidence? Since both n’s are the same size, you have common denominators – so add! n = 3383

Do you think that the proportion of defective PEANUT M&Ms is higher than The proportion of defective PLAIN M&MS?