Inference for proportions - Inference for a single proportion IPS chapter 8.1 © 2006 W.H. Freeman and Company.

Slides:



Advertisements
Similar presentations
From the Data at Hand to the World at Large Chapters 19, 23 Confidence Intervals Estimation of population parameters: an unknown population proportion.
Advertisements

Part II: Significance Levels, Type I and Type II Errors, Power
Inference for Proportions Inference for a Single Proportion
Significance Tests About
Business Statistics for Managerial Decision
Lecture Unit 5 Section 5.4 Testing Hypotheses about Proportions 1.
Inference about a population proportion Chapter 20 © 2006 W.H. Freeman and Company.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Chapter 9 Hypothesis Testing.
LECTURE UNIT 5 Confidence Intervals (application of the Central Limit Theorem) Sections 5.1, 5.2 Introduction and Confidence Intervals for Proportions.
Chapter 20 Testing Hypotheses about Proportions 1.
From the Data at Hand to the World at Large Chapter 19 Confidence Intervals for an Unknown Population p Estimation of a population parameter: Estimating.
Confidence Intervals Mrs. Medina.
Chapters 20, 21 Testing Hypotheses about Proportions 1.
Confidence Intervals and Hypothesis Testing - II
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
Inference about a population proportion BPS chapter 20 © 2006 W.H. Freeman and Company.
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
October 15. In Chapter 9: 9.1 Null and Alternative Hypotheses 9.2 Test Statistic 9.3 P-Value 9.4 Significance Level 9.5 One-Sample z Test 9.6 Power and.
Confidence intervals for Proportions
CHAPTER 20: Inference About a Population Proportion ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Inference for Proportions
Significance Tests: THE BASICS Could it happen by chance alone?
Significance Toolbox 1) Identify the population of interest (What is the topic of discussion?) and parameter (mean, standard deviation, probability) you.
Objectives (BPS chapter 20) Inference for a population proportion  The sample proportion  The sampling distribution of  Large sample confidence interval.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
CHAPTER 9 Testing a Claim
Chapter 8 Testing Hypotheses about Proportions Part II: Significance Levels, Type I and Type II Errors, Power 1.
Essential Statistics Chapter 141 Thinking about Inference.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Section 9.2 Tests About a Population Proportion. Section 9.2 Tests About a Population Proportion After this section, you should be able to… CHECK conditions.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Chapter 9: Testing a Claim Section 9.2 Tests About a Population Proportion.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
Inference about a Population Proportion BPS chapter 19 © 2010 W.H. Freeman and Company.
CHAPTER 20: Inference About a Population Proportion ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
12.2 (13.2) Comparing Two Proportions. The Sampling Distribution of.
Confidence intervals: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
12.1 Inference for A Population Proportion.  Calculate and analyze a one proportion z-test in order to generalize about an unknown population proportion.
From the Data at Hand to the World at Large Chapter 19 Confidence Intervals for an Unknown Population p Estimation of a population parameter: Estimating.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
© Copyright McGraw-Hill 2004
Lecture Unit 5 Section 5.4 Testing Hypotheses about Proportions 1.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.2 Tests About a Population.
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 8: Tests of significance and confidence.
Hypothesis Tests for 1-Proportion Presentation 9.
Objectives (PSLS Chapter 19) Inference for a population proportion  Conditions for inference on proportions  The sample proportion (p hat )  The sampling.
Daniel S. Yates The Practice of Statistics Third Edition Chapter 12: Significance Tests in Practice Copyright © 2008 by W. H. Freeman & Company.
The Practice of Statistics Third Edition Chapter 12: Significance Tests in Practice Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Confidence Interval for a Proportion Adapted from North Carolina State University.
Chapter Nine Hypothesis Testing.
Chapter 19 Testing Hypotheses about Proportions
Chapter 6 Testing Hypotheses about Proportions
The Practice of Statistics in the Life Sciences Fourth Edition
CHAPTER 22: Inference about a Population Proportion
Presentation transcript:

Inference for proportions - Inference for a single proportion IPS chapter 8.1 © 2006 W.H. Freeman and Company

Objectives (IPS chapter 8.1) Inference for a single proportion  Conditions for inference on p  Large-sample confidence interval for p  “Plus four” confidence interval for p  Significance test for a proportion  Sample size for a desired margin of error

Sampling distribution of p ^ — reminder The sampling distribution of a sample proportion is approximately normal (normal approximation of a binomial distribution) when the sample size is large enough.

Conditions for inference on p Assumptions: 1.The data used for the estimate are an SRS from the population studied. 2.The population is at least 10 times as large as the sample used for inference. This ensures that the standard deviation of is close to 3.The sample size n is large enough that the sampling distribution can be approximated with a normal distribution. How large a sample size is required depends in part on the value of p and the test conducted. Otherwise, rely on the binomial distribution.

Large-sample confidence interval for p Use this method when the number of successes and the number of failures are both at least 15. C Z*Z*−Z*−Z* m Confidence intervals contain the population proportion p in C% of samples. For an SRS of size n drawn from a large population and with sample proportion calculated from the data, an approximate level C confidence interval for p is: C is the area under the standard normal curve between −z* and z*.

Medication side effects Arthritis is a painful, chronic inflammation of the joints. An experiment on the side effects of pain relievers examined arthritis patients to find the proportion of patients who suffer side effects. What are some side effects of ibuprofen? Serious side effects (seek medical attention immediately): Allergic reaction (difficulty breathing, swelling, or hives), Muscle cramps, numbness, or tingling, Ulcers (open sores) in the mouth, Rapid weight gain (fluid retention), Seizures, Black, bloody, or tarry stools, Blood in your urine or vomit, Decreased hearing or ringing in the ears, Jaundice (yellowing of the skin or eyes), or Abdominal cramping, indigestion, or heartburn, Less serious side effects (discuss with your doctor): Dizziness or headache, Nausea, gaseousness, diarrhea, or constipation, Depression, Fatigue or weakness, Dry mouth, or Irregular menstrual periods

Let’s calculate a 90% confidence interval for the population proportion of arthritis patients who suffer some “adverse symptoms.” What is the sample proportion ? What is the sampling distribution for the proportion of arthritis patients with adverse symptoms for samples of 440? For a 90% confidence level, z* = Using the large sample method, we calculate a margin of error m:  With 90% confidence level, between 3.46% and 6.94% of arthritis patients taking this pain medication experience some adverse symptoms.

Because we have to use an estimate of p to compute the margin of error, confidence intervals for a population proportion are not very accurate. Specifically, we tend to be incorrect more often than the confidence level would indicate. But there is no systematic amount (because it depends on p). Use with caution!

“Plus four” confidence interval for p A simple adjustment produces more accurate confidence intervals. We act as if we had four additional observations, two being successes and two being failures. Thus, the new sample size is n + 4 and the count of successes is X + 2. The “plus four” estimate of p is: And an approximate level C confidence interval is: Use this method when C is at least 90% and sample size is at least 10.

We now use the “plus four” method to calculate the 90% confidence interval for the population proportion of arthritis patients who suffer some “adverse symptoms.” An approximate 90% confidence interval for p using the “plus four” method is:  With 90% confidence level, between 3.8% and 7.4% of arthritis patients taking this pain medication experience some adverse symptoms. What is the value of the “plus four” estimate of p?

Significance test for p The sampling distribution for is approximately normal for large sample sizes and its shape depends solely on p and n. Thus, we can easily test the null hypothesis: H 0 : p = p 0 (a given value we are testing). If H 0 is true, the sampling distribution is known  The likelihood of our sample proportion given the null hypothesis depends on how far from p 0 our is in units of standard deviation. This is valid when both expected counts—expected successes np 0 and expected failures n(1 − p 0 )—are each 10 or larger.

P-values and one or two sided hypotheses — reminder And as always, if the p-value is smaller than the chosen significance level , then the difference is statistically significant and we reject H 0.

A national survey by the National Institute for Occupational Safety and Health on restaurant employees found that 75% said that work stress had a negative impact on their personal lives. You investigate a restaurant chain to see if the proportion of all their employees negatively affected by work stress differs from the national proportion p 0 = H 0 : p = p 0 = 0.75 vs. H a : p ≠ 0.75 (2 sided alternative) In your SRS of 100 employees, you find that 68 answered “Yes” when asked, “Does work stress have a negative impact on your personal life?” The expected counts are 100 × 0.75 = 75 and 25. Both are greater than 10, so we can use the z-test. The test statistic is:

From Table A we find the area to the left of z=1.62 is Thus P(Z ≥ 1.62) = 1 − , or Since the alternative hypothesis is two-sided, the P-value is the area in both tails, and P = 2 × =  The chain restaurant data are not significantly different from the national survey results (pˆ = 0.68, z = 1.62, P = 0.11).

Software gives you summary data (sample size and proportion) as well as the actual p-value. Minitab Crunch It!

Interpretation: magnitude vs. reliability of effects The reliability of an interpretation is related to the strength of the evidence. The smaller the p-value, the stronger the evidence against the null hypothesis and the more confident you can be about your interpretation. The magnitude or size of an effect relates to the real-life relevance of the phenomenon uncovered. The p-value does NOT assess the relevance of the effect, nor its magnitude. A confidence interval will assess the magnitude of the effect. However, magnitude is not necessarily equivalent to how theoretically or practically relevant an effect is.

Sample size for a desired margin of error You may need to choose a sample size large enough to achieve a specified margin of error. However, because the sampling distribution of is a function of the population proportion p, this process requires that you guess a likely value for p: p*. The margin of error will be less than or equal to m if p* is chosen to be 0.5. Remember, though, that sample size is not always stretchable at will. There are typically costs and constraints associated with large samples.

What sample size would we need in order to achieve a margin of error no more than 0.01 (1%) for a 90% confidence interval for the population proportion of arthritis patients who suffer some “adverse symptoms.” We could use 0.5 for our guessed p*. However, since the drug has been approved for sale over the counter, we can safely assume that no more than 10% of patients should suffer “adverse symptoms” (a better guess than 50%). For a 90% confidence level, z* =  To obtain a margin of error no more than 1%, we would need a sample size n of at least 2435 arthritis patients.