Objectives (PSLS Chapter 19) Inference for a population proportion  Conditions for inference on proportions  The sample proportion (p hat )  The sampling.

Slides:



Advertisements
Similar presentations
Comparing Two Proportions
Advertisements

From the Data at Hand to the World at Large Chapters 19, 23 Confidence Intervals Estimation of population parameters: an unknown population proportion.
Part II: Significance Levels, Type I and Type II Errors, Power
Inference for Proportions Inference for a Single Proportion
9.2a Tests about a Population Proportion Target Goal: I can check the conditions for carrying out a test about a population proportion. I can perform a.
12.1 Inference for A Population Proportion.  Calculate and analyze a one proportion z-test in order to generalize about an unknown population proportion.
Lecture Unit 5 Section 5.4 Testing Hypotheses about Proportions 1.
Statistical Inference: Confidence Intervals
© 2010 Pearson Prentice Hall. All rights reserved Hypothesis Testing Using a Single Sample.
Introduction to Inference Estimating with Confidence Chapter 6.1.
Inference about a population proportion Chapter 20 © 2006 W.H. Freeman and Company.
Chapter 11: Inference for Distributions
LECTURE UNIT 5 Confidence Intervals (application of the Central Limit Theorem) Sections 5.1, 5.2 Introduction and Confidence Intervals for Proportions.
Chapter 20 Testing Hypotheses about Proportions 1.
From the Data at Hand to the World at Large Chapter 19 Confidence Intervals for an Unknown Population p Estimation of a population parameter: Estimating.
Confidence Intervals Mrs. Medina.
Chapters 20, 21 Testing Hypotheses about Proportions 1.
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
Comparing 2 proportions BPS chapter 21 © 2006 W. H. Freeman and Company These PowerPoint files were developed by Brigitte Baldi at the University of California,
Inferences Based on Two Samples
Inference about a population proportion BPS chapter 20 © 2006 W.H. Freeman and Company.
Chapter 8 Inference for Proportions
+ DO NOW What conditions do you need to check before constructing a confidence interval for the population proportion? (hint: there are three)
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.3 Estimating a Population Mean.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Inference for proportions - Inference for a single proportion IPS chapter 8.1 © 2006 W.H. Freeman and Company.
Confidence intervals for Proportions
CHAPTER 20: Inference About a Population Proportion ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
CHAPTER 18: Inference about a Population Mean
Significance Tests: THE BASICS Could it happen by chance alone?
Objectives (BPS chapter 20) Inference for a population proportion  The sample proportion  The sampling distribution of  Large sample confidence interval.
BPS - 5th Ed. Chapter 191 Inference about a Population Proportion.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
CHAPTER 20: Inference About a Population Proportion
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
CHAPTER 16: Inference in Practice ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.3 Estimating a Population Mean.
Chapter 8 Testing Hypotheses about Proportions Part II: Significance Levels, Type I and Type II Errors, Power 1.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
Essential Statistics Chapter 141 Thinking about Inference.
AP STATISTICS LESSON INFERENCE FOR A POPULATION PROPORTION.
CHAPTER 20: Inference About a Population Proportion ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
From the Data at Hand to the World at Large Chapter 19 Confidence Intervals for an Unknown Population p Estimation of a population parameter: Estimating.
CHAPTER 16: Inference in Practice ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
BPS - 5th Ed. Chapter 151 Thinking about Inference.
1 Chapter 18 Inference about a Population Proportion.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 11.1 Estimating a Population Mean.
Lecture Unit 5 Section 5.4 Testing Hypotheses about Proportions 1.
BPS - 3rd Ed. Chapter 191 Comparing Two Proportions.
+ Unit 5: Estimating with Confidence Section 8.3 Estimating a Population Mean.
AP Statistics Confidence intervals for Proportions Chapter 19.
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 8: Tests of significance and confidence.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Confidence Interval for a Proportion Adapted from North Carolina State University.
21. The chi-square test for goodness of fit
Chapter 19 Testing Hypotheses about Proportions
Chapter 8: Inference for Proportions
Chapter 6 Testing Hypotheses about Proportions
The Practice of Statistics in the Life Sciences Fourth Edition
CHAPTER 20: Inference About a Population Proportion
CHAPTER 22: Inference about a Population Proportion
Inference for Proportions
Presentation transcript:

Objectives (PSLS Chapter 19) Inference for a population proportion  Conditions for inference on proportions  The sample proportion (p hat )  The sampling distribution of  Significance test for a proportion  Confidence interval for p  Sample size for a desired margin of error

Conditions for inference on proportions Assumptions: 1.The data used for the estimate are a random sample from the population studied. 2.The population is at least 20 times as large as the sample. This ensures independence of successive trials in the random sampling. 3.The sample size n is large enough that the shape of the sampling distribution is approximately Normal. How large depends on the type of inference conducted.

We treat a group of 120 Herpes patients with a new drug; 30 get better: p ̂ = (??)/(??) = ?? (proportion of patients improving in sample) The sample proportion p ̂ We now study categorical data and draw inference on the proportion, or percentage, of the population with a specific characteristic. If we call a given categorical characteristic in the population “success,” then the sample proportion of successes, (p hat ) is:

Sampling distribution of p ̂

The mean and standard deviation (width) of the sampling distribution are both completely determined by p and n.

Significance test for p When testing: H 0 : p = p 0 (a given value we are testing) This is valid when both expected counts — expected successes np 0 and expected failures n(1 − p 0 ) — are each 10 or larger.

P-value for a one or two sided alternative The P-value is the probability, if H 0 was true, of obtaining a test statistic like the one computed or more extreme in the direction of H a.

Aphids evade predators (ladybugs) by dropping off the leaf. An experiment examined the mechanism of aphid drops. “When dropped upside-down from delicate tweezers, live aphids landed on their ventral side in 95% (sample proportion) of the trials (19 out of 20). In contrast, dead aphids landed on their ventral side in 52.2% of the trials (12 out of 23).” Is there evidence (at significance level 5%) that live aphids land right side up (on their ventral side) more often than chance would predict? Here, “chance” would be 50% ventral landings. So we test: The expected counts of success and failure are each 10, so the z procedure is valid. The test P-value is P(Z ≥ 4.02). From Table B, P = P(Z < -4.02) <.0002, highly significant. We reject H 0. There is very strong evidence (P <.0002) that the righting behavior of live aphids is better than chance.

Mendel’s first law of genetic inheritance states that crossing dominant and recessive homozygote parents yields a second generation made of 75% of dominant-trait individuals. When Mendel crossed pure breeds of plants producing smooth peas and plants producing wrinkled peas, the second generation (F2), was made of 5474 smooth peas and 1850 wrinkled peas. Do these data provide evidence that the proportion of smooth peas in the F2 population is not 75%? The sample proportion of smooth peas is: We test: From Table B, we find P = 2P(Z < -.59) = 2 x.2776 =.56, not significant. We fail to reject H 0. The data are consistent with a dominant-recessive genetic model.

Confidence interval for p When p is unknown, both the center and the spread of the sampling distribution are unknown  problem. We need to “guess” a value for p. Our options: This is the “large sample method”. It performs poorly when sample size is small. This is the “plus four method”. It is reasonably accurate. Always use with caution

Large-sample confidence interval for p Use this method when the number of successes and the number of failures are both at least 15. C z*-z* m Confidence intervals contain the population proportion p in C % of samples. For an SRS of size n drawn from a large population and with sample proportion p ̂ calculated from the data, an approximate level C confidence interval for p is C is the area under the standard normal curve between -z* and z*.

Medication side effects Arthritis is a painful, chronic inflammation of the joints. An experiment on the side effects of pain relievers examined arthritis patients to find the proportion of patients who suffer side effects. What are some side effects of ibuprofen? Serious side effects (seek medical attention immediately): Allergic reactions (difficulty breathing, swelling, or hives) Muscle cramps, numbness, or tingling Ulcers (open sores) in the mouth Rapid weight gain (fluid retention) Seizures Black, bloody, or tarry stools Blood in your urine or vomit Decreased hearing or ringing in the ears Jaundice (yellowing of the skin or eyes) Abdominal cramping, indigestion, or heartburn Less serious side effects (discuss with your doctor): Dizziness or headache Nausea, gaseousness, diarrhea, or constipation Depression Fatigue or weakness Dry mouth Irregular menstrual periods.

We compute a 90% confidence interval for the population proportion of arthritis patients who suffer some "adverse symptoms." What is the sample proportion p ̂ ? For a 90% confidence level, z* = Using the large sample method:  With 90% confidence level, between 3.5% and 6.9% of arthritis patients taking this pain medication experience some adverse symptoms.

“Plus four” confidence interval for p The “plus four” method gives reasonably accurate confidence intervals. We act as if we had four additional observations, two successes and two failures. Thus, the new sample size is n + 4 and the count of successes is X + 2. The “plus four” estimate of p is: An approximate level C confidence interval is: Use this method when C is at least 90% and sample size is at least 10.

We want a 90% CI for the population proportion of arthritis patients who suffer some “adverse symptoms.” An approximate 90% confidence interval for p using the “plus four” method is:  With 90% confidence, between 3.8% and 7.4% of the population of arthritis patients taking this pain medication experience some adverse symptoms. What is the value of the “plus four” estimate of p?

Sample size for a desired margin of error You may need to choose a sample size large enough to achieve a specified margin of error. Because the sampling distribution of p ̂ is a function of the unknown population proportion p this process requires that you guess a likely value for p: p*. Make an educated guess, or use p* = 0.5 (most conservative estimate).

What sample size would we need in order to achieve a margin of error no more than 0.01 (1 percentage point) with a 90% confidence level? We could use 0.5 for our guessed p*. However, since the drug has been approved for sale over the counter, we can safely assume that no more than 10% of patients should suffer “adverse symptoms” (a better guess than 50%). For a 90% confidence level, z* =  To obtain a margin of error no more than 0.01 we need a sample size n of at least 2436 arthritis patients.

Sample size and margin of error continued  Ex) What sample size would we need in order to achieve a margin of error no more than 0.03 (3 percentage point) with a 95% confidence level? We need at least 385.