Confidence Interval (CI) for a Proportion Choosing the Sample Size.

Slides:



Advertisements
Similar presentations
Estimating Population Values
Advertisements

Inference about a Population Proportion
Example 8.12 Controlling Confidence Interval Length.
BUS 220: ELEMENTARY STATISTICS
Copyright © 2010 Pearson Education, Inc. Slide
Estimating a Population Proportion
SADC Course in Statistics Further ideas concerning confidence intervals (Session 06)
The basics for simulations
Chapter 8 Interval Estimation
Understanding p-values Annie Herbert Medical Statistician Research and Development Support Unit
Healey Chapter 7 Estimation Procedures
 With replacement or without replacement?  Draw conclusions about a population based on data about a sample.  Ask questions about a number which describe.
Module 16: One-sample t-tests and Confidence Intervals
Putting Statistics to Work
Confidence Intervals with Proportions
Introduction to Inference
Statistical Inferences Based on Two Samples
Dr Richard Bußmann CHAPTER 12 Confidence intervals for means.
Chapter 8 Estimation Understandable Statistics Ninth Edition
The Right Questions about Statistics: How confidence intervals work Maths Learning Centre The University of Adelaide A confidence interval is designed.
CHAPTER 20: Inference About a Population Proportion
CHAPTER 14: Confidence Intervals: The Basics
Multiple Regression and Model Building
6.2 Confidence Intervals for the Mean (Small Samples) Statistics Mrs. Spitz Spring 2009.
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Unit 4 – Inference from Data: Principles
Chapter 8 Estimating with Confidence
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Choosing Sample Size and Using Your Calculator Presentation 9.3.
Chapter 19 Confidence Intervals for Proportions.
Statistics Interval Estimation.
Chapter 19: Confidence Intervals for Proportions
Chapter 8 Inference for Proportions
Calculating sampling error Actually, we’ll calculate what is known (technically) as the confidence interval. This is the “+ or =” number that you hear.
7-2 Estimating a Population Proportion
Choosing the Sample Size. Confidence Interval for a Mean Given A random sample of size n from a Normal population with mean . (n/N  0.05) Result A confidence.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.2 Estimating a Population Proportion Objective Find the confidence.
Sample size. Ch 132 Sample Size Formula Standard sample size formula for estimating a percentage:
Chapter 7 Confidence Intervals and Sample Sizes
1 1 Slide © 2006 Thomson/South-Western Chapter 8 Interval Estimation Population Mean:  Known Population Mean:  Known Population Mean:  Unknown Population.
Chapter 8 - Interval Estimation
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
Statistics: Concepts and Controversies What Is a Confidence Interval?
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Statistical Sampling & Analysis of Sample Data
2.6 Confidence Intervals and Margins of Error. What you often see in reports about studies… These results are accurate to within +/- 3.7%, 19 times out.
10.1 DAY 2: Confidence Intervals – The Basics. How Confidence Intervals Behave We select the confidence interval, and the margin of error follows… We.
10.1: Confidence Intervals – The Basics. Review Question!!! If the mean and the standard deviation of a continuous random variable that is normally distributed.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
AP STATISTICS LESSON INFERENCE FOR A POPULATION PROPORTION.
Inference about a Population Proportion BPS chapter 19 © 2010 W.H. Freeman and Company.
Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.
Chapter 8: Estimating with Confidence
CONFIDENCE STATEMENT MARGIN OF ERROR CONFIDENCE INTERVAL 1.
Ch 12 – Inference for Proportions YMS 12.1
4.4.2 Normal Approximations to Binomial Distributions
Sample Size Mahmoud Alhussami, DSc., PhD. Sample Size Determination Is the act of choosing the number of observations or replicates to include in a statistical.
Margin of Error S-IC.4 Use data from a sample survey to estimate a population mean or proportion; develop a margin of error through the use of simulation.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
O Make Copies of: o Areas under the Normal Curve o Appendix B.1, page 784 o (Student’s t distribution) o Appendix B.2, page 785 o Binomial Probability.
Sampling: Distribution of the Sample Mean (Sigma Known) o If a population follows the normal distribution o Population is represented by X 1,X 2,…,X N.
Chapter 9 Estimation and Confidence Intervals. Our Objectives Define a point estimate. Define level of confidence. Construct a confidence interval for.
Estimation and Confidence Intervals. Point Estimate A single-valued estimate. A single element chosen from a sampling distribution. Conveys little information.
St. Edward’s University
Sample Size and Accuracy
Estimating a Population Proportion
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 8: Confidence Intervals
Presentation transcript:

Confidence Interval (CI) for a Proportion Choosing the Sample Size

Confidence Interval for p When we collect data from our 1 random sample and compute the sample proportion, the interval of values forms an (approximate) confidence interval (CI) for p.

Example A marketing firm intends to survey newspaper readers to determine what percent of readers notice a particular ad campaign. They will summarize their data and form a 95% confidence interval; the interval will be reported back to the company purchasing the ads. The company has requested an error margin no greater than What should be the sample size n?

Error Margin Before the study is conducted, E is unknown. If the goal is to sample enough to have an error margin no greater than a target value E:

Example A marketing firm intends to survey newspaper readers to determine what percent of readers notice a particular ad campaign. They will summarize their data and form a 95% confidence interval; the interval will be reported back to the company purchasing the ads. The company has requested an error margin no greater than 0.05.

“Ignorant” Solution Use 0.5 as the prevalence. It’s impossible to sample 0.16 of a reader. Answer: Sample at least 385 readers. Sample sizes must be whole numbers.

Implementing the Solution 320 of the 385 readers noticed the campaign. The actual error margin is Had 0.83 had been known in advance (it wasn’t), the required sample size would have been 217.

Flaw in Ignorant Solution If you use 0.5 to determine the sample size, you will get an error margin no more than the desired value. But…Unless the prevalence turns out to be 0.5 exactly, the error margin will be less than desired. The error margin will be considerably less than desired if the prevalence is far from 0.5. Time and money are wasted.

Example How many trees should we sample in order to estimate the proportion expected to die with error margin no greater than 0.02 = 2%? Assume we want 99% confidence in our result. Past history suggests a result in the vicinity of A 90% CI based on 216 trees yielded ± We might try values 0.15 to 0.25 in order to obtain an indication of what the sample size should be.

Example The actual error margin will depend on the observed proportion. If it is closer to 0.5 than what we use to get a sample size, then we will not meet the desired error margin. We won’t meet the goal. That is why 0.5 is guaranteed to get the error margin. If it is further from 0.5 than what we use, then we will undershoot the desired error margin. This is fine, except that it adds to the expense of the study. For values 0.35 – 0.65, using 0.5 as a guess generally doesn’t oversample by too much.

Example The proportion of students who have had the flu (through 11/4/2009) was estimated with a sample of n = 62. The 90% confidence interval was:  0.080(0.097, 0.257) How many students should be sampled to reduce the error margin by half to  0.04?

Example  0.080(0.097, 0.257) Our result supports a future result between about 0.10 and Our best guess would be about AssumingRequired n 0.18

Example  0.080(0.097, 0.257) Our result supports a future result between about 0.10 and Our best guess would be about AssumingRequired n (about 4  62)

Relation between n and E In general, the larger n is, the smaller E is. If we only compare situations with the same confidence and proportion, then Reducing the error margin by a multiplicative factor of k requires increasing the sample size by a factor of k 2. Ex: Making the error margin twice (2 times) as small requires making the sample size 2 2 = 4 times bigger. For E = 0.02 (4 times smaller), n = 16(62) = 992. For E = 0.01 (8 times smaller), n = 64(62) = 3968

Example  0.080(0.097, 0.257) Our result supports a future result between about 0.10 and Our best guess would be about AssumingRequired n (about 4  62) (ignorant)423

Example

A computer manufacturer’s tech support office wants to assess the percent of customers who make service calls within the first month. The company wants to be 90% confident that the sample percentage is within two percentage points of the true percentage Past surveys have revealed this figure to be in the 5 – 15% range.

Example You try (The confidence is 90%. The desired error margin is 2%.) What is the required sample size? (Give a whole number as answer.) The required sample size is 609.

Solutions Guessed value Minimum n

Solutions Guessed value Minimum n

Trade-off for any solutions Some prevalence must be assumed to obtain a sample size. If the result is closer to 0.5 than what was assumed, the error margin will be larger than desired. If the result is farther from 0.5 that what was assumed, resources will have been wasted.

Reasonable Compromise Use 0.5 if you have no idea, or if you anticipate a prevalence close to 0.5. In public opinion polls for a 2-candidate election, the prevalence is often near 0.5. So 0.5 is used. (Remember: 95% confidence for media polls.) If you have a guessed (range of) value(s), use it, but recognize that: an actual result closer to 0.5 will cause you to miss the objective; a result further from 0.5 will cause you to oversample.