Chapter 7 Statistical Inference: Confidence Intervals

Slides:



Advertisements
Similar presentations
Chapter 8: Estimating with Confidence
Advertisements

Chapter 10: Estimating with Confidence
5. Statistical Inference: Estimation
INFERENCE: CONFIDENCE INTERVALS
Chapter 19 Confidence Intervals for Proportions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Sampling Distributions
7-2 Estimating a Population Proportion
Chapter 11: Inference for Distributions
BCOR 1020 Business Statistics
Chapter 10: Estimating with Confidence
Inferential Statistics
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Confidence Intervals for the Mean (σ Unknown) (Small Samples)
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved Estimation and Confidence Intervals.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Chapter 6 Confidence Intervals 1 Larson/Farber 4th ed.
Chapter 8: Statistical Inference: Confidence Intervals
1 Math 10 Part 5 Slides Confidence Intervals © Maurice Geraghty, 2009.
Chapter 7 Statistical Inference: Confidence Intervals
6 Chapter Confidence Intervals © 2012 Pearson Education, Inc.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Agresti/Franklin Statistics, 1e, 1 of 139  Section 6.4 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
1 Chapter 6 Estimates and Sample Sizes 6-1 Estimating a Population Mean: Large Samples / σ Known 6-2 Estimating a Population Mean: Small Samples / σ Unknown.
Unit 6 Confidence Intervals If you arrive late (or leave early) please do not announce it to everyone as we get side tracked, instead send me an .
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Lecture 7 Dustin Lueker. 2  Point Estimate ◦ A single number that is the best guess for the parameter  Sample mean is usually at good guess for the.
Agresti/Franklin Statistics, 1 of 87  Section 7.2 How Can We Construct a Confidence Interval to Estimate a Population Proportion?
Chapter 8: Estimating with Confidence
Estimation of a Population Mean
+ DO NOW. + Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Chapter 19 Confidence intervals for proportions
CONFIDENCE INTERVALS.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 11.1 Estimating a Population Mean.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Section 6.2 Confidence Intervals for the Mean (Small Samples) Larson/Farber 4th ed.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 8 Statistical Inference: Confidence Intervals Section 8.1 Point and Interval Estimates.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
FINAL EXAMINATION STUDY MATERIAL II
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Sampling Distributions and Estimation
Chapter 6 Confidence Intervals.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Presentation transcript:

Chapter 7 Statistical Inference: Confidence Intervals Learn …. How to Estimate a Population Parameter Using Sample Data

What Are Point and Interval Estimates of Population Parameters? Section 7.1 What Are Point and Interval Estimates of Population Parameters?

Point Estimate A point estimate is a single number that is our “best guess” for the parameter

Interval Estimate An interval estimate is an interval of numbers within which the parameter value is believed to fall.

Point Estimate vs Interval Estimate

Point Estimate vs Interval Estimate A point estimate doesn’t tell us how close the estimate is likely to be to the parameter An interval estimate is more useful It incorporates a margin of error which helps us to gauge the accuracy of the point estimate

Use an appropriate sample statistic: Point Estimation: How Do We Make a Best Guess for a Population Parameter? Use an appropriate sample statistic: For the population mean, use the sample mean For the population proportion, use the sample proportion

Point Estimation: How Do We Make a Best Guess for a Population Parameter? Point estimates are the most common form of inference reported by the mass media

Properties of Point Estimators Property 1: A good estimator has a sampling distribution that is centered at the parameter An estimator with this property is unbiased The sample mean is an unbiased estimator of the population mean The sample proportion is an unbiased estimator of the population proportion

Properties of Point Estimators Property 2: A good estimator has a small standard error compared to other estimators This means it tends to fall closer than other estimates to the parameter

Interval Estimation: Constructing an Interval that Contains the Parameter (We Hope!) Inference about a parameter should provide not only a point estimate but should also indicate its likely precision

Confidence Interval A confidence interval is an interval containing the most believable values for a parameter The probability that this method produces an interval that contains the parameter is called the confidence level This is a number chosen to be close to 1, most commonly 0.95

What is the Logic Behind Constructing a Confidence Interval? To construct a confidence interval for a population proportion, start with the sampling distribution of a sample proportion

The Sampling Distribution of the Sample Proportion Gives the possible values for the sample proportion and their probabilities Is approximately a normal distribution for large random samples Has a mean equal to the population proportion Has a standard deviation called the standard error

A 95% Confidence Interval for a Population Proportion Fact: Approximately 95% of a normal distribution falls within 1.96 standard deviations of the mean That means: With probability 0.95, the sample proportion falls within about 1.96 standard errors of the population proportion

Margin of Error The margin of error measures how accurate the point estimate is likely to be in estimating a parameter The distance of 1.96 standard errors in the margin of error for a 95% confidence interval

Confidence Interval A confidence interval is constructed by adding and subtracting a margin of error from a given point estimate When the sampling distribution is approximately normal, a 95% confidence interval has margin of error equal to 1.96 standard errors

Section 7.2 How Can We Construct a Confidence Interval to Estimate a Population Proportion?

Finding the 95% Confidence Interval for a Population Proportion We symbolize a population proportion by p The point estimate of the population proportion is the sample proportion We symbolize the sample proportion by

Finding the 95% Confidence Interval for a Population Proportion A 95% confidence interval uses a margin of error = 1.96(standard errors) [point estimate ± margin of error] =

Finding the 95% Confidence Interval for a Population Proportion The exact standard error of a sample proportion equals: This formula depends on the unknown population proportion, p In practice, we don’t know p, and we need to estimate the standard error

Finding the 95% Confidence Interval for a Population Proportion In practice, we use an estimated standard error:

Finding the 95% Confidence Interval for a Population Proportion A 95% confidence interval for a population proportion p is:

Example: Would You Pay Higher Prices to Protect the Environment? In 2000, the GSS asked: “Are you willing to pay much higher prices in order to protect the environment?” Of n = 1154 respondents, 518 were willing to do so

Example: Would You Pay Higher Prices to Protect the Environment? Find and interpret a 95% confidence interval for the population proportion of adult Americans willing to do so at the time of the survey

Example: Would You Pay Higher Prices to Protect the Environment?

Sample Size Needed for Large-Sample Confidence Interval for a Proportion For the 95% confidence interval for a proportion p to be valid, you should have at least 15 successes and 15 failures:

“95% Confidence” With probability 0.95, a sample proportion value occurs such that the confidence interval contains the population proportion, p With probability 0.05, the method produces a confidence interval that misses p

How Can We Use Confidence Levels Other than 95%? In practice, the confidence level 0.95 is the most common choice But, some applications require greater confidence To increase the chance of a correct inference, we use a larger confidence level, such as 0.99

A 99% Confidence Interval for p

Different Confidence Levels

Different Confidence Levels In using confidence intervals, we must compromise between the desired margin of error and the desired confidence of a correct inference As the desired confidence level increases, the margin of error gets larger

What is the Error Probability for the Confidence Interval Method? The general formula for the confidence interval for a population proportion is: Sample proportion ± (z-score)(std. error) which in symbols is

What is the Error Probability for the Confidence Interval Method?

Summary: Confidence Interval for a Population Proportion, p A confidence interval for a population proportion p is:

Increases as the confidence level increases Summary: Effects of Confidence Level and Sample Size on Margin of Error The margin of error for a confidence interval: Increases as the confidence level increases Decreases as the sample size increases

What Does It Mean to Say that We Have “95% Confidence”? If we used the 95% confidence interval method to estimate many population proportions, then in the long run about 95% of those intervals would give correct results, containing the population proportion

A recent survey asked: “During the last year, did anyone take something from you by force?” Of 987 subjects, 17 answered “yes” Find the point estimate of the proportion of the population who were victims .17 .017 .0017

Section 7.3 How Can We Construct a Confidence Interval To Estimate a Population Mean?

How to Construct a Confidence Interval for a Population Mean Point estimate ± margin of error The sample mean is the point estimate of the population mean The exact standard error of the sample mean is σ/ In practice, we estimate σ by the sample standard deviation, s

How to Construct a Confidence Interval for a Population Mean For large n… and also For small n from an underlying population that is normal… The confidence interval for the population mean is:

How to Construct a Confidence Interval for a Population Mean In practice, we don’t know the population standard deviation Substituting the sample standard deviation s for σ to get se = s/ introduces extra error To account for this increased error, we replace the z-score by a slightly larger score, the t-score

How to Construct a Confidence Interval for a Population Mean In practice, we estimate the standard error of the sample mean by se = s/ Then, we multiply se by a t-score from the t-distribution to get the margin of error for a confidence interval for the population mean

Properties of the t-distribution The t-distribution is bell shaped and symmetric about 0 The probabilities depend on the degrees of freedom, df The t-distribution has thicker tails and is more spread out than the standard normal distribution

t-Distribution

Summary: 95% Confidence Interval for a Population Mean A 95% confidence interval for the population mean µ is: To use this method, you need: Data obtained by randomization An approximately normal population distribution

Example: eBay Auctions of Palm Handheld Computers Do you tend to get a higher, or a lower, price if you give bidders the “buy-it-now” option?

Example: eBay Auctions of Palm Handheld Computers Consider some data from sales of the Palm M515 PDA (personal digital assistant) During the first week of May 2003, 25 of these handheld computers were auctioned off, 7 of which had the “buy-it-now” option

Example: eBay Auctions of Palm Handheld Computers “Buy-it-now” option: 235 225 225 240 250 250 210 Bidding only: 250 249 255 200 199 240 228 255 232 246 210 178 246 240 245 225 246 225

Example: eBay Auctions of Palm Handheld Computers Summary of selling prices for the two types of auctions: buy_now N Mean StDev Minimum Q1 Median Q3 no 18 231.61 21.94 178.00 221.25 240.00 246.75 yes 7 233.57 14.64 210.00 225.00 235.00 250.00 buy_now Maximum no 255.00 yes 250.00

Example: eBay Auctions of Palm Handheld Computers

Example: eBay Auctions of Palm Handheld Computers To construct a confidence interval using the t-distribution, we must assume a random sample from an approximately normal population of selling prices

Example: eBay Auctions of Palm Handheld Computers Let µ denote the population mean for the “buy-it-now” option The estimate of µ is the sample mean: x = $233.57 The sample standard deviation is: s = $14.64

Example: eBay Auctions of Palm Handheld Computers The 95% confidence interval for the “buy-it-now” option is: which is 233.57 ± 13.54 or (220.03, 247.11)

Example: eBay Auctions of Palm Handheld Computers The 95% confidence interval for the mean sales price for the bidding only option is: (220.70, 242.52)

Example: eBay Auctions of Palm Handheld Computers Notice that the two intervals overlap a great deal: “Buy-it-now”: (220.03, 247.11) Bidding only: (220.70, 242.52) There is not enough information for us to conclude that one probability distribution clearly has a higher mean than the other

How Do We Find a t- Confidence Interval for Other Confidence Levels? The 95% confidence interval uses t.025 since 95% of the probability falls between - t.025 and t.025 For 99% confidence, the error probability is 0.01 with 0.005 in each tail and the appropriate t-score is t.005

If the Population is Not Normal, is the Method “Robust”? A basic assumption of the confidence interval using the t-distribution is that the population distribution is normal Many variables have distributions that are far from normal

If the Population is Not Normal, is the Method “Robust”? How problematic is it if we use the t- confidence interval even if the population distribution is not normal?

If the Population is Not Normal, is the Method “Robust”? For large random samples, it’s not problematic The Central Limit Theorem applies: for large n, the sampling distribution is bell-shaped even when the population is not

If the Population is Not Normal, is the Method “Robust”? What about a confidence interval using the t-distribution when n is small? Even if the population distribution is not normal, confidence intervals using t-scores usually work quite well We say the t-distribution is a robust method in terms of the normality assumption

Cases Where the t- Confidence Interval Does Not Work With binary data With data that contain extreme outliers

The Standard Normal Distribution is the t-Distribution with df = ∞

The 2002 GSS asked: “What do you think is the ideal number of children in a family?” The 497 females who responded had a median of 2, mean of 3.02, and standard deviation of 1.81. What is the point estimate of the population mean? 497 2 3.02 1.81

How Do We Choose the Sample Size for a Study? Section 7.4 How Do We Choose the Sample Size for a Study?

How are the Sample Sizes Determined in Polls? It depends on how much precision is needed as measured by the margin of error The smaller the margin of error, the larger the sample size must be

Choosing the Sample Size for Estimating a Population Proportion? First, we must decide on the desired margin of error Second, we must choose the confidence level for achieving that margin of error In practice, 95% confidence intervals are most common

Example: What Sample Size Do You Need For An Exit Poll? A television network plans to predict the outcome of an election between two candidates – Levin and Sanchez They will do this with an exit poll that randomly samples votes on election day

Example: What Sample Size Do You Need For An Exit Poll? The final poll a week before election day estimated Levin to be well ahead, 58% to 42% So the outcome is not expected to be close The researchers decide to use a sample size for which the margin of error is 0.04

Example: What Sample Size Do You Need For An Exit Poll? What is the sample size for which a 95% confidence interval for the population proportion has margin of error equal to 0.04?

Example: What Sample Size Do You Need For An Exit Poll? The 95% confidence interval for a population proportion p is: If the sample size is such that 1.96(se) = 0.04, then the margin of error will be 0.04

Example: What Sample Size Do You Need For An Exit Poll? Find the value of the sample size n for which 0.04 = 1.96(se):

Example: What Sample Size Do You Need For An Exit Poll? A random sample of size n = 585 should give a margin of error of about 0.04 for a 95% confidence interval for the population proportion

How Can We Select a Sample Size Without Guessing a Value for the Sample Proportion In the formula for determining n, setting = 0.50 gives the largest value for n out of all the possible values to substitute for Doing this is the “safe” approach that guarantees we’ll have enough data

Sample Size for Estimating a Population Parameter The random sample size n for which a confidence interval for a population proportion p has margin of error m (such as m = 0.04) is

Sample Size for Estimating a Population Parameter The z-score is based on the confidence level, such as z = 1.96 for 95% confidence You either guess the value you’d get for the sample proportion based on other information or take the safe approach of setting = 0.50

Sample Size for Estimating a Population Mean The random sample size n for which a 95% confidence interval for a population mean has margin of error approximately equal to m is To use this formula, you guess the value you’ll get for the sample standard deviation, s

Sample Size for Estimating a Population Mean In practice, since you don’t yet have the data, you don’t know the value of the sample standard deviation, s You must substitute an educated guess for s You can use the sample standard deviation from a similar study

Example: Finding n to Estimate Mean Education in South Africa A social scientist plans a study of adult South Africans to investigate educational attainment in the black community How large a sample size is needed so that a 95% confidence interval for the mean number of years of education has margin of error equal to 1 year?

Example: Finding n to Estimate Mean Education in South Africa No prior information about the standard deviation of educational attainment is available We might guess that the sample education values fall within a range of about 18 years

Example: Finding n to Estimate Mean Education in South Africa If the data distribution is bell-shaped, the range from – 3 to + 3 will contain nearly all the distribution The distance – 3 to + 3 equals 6s Solving 18 = 6s for s yields s = 3 So ‘3’ is a crude estimate of s

Example: Finding n to Estimate Mean Education in South Africa The desired margin of error is m = 1 year The required sample size is:

What Factors Affect the Choice of the Sample Size? The first is the desired precision, as measured by the margin of error, m The second is the confidence level

What Other Factors Affect the Choice of the Sample Size? A third factor is the variability in the data If subjects have little variation (that is, s is small), we need fewer data than if they have substantial variation A fourth factor is financial Cost is often a major constraint

What if You Have to Use a Small n? The t- methods for a mean are valid for any n However, you need to be extra cautious to look for extreme outliers or great departures from the normal population assumption

What if You Have to Use a Small n? In the case of the confidence interval for a population proportion, the method works poorly for small samples

Constructing a Small-Sample Confidence Interval for a Proportion Suppose a random sample does not have at least 15 successes and 15 failures The confidence interval formula: Is still valid if we use it after adding ‘2’ to the original number of successes and ‘2’ to the original number of failures This results in adding ‘4’ to the sample size n