Week 6 October 6-10 Four Mini-Lectures QMM 510 Fall 2014.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Confidence Intervals Chapter 8.
9.1 confidence interval for the population mean when the population standard deviation is known
Parameter Estimation Chapter 8 Homework: 1-7, 9, 10 Focus: when  is known (use z table)
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Chapter 7 Introduction to Sampling Distributions
BCOR 1020 Business Statistics Lecture 17 – March 18, 2008.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 7-3 Estimating a Population Mean:  Known Created by.
Chapter 7 Sampling and Sampling Distributions
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Chapter 8 Estimation: Single Population
Fall 2006 – Fundamentals of Business Statistics 1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 7 Estimating Population Values.
Part III: Inference Topic 6 Sampling and Sampling Distributions
Chapter 7 Estimation: Single Population
Chapter 7 Estimating Population Values
BCOR 1020 Business Statistics
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Business Statistics: Communicating with Numbers
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Confidence Intervals Chapter 7.
Section 8.2 Estimating  When  is Unknown
Chapter 6 Confidence Intervals.
Prepared by Lloyd R. Jaisingh
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
Dan Piett STAT West Virginia University
Estimation of Statistical Parameters
© 2003 Prentice-Hall, Inc.Chap 6-1 Business Statistics: A First Course (3 rd Edition) Chapter 6 Sampling Distributions and Confidence Interval Estimation.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Estimating a Population Mean: σ Known 7-3, pg 355.
McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. Sampling Distributions and Estimation (Part 1) Chapter88 Sampling Variation Estimators.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. Sampling Distributions and Estimation (Part 2) Chapter88 Sample Size Determination.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Sampling Distribution and the Central Limit Theorem.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Estimating a Population Mean
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
Estimating a Population Mean:  Known
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
8-1 Confidence Intervals Chapter Contents Confidence Interval for a Mean (μ) with Known σ Confidence Interval for a Mean (μ) with Unknown σ Confidence.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Sampling Distributions and Estimation Chapter88 Sampling Variation Estimators and Sampling Distributions Sample Mean and the Central Limit Theorem Confidence.
Chapter 6 Sampling and Sampling Distributions
CHAPTER 8 Estimating with Confidence
Sampling Distributions
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Sampling Distributions and Estimation
Estimates and Sample Sizes Sections 6-2 & 6-4
Sampling Distributions and Estimation
CHAPTER 8 Confidence Estimating with Estimating a Population 8.3 Mean
Chapter 7 Sampling Distributions.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Chapter 7 Sampling Distributions.
LESSON 18: CONFIDENCE INTERVAL ESTIMATION
Chapter 8 Confidence Intervals.
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
Presentation transcript:

Week 6 October 6-10 Four Mini-Lectures QMM 510 Fall 2014

8-2 Sampling Distributions ML 6.1 Chapter Contents 8.1 Sampling Variation 8.2 Estimators and Sampling Errors 8.3 Sample Mean and the Central Limit Theorem 8.4 Confidence Interval for a Mean (μ) with Known σ 8.5 Confidence Interval for a Mean (μ) with Unknown σ 8.6 Confidence Interval for a Proportion (π) 8.7 Estimating from Finite Populations 8.8 Sample Size Determination for a Mean 8.9 Sample Size Determination for a Proportion 8.10 Confidence Interval for a Population Variance,  2 (Optional 8.10 Confidence Interval for a Population Variance,  2 (Optional) Chapter 8 So many topics, so little time …

8-3 Learning Objectives LO8-1: Define sampling error, parameter, and estimator. LO8-2: Explain the desirable properties of estimators. LO8-3: State the Central Limit Theorem for a mean. LO8-4: Explain how sample size affects the standard error. Chapter 8 Sampling Distributions

8-4 Sample statistic – a random variable whose value depends on which population items are included in the random sample. Depending on the sample size, the sample statistic could either represent the population well or differ greatly from the population. This sampling variation can be illustrated. Here are 100 individual items drawn from a population. When n = 1, the histogram of the sampled items resembles the population, but not exactly. Chapter 8 Sampling Variation

8-5 Chapter 8 Consider eight random samples of size n = 5 from a large population of GMAT scores for MBA applicants.Consider eight random samples of size n = 5 from a large population of GMAT scores for MBA applicants. The sample items vary, but the means tend to be close to the population mean (  = ).The sample items vary, but the means tend to be close to the population mean (  = ). Sampling Variation Example: GMAT Scores Example: GMAT Scores

8-6 Sample dot plots show that the sample means have much less variation than the individual sample items.Sample dot plots show that the sample means have much less variation than the individual sample items. Chapter 8 Sampling Variation Example: GMAT Scores Example: GMAT Scores

8-7 Estimator – a statistic derived from a sample to infer the value of a population parameter.Estimator – a statistic derived from a sample to infer the value of a population parameter. Estimate – the value of the estimator in a particular sample.Estimate – the value of the estimator in a particular sample. A population parameter is usually represented by a Greek letter and the corresponding statistic by a Roman letter.A population parameter is usually represented by a Greek letter and the corresponding statistic by a Roman letter. Some Terminology Some Terminology Chapter 8 Estimators and Sampling Distributions

8-8 Examples of Estimators Examples of Estimators Chapter 8 Sampling Distributions The sampling distribution of an estimator is the probability distribution of all possible values the statistic may assume when a random sample of size n is taken. Estimators and Sampling Distributions Note: An estimator is a random variable since samples vary.

8-9 BiasBias is the difference between the expected value of the estimator and the true parameter. Example for the mean, An estimator is unbiased if its expected value is the parameter being estimated. The sample mean is an unbiased estimator of the population mean sinceAn estimator is unbiased if its expected value is the parameter being estimated. The sample mean is an unbiased estimator of the population mean since On average, an unbiased estimator neither overstates nor understates the true parameter.On average, an unbiased estimator neither overstates nor understates the true parameter. Chapter 8 Sampling error Sampling error is the difference between an estimate and the corresponding population parameter. For example, if we use the sample mean as an estimate for the population mean, then the Estimators and Sampling Distributions

8-10 Chapter 8 Estimators and Sampling Distributions A desirable property for an estimator is for it to be unbiased. Unbiased

8-11 Efficiency refers to the variance of the estimator’s sampling distribution.Efficiency refers to the variance of the estimator’s sampling distribution. A more efficient estimator has smaller variance.A more efficient estimator has smaller variance. Efficiency Efficiency Figure 8.6 Chapter 8 Estimators and Sampling Distributions

8-12 Consistency Consistency A consistent estimator converges toward the parameter being estimated as the sample size increases. Figure 8.6 Chapter 8 Estimators and Sampling Distributions

8-13 Chapter 8 Central Limit Theorem The Central Limit Theorem is a powerful result that allows us to approximate the shape of the sampling distribution of the sample mean even when we don’t know what the population looks like.

8-14 If the population is exactly normal, then the sample mean follows a normal distribution. Chapter 8 As the sample size n increases, the distribution of sample means narrows in on the population mean µ. Central Limit Theorem

8-15 If the sample is large enough, the sample means will have approximately a normal distribution even if your population is not normal. Chapter 8 Central Limit Theorem

8-16 Illustrations of Central Limit Theorem Note: Chapter 8 Using the uniform and a right-skewed distribution. Central Limit Theorem

8-17 The Central Limit Theorem permits us to define an interval within which the sample means are expected to fall. As long as the sample size n is large enough, we can use the normal distribution regardless of the population shape (or any n if the population is normal to begin with). Applying The Central Limit Theorem Applying The Central Limit Theorem Chapter 8 Central Limit Theorem

8-18 Sample Size and Standard Error Sample Size and Standard Error Chapter 8 For example, when n = 4 the standard error is halved. To halve it again requires n = 16, and to halve it again requires n = 64. To halve the standard error, you must quadruple the sample size (the law of diminishing returns). Central Limit Theorem standard error The sample means tend to fall within a narrower interval as n increases. The key is the standard error :

8-19 Consider a discrete uniform population consisting of the integers {0, 1, 2, 3}.Consider a discrete uniform population consisting of the integers {0, 1, 2, 3}. The population parameters are:  = 1.5,  = The population parameters are:  = 1.5,  = Illustration: All Possible Samples from a Uniform Population Illustration: All Possible Samples from a Uniform Population Chapter 8 Central Limit Theorem

8-20 The population is uniform, yet the distribution of all possible sample means of size 2 has a peaked triangular shape. Illustration: All Possible Samples from a Uniform Population Illustration: All Possible Samples from a Uniform Population Chapter 8 Central Limit Theorem

8-21 The population is uniform, yet the histogram of sample means has a peaked triangular shape starting with n = 2. By n = 8, the histogram appears normal. Illustration: 100 Samples from a Uniform Population Illustration: 100 Samples from a Uniform Population Chapter 8 Central Limit Theorem

8-22 The population is skewed, yet the histogram of sample means starts to have a normal shape starting with n = 4. By n = 16, the histogram appears arguably normal. Illustration: 100 Samples from a Skewed Population Illustration: 100 Samples from a Skewed Population Chapter 8 Central Limit Theorem

8-23 What Is a Confidence Interval? What Is a Confidence Interval? Chapter 8 Confidence Interval for ML 6.2 a Mean (  ) with Known 

8-24 What is a Confidence Interval? What is a Confidence Interval? The confidence interval for  with known  is:The confidence interval for  with known  is: Chapter 8 Confidence Interval for a Mean (  ) with Known  z-values for commonly- used confidence levels

8-25 Example: Bottle Fill Example: Bottle Fill Chapter 8 Confidence Interval for a Mean (  ) with Known  … but usually we do NOT know σ

8-26 A higher confidence level leads to a wider confidence interval.A higher confidence level leads to a wider confidence interval. Choosing a Confidence Level Choosing a Confidence Level Greater confidence implies loss of precision (i.e. greater margin of error).Greater confidence implies loss of precision (i.e. greater margin of error). 95% confidence is most often used.95% confidence is most often used. Chapter 8 Confidence Intervals for Example 8.2 Confidence Interval for a Mean (  ) with Known 

8-27 A confidence interval either does or does not contain .A confidence interval either does or does not contain . The confidence level quantifies the risk.The confidence level quantifies the risk. Out of 100 confidence intervals, approximately 95% may contain , while approximately 5% might not contain  when constructing 95% confidence intervals (for example, sample 14 below).Out of 100 confidence intervals, approximately 95% may contain , while approximately 5% might not contain  when constructing 95% confidence intervals (for example, sample 14 below). Interpretation Interpretation Chapter 8 Confidence Interval for a Mean (  ) with Known 

8-28 Chapter 8 If  is known and the population is normal, then we can safely use the formula to compute the confidence interval. If  is known and we do not know whether the population is normal, a common rule of thumb is that n  30 is sufficient to use the formula as long as the distribution is approximately symmetric with no outliers. Confidence Interval for a Mean (  ) with Known  When Can We Assume Normality? When Can We Assume Normality? Larger n may be needed to assume normality if you are sampling from a strongly skewed population or one with outliers.

8-29 Use the Student’s t distribution instead of the normal distribution when the population is normal but the standard deviation s is unknown and the sample size is small. Student’s t Distribution Student’s t Distribution Chapter 8 Confidence Interval for ML 6.3 a Mean (  ) with Unknown  … and usually we do NOT know σ …

8-30 Student’s t Distribution Student’s t Distribution Chapter 8 Confidence Interval for a Mean (  ) with Unknown 

8-31 Student’s t Distribution Student’s t Distribution t distributions are symmetric and shaped like the standard normal distribution.t distributions are symmetric and shaped like the standard normal distribution. The t distribution is dependent on the size of the sample.The t distribution is dependent on the size of the sample. Figure 8.11 Chapter 8 Comparison of Normal and Student’s t Confidence Interval for a Mean (  ) with Unknown 

8-32 Degrees of Freedom Degrees of Freedom Degrees of freedom (d.f.) is a parameter based on the sample size that is used to determine the t distribution.Degrees of freedom (d.f.) is a parameter based on the sample size that is used to determine the t distribution. The d.f. for the t distribution in this case is given by d.f. = n  1.The d.f. for the t distribution in this case is given by d.f. = n  1. Chapter 8 As n increases, the t distribution approaches the shape of the normal distribution.As n increases, the t distribution approaches the shape of the normal distribution. For a given confidence level, t is always larger than z, so a confidence interval based on t is always wider than if z were used.For a given confidence level, t is always larger than z, so a confidence interval based on t is always wider than if z were used. Confidence Interval for a Mean (  ) with Unknown  Comparison of Normal and Student’s t

8-33 Comparison of z and t Comparison of z and t For very small samples, t-values differ substantially from the normal. As degrees of freedom increase, the t-values approach the normal z- values. Chapter 8 So for a 90 percent confidence interval, we would use t = 1.697, which is slightly larger than z = Confidence Interval for a Mean (  ) with Unknown  Note: Note: the z and t distributions are almost the same for d.f. = 30 For example, for n = 31, the degrees of freedom would be d.f. = 31 – 1 = 30.

8-34 Figure 8.13 Chapter 8 Confidence Interval for a Mean (  ) with Unknown  Example: GMAT Scores Again Example: GMAT Scores Again

8-35 Construct a 90% confidence interval for the mean GMAT score of all MBA applicants.Construct a 90% confidence interval for the mean GMAT score of all MBA applicants. x = 510 s = Since  is unknown, use the Student’s t for the confidence interval with d.f. = 20 – 1 = 19.Since  is unknown, use the Student’s t for the confidence interval with d.f. = 20 – 1 = 19. Find t  /2 = t.05 = from Appendix D.Find t  /2 = t.05 = from Appendix D. Chapter 8 Confidence Interval for a Mean (  ) with Unknown  Example: GMAT Scores Again Example: GMAT Scores Again

8-36 For a 90% confidence interval, use Appendix D to find t 0.05 = with d.f. = 19.For a 90% confidence interval, use Appendix D to find t 0.05 = with d.f. = 19. Note: Note: We could also use Excel, MINITAB, etc. to obtain t.05 values as well as to construct confidence intervals. Chapter 8 We are 90 percent confident that the true mean GMAT score might be within the interval [481.48, ] Confidence Interval for a Mean (  ) with Unknown  Example: GMAT Scores Again Example: GMAT Scores Again =T.INV.2T(0.1,19) = 1.729

8-37 Confidence Interval Width Confidence Interval Width Confidence interval width reflects - the sample size, - the confidence level and - the standard deviation. To obtain a narrower interval and more precision - increase the sample size, or - lower the confidence level (e.g., from 90% to 80% confidence ). Chapter 8 Confidence Interval for a Mean (  ) with Unknown  There is no free lunch!

8-38 Using Appendix D Using Appendix D Beyond d.f. = 50, Appendix D shows d.f. in steps of 5 or 10.Beyond d.f. = 50, Appendix D shows d.f. in steps of 5 or 10. If the table does not give the exact degrees of freedom, use the t-value for the next lower degrees of freedom.If the table does not give the exact degrees of freedom, use the t-value for the next lower degrees of freedom. This is a conservative procedure since it causes the interval to be slightly wider.This is a conservative procedure since it causes the interval to be slightly wider. A conservative statistician may use the t distribution for confidence intervals when σ is unknown because using z would underestimate the margin of error. Chapter 8 Confidence Interval for a Mean (  ) with Unknown 

8-39 If the population is normal, then the sample variance s 2 follows the chi-square distribution (  2 ) with degrees of freedom d.f.  = n – 1.If the population is normal, then the sample variance s 2 follows the chi-square distribution (  2 ) with degrees of freedom d.f.  = n – 1. Lower (  2 L ) and upper (  2 U ) tail percentiles for the chi-square distribution can be found using Appendix E.Lower (  2 L ) and upper (  2 U ) tail percentiles for the chi-square distribution can be found using Appendix E. Chi-Square Distribution Chi-Square Distribution Confidence Interval for a ML 6.4 Population Variance,  2. Chapter 8 Note: Note: The chi-square distribution is skewed right, but less so for larger d.f.

8-40 Using the sample variance s 2, the confidence interval isUsing the sample variance s 2, the confidence interval is Confidence Interval Confidence Interval To obtain a confidence interval for the standard deviation , just take the square root of the interval bounds.To obtain a confidence interval for the standard deviation , just take the square root of the interval bounds. Confidence Interval for a Population Variance,  2 Chapter 8

8-41 You can use Appendix E to find critical chi-square values. Chapter 8 or from Excel: =CHISQ.INV(0.025,39)= =CHISQ.INV(0.975,39) = Confidence Interval for a Population Variance,  2

8-42 Estimating a variance is easy.Estimating a variance is easy. But you don’t see it very often.But you don’t see it very often. Maybe because the chi-square distribution is less familiar?Maybe because the chi-square distribution is less familiar? Maybe because we usually are more about the mean?Maybe because we usually are more about the mean? Bottom Line: Bottom Line: Chapter 8 Confidence Interval for a Population Variance,  2