Sampling Distributions and The Central Limit Theorem

Slides:



Advertisements
Similar presentations
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 18 Sampling Distribution Models.
Advertisements

Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Sampling Distributions
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 18 Sampling Distribution Models
Copyright © 2010 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Sampling Variability & Sampling Distributions.
© 2010 Pearson Prentice Hall. All rights reserved Sampling Distributions and the Central Limit Theorem.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Modular 13 Ch 8.1 to 8.2.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Chapter Sampling Distributions 8 © 2010 Pearson Prentice Hall. All rights reserved.
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
Section 8.2 Estimating  When  is Unknown
Overview 7.2 Central Limit Theorem for Means Objectives: By the end of this section, I will be able to… 1) Describe the sampling distribution of x for.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Objectives 1.Describe the distribution of the sample mean: samples from normal populations 2.Describe.
Copyright © 2009 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
Distribution of the Sample Means
Section 5.4 Sampling Distributions and the Central Limit Theorem Larson/Farber 4th ed.
Distributions of the Sample Mean
Chapter 6.3 The central limit theorem. Sampling distribution of sample means A sampling distribution of sample means is a distribution using the means.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Biostatistics Unit 5 – Samples. Sampling distributions Sampling distributions are important in the understanding of statistical inference. Probability.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 18 Sampling Distribution Models.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
6.3 THE CENTRAL LIMIT THEOREM. DISTRIBUTION OF SAMPLE MEANS  A sampling distribution of sample means is a distribution using the means computed from.
Chapter 8 Sampling Variability and Sampling Distributions.
Chapter 18: Sampling Distribution Models
Review of Statistical Terms Population Sample Parameter Statistic.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
Chapter 18 Sampling Distribution Models *For Means.
Unit 6 Section : The Central Limit Theorem  Sampling Distribution – the probability distribution of a sample statistic that is formed when samples.
Sec 6.3 Bluman, Chapter Review: Find the z values; the graph is symmetrical. Bluman, Chapter 63.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Sampling Distributions
Sampling and Sampling Distributions
Chapter 5 Normal Probability Distributions.
Chapter Eight Estimation.
Sampling Distribution Models
Sampling Distributions
Understanding Sampling Distributions: Statistics as Random Variables
Sampling Variability & Sampling Distributions
6-3The Central Limit Theorem.
The Normal Distribution
Chapter 7 Sampling Distributions.
Chapter 18: Sampling Distribution Models
Sampling Distributions and The Central Limit Theorem
Sampling Distribution Models
Chapter 5 Normal Probability Distributions.
Chapter 7 Sampling Distributions.
Central Limit Theorem.
Chapter 7 Sampling Distributions.
Confidence Intervals for a Standard Deviation
Normal Probability Distributions
Sampling Distributions
Sampling Distributions
AGENDA: DG minutes Begin Part 2 Unit 1 Lesson 11.
Chapter 7 Sampling Distributions.
Sampling Distributions and the Central Limit Theorem
Estimates and Sample Sizes Lecture – 7.4
Day 13 AGENDA: DG minutes Begin Part 2 Unit 1 Lesson 11.
Chapter 7 Sampling Distributions.
Chapter 5 Normal Probability Distributions.
Ch. 8 Central Limit Theorem 8.1: Sample means 8.2: Sample proportions
Chapter 5: Sampling Distributions
Presentation transcript:

Sampling Distributions and The Central Limit Theorem Section 7.3

Objectives Construct the sampling distribution of a sample mean Use the Central Limit Theorem to compute probabilities involving sample means

Construct the sampling distribution of a sample mean Objective 1 Construct the sampling distribution of a sample mean

Sampling Distribution of the Sample Mean In real situations, statistical studies involve sampling several individuals then computing numerical summaries of the samples. Most often the sample mean, 𝑥 , is computed. If several samples are drawn from a population, they are likely to have different values for 𝑥 . Because the value of 𝑥 varies each time a sample is drawn, 𝒙 is a random variable. For each value of the random variable, 𝑥 , we can compute a probability. The probability distribution of 𝑥 is called the sampling distribution of 𝑥 .

An Example of a Sampling Distribution Tetrahedral dice are shaped like a pyramid with four faces. Each face corresponds to a number between 1 and 4. Tossing a tetrahedral die is like sampling a value from the population 1, 2, 3, 4 . We can easily find the population mean, 𝜇=2.5, and the population standard deviation 𝜎=1.118. If a tetrahedral die is tossed three times, the sequence of three numbers observed is a sample of size 3 drawn with replacement. The table displays all possible samples of size 3 and their sample mean 𝑥 . The mean of all of values of 𝑥 is 𝜇 𝑥 =2.5 and the standard deviation of all values of 𝑥 is 𝜎 𝑥 =0.6455. Next, we compare these values to the population mean (2.5) and population standard deviation (1.118).

Mean and Standard Deviation of a Sampling Distribution In the previous table, the mean of the sampling distribution is 𝜇 𝑥 =2.5, which is the same as the mean of the population, 𝜇=2.5. This relation always holds. The standard deviation of the sampling distribution is 𝜎 𝑥 =0.6455, which is less than the population standard deviation 𝜎=1.118. It is not immediately obvious how these two quantities are related. Note, however, that 𝜎 𝑥 =0.6455= 1.118 3 = 𝜎 3 . Recall that the sample size is 𝑛=3, which suggests that 𝜎 𝑥 = 𝜎 𝑛 . The mean of the sampling distribution is denoted by 𝝁 𝒙 and equals the mean of the population: 𝝁 𝒙 =𝝁 The standard deviation of the sampling distribution, sometimes called the standard error, is denoted by 𝝈 𝒙 and equals the standard deviation of the population divided by the square root of the sample size: 𝝈 𝒙 = 𝝈 𝒏

Example – Sampling Distribution Among students at a certain college, the mean number of hours of television watched per week is 𝜇 = 10.5, and the standard deviation is 𝜎 = 3.6. A simple random sample of 16 students is chosen for a study of viewing habits. Let 𝑥 be the mean number of hours of TV watched by the sampled students. Find the mean 𝜇 𝑥 and the standard deviation 𝜎 𝑥 of 𝑥 . Solution: The mean of 𝑥 is: 𝜇 𝑥 = 𝜇 = 10.5 The sample size is 𝑛 = 16. Therefore, the standard deviation of 𝑥 is: 𝜎 𝑥 = 𝜎 𝑛 = 3.6 16 = 0.9

Sampling Distribution for Sample of Size 3 Consider again the tetrahedral die example. The sampling distribution for 𝑥 can be determined from the table of all possible values of 𝑥 for a sample of size 3. The probability that the sample mean is 1.00 is 1 64 , or 0.015625, because out of the 64 possible samples, only 1 has a sample mean equal to 1.00. Similarly, the probability that 𝑥 =1.33 is 3 64 , or 0.046875, because there are 3 samples whose sample mean is 1.33. The probability distribution is: 𝑥 𝑃( 𝑥 ) 1.00 0.015625 1.33 0.046875 1.67 0.093750 2.00 0.156250 2.33 0.187500 2.67 3.00 3.33 3.67 4.00

Probability Histogram for a Sampling Distribution In the tetrahedral die example, the population is 1, 2, 3, 4 . When a die is rolled, each number has the same chance of appearing, 1 4 or 0.25. The probability histogram for the sampling distribution of 𝑥 with sample size 3 is obtained from the sampling distribution on the previous slide. The probability histogram for the sampling distribution looks a lot like the normal curve, whereas the probability histogram for the population does not. Remarkably, it is true that, for any population, if the sample size is large enough, the sample mean 𝑥 will be approximately normally distributed. For a symmetric population like the tetrahedral die population, the sample mean is approximately normally distributed even for a small sample size like 𝑛 = 3.

Sampling Distribution of a Skewed Population If a population is skewed, a larger sample size is necessary for the sampling distribution of 𝑥 to be approximately normal. Consider the following probability distribution. Below are the probability histograms for the sampling distribution of 𝑥 for samples of size 3, 10, and 30. Note that the shapes of the distributions begin to approximate a normal curve as the sample size increases. The size of the sample needed to obtain approximate normality depends mostly on the skewness of the population. In practice, a sample of size 𝑛 > 30 is large enough.

The Central Limit Theorem The remarkable fact that the sampling distribution of 𝑥 is approximately normal for a large sample from any distribution is part of one of the most used theorems in Statistics, the Central Limit Theorem. The Central Limit Theorem applies for all populations. However, for symmetric populations, a smaller sample size may suffice. If the population itself is normal, the sample mean 𝒙 will be normal for any sample size. Let 𝑥 be the mean of a large (𝑛 > 30) simple random sample from a population with mean 𝜇 and standard deviation 𝜎. Then 𝑥 has an approximately normal distribution, with mean 𝜇 𝑥 = 𝜇 and standard deviation 𝜎 𝑥 = 𝜎 𝑛 . The Central Limit Theorem

Examples – The Central Limit Theorem Example 1: A sample of size 45 will be drawn from a population with mean 𝜇 = 15 and standard deviation 𝜎 = 3.5. Is it appropriate to use the normal distribution to find probabilities for 𝑥 ? Solution: Yes, by The Central Limit Theorem, since 𝑛 > 30, 𝑥 has an approximately normal distribution. Example 2: A sample of size 8 will be drawn from a normal population with mean 𝜇 = –60 and standard deviation 𝜎 = 5. Is it appropriate to use the normal distribution to find probabilities for 𝑥 ? Yes, since the population itself is approximately normal, 𝑥 has an approximately normal distribution. Example 3: A sample of size 24 will be drawn from a population with mean 𝜇 = 35 and standard deviation 𝜎 = 1.2. Is it appropriate to use the normal distribution to find probabilities for 𝑥 ? Solution: No, since the population is not known to be normal and 𝑛 is not greater than 30, we cannot be certain that 𝑥 has an approximately normal distribution.

Objective 2 Use the Central Limit Theorem to compute probabilities involving sample means

Example 1 – Using the Central Limit Theorem Based on data from the U.S. Census, the mean age of college students in 2011 was 𝜇 = 25 years, with a standard deviation of 𝜎 = 9.5 years. A simple random sample of 125 students is drawn. What is the probability that the sample mean age of the students is greater than 26 years? Solution: The sample size is 125, which is greater than 30. Therefore, we may use the normal curve. We compute 𝜇 𝑥 and 𝜎 𝑥 𝜇 𝑥 = 𝜇 = 25 and 𝜎 𝑥 = 𝜎 𝑛 = 9.5 125 = 0.85 Find the area under the normal curve. The probability that the sample mean age of the students is greater than 26 years is approximately 0.1197.

Example 2 – Using the Central Limit Theorem Hereford cattle are one of the most popular breeds of cattle. Based on data from the Hereford Cattle Society, the mean weight of a one-year-old Hereford bull is 1135 pounds, with a standard deviation of 97 pounds. Would it be unusual for the mean weight of 100 head of cattle to be less than 1100 pounds? Solution: The sample size is 100, which is greater than 30. Therefore, we may use the normal curve. We compute 𝜇 𝑥 and 𝜎 𝑥 𝜇 𝑥 = 𝜇 = 1135 and 𝜎 𝑥 = 𝜎 𝑛 = 97 100 = 9.7 Find the area under the normal curve. This probability is less than 0.05, so it would be unusual for the sample mean to be less than 1100.

You Should Know… How to construct the sampling distribution of a sample mean How to find the mean and standard deviation of a sampling distribution of 𝑥 The Central Limit Theorem How to use the Central Limit Theorem to compute probabilities involving sample means