Distributions of the Sample Mean

Slides:



Advertisements
Similar presentations
Chapter 6 Sampling and Sampling Distributions
Advertisements

Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 18 Sampling Distribution Models.
Chapter 6 – Normal Probability Distributions
Sampling Distributions
Reminders: Parameter – number that describes the population Statistic – number that is computed from the sample data Mean of population = µ Mean of sample.
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
Chapter 10: Sampling and Sampling Distributions
Chapter 7 Introduction to Sampling Distributions
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Sampling Distributions
Chapter 6 Introduction to Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Sampling Distributions
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
6-5 The Central Limit Theorem
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Chapter Sampling Distributions 8 © 2010 Pearson Prentice Hall. All rights reserved.
Review of normal distribution. Exercise Solution.
From Last week.
Slide Slide 1 Chapter 8 Sampling Distributions Mean and Proportion.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 9 Section 1 – Slide 1 of 39 Chapter 9 Section 1 The Logic in Constructing Confidence Intervals.
Chapter 7: Sampling Distributions
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
AP Statistics Chapter 9 Notes.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Probability and Samples
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 9 Samples.
Estimation This is our introduction to the field of inferential statistics. We already know why we want to study samples instead of entire populations,
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Chapter 6.3 The central limit theorem. Sampling distribution of sample means A sampling distribution of sample means is a distribution using the means.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
1 Chapter 8 Sampling Distributions of a Sample Mean Section 2.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Lesson Introduction to the Practice of Statistics.
Chapter 10: Introduction to Statistical Inference.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
1 Sampling Distribution of Arithmetic Mean Dr. T. T. Kachwala.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
1 CHAPTER 6 Sampling Distributions Homework: 1abcd,3acd,9,15,19,25,29,33,43 Sec 6.0: Introduction Parameter The "unknown" numerical values that is used.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Introduction to Inference Sampling Distributions.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Lecture 5 Introduction to Sampling Distributions.
Lesson Distribution of the Sample Mean. Objectives Understand the concept of a sampling distribution Describe the distribution of the sample mean.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
MATH Section 4.4.
Chapter 7: The Distribution of Sample Means
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Lecture Slides Elementary Statistics Twelfth Edition
Sampling Distributions
Introduction to Sampling Distributions
Sampling Distributions
Chapter 7 Sampling Distributions.
The Normal Probability Distribution
Chapter 7 Sampling Distributions.
MATH 2311 Section 4.4.
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 7 Sampling Distributions.
Click the mouse button or press the Space Bar to display the answers.
CHAPTER 15 SUMMARY Chapter Specifics
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
MATH 2311 Section 4.4.
Presentation transcript:

Distributions of the Sample Mean Chapter 8 Section 1 Distributions of the Sample Mean

Chapter 8 – Section 1 Learning objectives Understand the concept of a sampling distribution Describe the distribution of the sample mean for samples obtained from normal populations Describe the distribution of the sample mean for samples obtained from a population that is not normal 1 2 3

Chapter 8 – Section 1 Learning objectives Understand the concept of a sampling distribution Describe the distribution of the sample mean for samples obtained from normal populations Describe the distribution of the sample mean for samples obtained from a population that is not normal 1 2 3

Chapter 8 – Section 1 Often the population is too large to perform a census … so we take a sample How do the results of the sample apply to the population? What’s the relationship between the sample mean and the population mean? What’s the relationship between the sample standard deviation and the population standard deviation? This is statistical inference Often the population is too large to perform a census … so we take a sample Often the population is too large to perform a census … so we take a sample How do the results of the sample apply to the population? What’s the relationship between the sample mean and the population mean What’s the relationship between the sample standard deviation and the population standard deviation?

Chapter 8 – Section 1 We want to use the sample mean x to estimate the population mean μ If we want to estimate the heights of eight year old girls, we can proceed as follows Randomly select 100 eight year old girls Compute the sample mean of the 100 heights Use that as our estimate This is using the sample mean to estimate the population mean We want to use the sample mean x to estimate the population mean μ We want to use the sample mean x to estimate the population mean μ If we want to estimate the heights of eight year old girls, we can proceed as follows Randomly select 100 eight year old girls Compute the sample mean of the 100 heights Use that as our estimate

Chapter 8 – Section 1 However, if we take a series of different random samples Sample 1 – we compute sample mean x1 Sample 2 – we compute sample mean x2 Sample 3 – we compute sample mean x3 Etc. Each time we sample, we may get a different result The sample mean x is a random variable! However, if we take a series of different random samples Sample 1 – we compute sample mean x1 Sample 2 – we compute sample mean x2 Sample 3 – we compute sample mean x3 Etc.

Chapter 8 – Section 1 Because the sample mean is a random variable The sample mean has a mean The sample mean has a standard deviation The sample mean has a probability distribution This is called the sampling distribution of the sample mean

Chapter 8 – Section 1 When we use the sample mean to estimate the population mean, we are estimating a parameter (number) with a random variable The sampling distribution of the sample mean has the parameters of A sample of size n A population with mean μ A population with standard deviation σ When we use the sample mean to estimate the population mean, we are estimating a parameter (number) with a random variable

Chapter 8 – Section 1 Example We have the data 1, 7, 11, 12, 17, 17, 17, 21, 21, 21, 22, 22 and we want to take samples of size n = 3 First, a histogram of the entire data set

Chapter 8 – Section 1 A histogram of the entire data set Definitely skewed left … not bell shaped

Chapter 8 – Section 1 Taking some samples of size 3 The first sample, 17, 21, 12, has a mean of 16.7

Chapter 8 – Section 1 More sample means from more samples

Chapter 8 – Section 1 A histogram of 20 sample means

Chapter 8 – Section 1 The original data set was skewed left, but the set of sample means is close to bell shaped

Chapter 8 – Section 1 Learning objectives Understand the concept of a sampling distribution Describe the distribution of the sample mean for samples obtained from normal populations Describe the distribution of the sample mean for samples obtained from a population that is not normal 3 1 2

Chapter 8 – Section 1 If we know that the population has a normal distribution, then the sampling distribution (i.e. the distribution of x) will also be normal In fact, the sampling distribution Will be normally distributed Will have a mean equal to the mean of the population Will have a standard deviation less than the standard deviation of the population If we know that the population has a normal distribution, then the sampling distribution (i.e. the distribution of x) will also be normal

Chapter 8 – Section 1 Why does it have a smaller standard deviation? The population standard deviation Is a measure of the distance between an individual value and the mean The sampling error for a sample of size n = 1 The standard deviation of the sample mean Is a measure of the distance between the sample mean and the mean It makes sense that the estimate is more accurate if we’ve taken more values (a larger n) Why does it have a smaller standard deviation? The population standard deviation Is a measure of the distance between an individual value and the mean The sampling error for a sample of size n = 1 The standard deviation of the sample mean Is a measure of the distance between the sample mean and the mean Why does it have a smaller standard deviation? The population standard deviation Is a measure of the distance between an individual value and the mean The sampling error for a sample of size n = 1 Why does it have a smaller standard deviation?

Chapter 8 – Section 1 The Law of Large Numbers says that As we take more observations to the sample (i.e. as n gets larger), the difference between the sample mean x and the population mean μ approaches 0 That is to say, we get to the right answer with larger and larger sample sizes

Chapter 8 – Section 1 The standard error, , is the standard deviation of the sample mean The formula for is This is an extremely important formula The standard error, , is the standard deviation of the sample mean

Chapter 8 – Section 1 What does this mean? If we have a normally distributed random variable X, then the distribution of the sample mean x is completely determined We know that it’s also normally distributed We know its mean We know its standard deviation So … we can do all of our calculations!

Chapter 8 – Section 1 If a simple random sample of size n is drawn from a large population, then the sampling distribution has Mean and Standard deviation In addition, if the population is normally distributed, then The sampling distribution is normally distributed If a simple random sample of size n is drawn from a large population, then the sampling distribution has Mean and Standard deviation

Chapter 8 – Section 1 Example If the random variable X has a normal distribution with a mean of 20 and a standard deviation of 12 If we choose samples of size n = 4, then the sample mean will have a normal distribution with a mean of 20 and a standard deviation of 6 If we choose samples of size n = 9, then the sample mean will have a normal distribution with a mean of 20 and a standard deviation of 4

Chapter 8 – Section 1 Learning objectives Understand the concept of a sampling distribution Describe the distribution of the sample mean for samples obtained from normal populations Describe the distribution of the sample mean for samples obtained from a population that is not normal 1 2 3

Chapter 8 – Section 1 This is great if our random variable X has a normal distribution However … what if X does not have a normal distribution What can we do? Wouldn’t it be very nice if the sampling distribution for X also was normal? This is almost true …

Chapter 8 – Section 1 The Central Limit Theorem states Regardless of the shape of the distribution, the sampling distribution becomes approximately normal as the sample size n increases Thus If the random variable X is normally distributed, then the sampling distribution is normally distributed also For all other random variables X, the sampling distributions are approximately normally distributed The Central Limit Theorem states Regardless of the shape of the distribution, the sampling distribution becomes approximately normal as the sample size n increases

Chapter 8 – Section 1 This approximation, of the sampling distribution being normal, is good for large sample sizes … large values of n How large does n have to be? A rule of thumb – if n is 30 or higher, this approximation is probably pretty good

Chapter 8 – Section 1 Example We’ve been told that the average weight of giraffes is 2400 pounds with a standard deviation of 300 pounds We’ve measured 50 giraffes and found that the sample mean was 2600 pounds Is our data consistent with what we’ve been told?

Chapter 8 – Section 1 The sample mean is approximately normal with mean 2400 (the same as the population) and a standard deviation of 300 / √ 50 = 42.4 Using our calculations for the general normal distribution, 2600 is 200 pounds over 2400, and 200 pounds is 200 / 42.4 = 4.7 From our normal calculator, there is about a 1 chance in 1 million of this occurring Something is definitely strange … we’ll see what to do later in inferential statistics The sample mean is approximately normal with mean 2400 (the same as the population) and a standard deviation of 300 / √ 50 = 42.4 Using our calculations for the general normal distribution, 2600 is 200 pounds over 2400, and 200 pounds is 200 / 42.4 = 4.7 From our normal calculator, there is about a 1 chance in 1 million of this occurring The sample mean is approximately normal with mean 2400 (the same as the population) and a standard deviation of 300 / √ 50 = 42.4 Using our calculations for the general normal distribution, 2600 is 200 pounds over 2400, and 200 pounds is 200 / 42.4 = 4.7 The sample mean is approximately normal with mean 2400 (the same as the population) and a standard deviation of 300 / √ 50 = 42.4

Summary: Chapter 8 – Section 1 The sample mean is a random variable with a distribution called the sampling distribution If the sample size n is sufficiently large (30 or more is a good rule of thumb), then this distribution is approximately normal The mean of the sampling distribution is equal to the mean of the population The standard deviation of the sampling distribution is equal to

Chapter 8 – Example 1 The combined (verbal + quantitative reasoning) score on the GRE is normally distributed with mean 1066 and standard deviation 191. (Source: www.ets.org/Media/Tests/GRE/pdf/01210.pdf.) Suppose n = 15 randomly selected students take the GRE on the same day. a. What is the probability that a randomly selected student scores above 1100 on the GRE? b. Describe the sampling distribution of the sample mean. c. What is the probability that a random sample of 15 students has a mean GRE score that is less than 1100? d. What is the probability that a random sample of 15 students has a mean GRE score that is 1100 or above?

Chapter 8 – Example 1 The combined (verbal + quantitative reasoning) score on the GRE is normally distributed with mean 1066 and standard deviation 191. (Source: www.ets.org/Media/Tests/GRE/pdf/01210.pdf.) Suppose n = 15 randomly selected students take the GRE on the same day. a. What is the probability that a randomly selected student scores above 1100 on the GRE? (0.4286) b. Describe the sampling distribution of the sample mean. (It is normal with mean 1066 and standard deviation 49.3) c. What is the probability that a random sample of 15 students has a mean GRE score that is less than 1100? (0.7549) d. What is the probability that a random sample of 15 students has a mean GRE score that is 1100 or above? (0.2451)

Chapter 8 – Example 2 In the United States, the year each coin was minted is printed on the coin. To find the age of a coin, simply subtract the current year from the year printed on the coin. The ages of circulating pennies are right skewed. Assume the ages of circulating pennies have a mean of 12.2 years and a standard deviation of 9.9 years. a. Based on the information given, can we determine the probability that a randomly selected penny is over 10 years old? b. What is the probability that a random sample of 40 circulating pennies has a mean less than 10 years? c. What is the probability that a random sample of 40 circulating pennies has a mean greater than 10 years? d. What is the probability that a random sample of 40 circulating pennies has a mean greater than 15 years? Would this be unusual?

Chapter 8 – Example 2 In the United States, the year each coin was minted is printed on the coin. To find the age of a coin, simply subtract the current year from the year printed on the coin. The ages of circulating pennies are right skewed. Assume the ages of circulating pennies have a mean of 12.2 years and a standard deviation of 9.9 years. a. Based on the information given, can we determine the probability that a randomly selected penny is over 10 years old? (No, because the population of the ages of circulating pennies is not normally distributed.) b. What is the probability that a random sample of 40 circulating pennies has a mean less than 10 years? (0.0793) c. What is the probability that a random sample of 40 circulating pennies has a mean greater than 10 years? (0.9207) d. What is the probability that a random sample of 40 circulating pennies has a mean greater than 15 years? Would this be unusual? (0.0367; yes)