Chapter 7 Probability and Samples: The Distribution of Sample Means.


Samples and Sampling Error
So far, z-scores and probabilities have been applied only to cases where the sample consists of a single score.
This chapter extends the concepts of z-scores and probability to cover situations with larger samples.
Ex: a z-score for an entire sample.

Z-scores (review)
A z-score describes exactly where a score is located in the distribution.
Ex: a z-score of is extreme

Figure 6.4 The normal distribution following a z-score transformation. Labeled regions: extreme sample; central, representative sample. Copyright © 2002 Wadsworth Group. Wadsworth is an imprint of the Wadsworth Group, a division of Thomson Learning.

Probability (review)
If the distribution of scores is normal, you should be able to determine the probability value for each score.
A score with a z-score of has a probability of only p = .0028

Z-Scores
So far we have been limited to situations where the sample consists of a single score.
Most studies use larger samples.
We will now extend the concepts of z-scores and probability to cover situations with larger samples.

A z-score near zero indicates a central, representative sample.
A z-score beyond +/- indicates an extreme sample.
It will be possible to determine exact probabilities for a sample.

Difficulties with using samples
Samples provide an incomplete picture of the population.
Any statistics computed from a sample will not be identical to the corresponding parameters for the entire population.
Ex: the mean IQ for a sample of 25 students will differ from the mean IQ of the entire population.
The difference is called sampling error.

Sampling Error
Sampling error is the discrepancy, or amount of error, between a sample statistic and its corresponding population parameter.

Questions
How can you tell which sample is giving the best description of the population?
Can you predict how a sample will describe its population?
What is the probability of selecting a sample that has a certain sample mean?
We can answer these, but we need to set rules that relate samples to populations.

Distribution of Sample Means
Different samples produce different results.
However, the huge set of all possible samples forms a relatively simple, orderly, and predictable pattern.
This pattern makes it possible to predict the characteristics of a sample with some accuracy.

Distribution of Sample Means (cont.)
The ability to predict sample characteristics is based on the distribution of sample means.
The distribution of sample means is the collection of sample means for all the possible random samples of a particular size (n) that can be obtained from a population.

Distribution of Sample Means (cont.)
It is necessary to have all the possible values in order to compute probabilities.
If a set has 100 samples, the probability of obtaining any specific sample is 1 out of 100, or p = 1/100.

Before, we discussed only scores; now we are discussing statistics (sample means).
Because statistics are obtained from samples, a distribution of statistics is referred to as a sampling distribution.

Sampling Distribution
A sampling distribution is a distribution of statistics obtained by selecting all the possible samples of a specific size from a population.

To construct the distribution of sample means:
Take a sample.
Compute its mean.
Replace the scores.
Take the next sample.
Compute its mean.
Replace the scores.
Repeat until you have obtained every possible sample.
See Example 7.1: a population of 4 scores with n = 2 gives 16 sample means (histogram on p. 147).

Sample Means
Note that the sample means tend to pile up around the population mean, μ = 5.
The sample means are clustered around a value of 5.

Sample Means (cont.)
Samples are supposed to be representative of the population.
Therefore, the sample means tend to approximate the population mean.

Sample Means (cont.)
The distribution of sample means is approximately normal in shape.
We can use the distribution of sample means to answer probability questions about sample means.
Ex: if you take a sample of n = 2 scores from the original population, what is the probability of obtaining a sample mean greater than 7?
P(X̄ > 7) = ?

Figure 7.1 Frequency distribution for a population of four scores: 2, 4, 6, 8

Table 7.1 The possible samples of n = 2 scores from the population in Figure 7.1
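The contents of Table 7.1 did not survive in this transcript, but the caption says what it held. The sketch below rebuilds the list of possible samples, assuming sampling with replacement where order matters. That assumption is what yields the 16 samples mentioned for Example 7.1. The variable names are illustrative only.

```python
from collections import Counter
from itertools import product

population = [2, 4, 6, 8]  # the four scores from Figure 7.1
n = 2                      # sample size

# Sampling with replacement: every ordered pair of scores is a possible sample.
samples = list(product(population, repeat=n))
sample_means = [sum(s) / n for s in samples]

for number, (s, m) in enumerate(zip(samples, sample_means), start=1):
    print(f"sample {number:2d}: scores {s}  mean {m:g}")

# Frequency of each sample mean, drawn as a rough text histogram.
frequency = Counter(sample_means)
for value in sorted(frequency):
    print(f"mean {value:g}: {'#' * frequency[value]}")
```

The histogram peaks at 5, the population mean, and thins out toward 2 and 8.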

Ex: if you take a sample of n = 2 scores from the original population, what is the probability of obtaining a sample mean greater than 7?
P(X̄ > 7) = ?
Because probability is equivalent to proportion, the probability question can be restated as follows:

Of all the possible sample means, what proportion has values greater than 7?
Figure 7.2 shows all the possible sample means, and only 1 of the 16 means has a value greater than 7.
Answer: 1 out of 16, or p = 1/16.
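The same proportion can be checked by counting. This is a minimal sketch that assumes the population 2, 4, 6, 8 and ordered samples of n = 2 drawn with replacement. Exact fractions are used so the result prints as a ratio.

```python
from fractions import Fraction
from itertools import product

population = [2, 4, 6, 8]
n = 2

# One exact mean per possible sample (16 samples in total).
sample_means = [Fraction(sum(s), n) for s in product(population, repeat=n)]

# Probability = proportion of sample means strictly greater than 7.
greater_than_7 = [m for m in sample_means if m > 7]
p = Fraction(len(greater_than_7), len(sample_means))

print(p)  # 1/16
```

Only the sample (8, 8) has a mean above 7. Samples such as (6, 8) have a mean of exactly 7 and do not count.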

Figure 7.2 The distribution of sample means for n = 2

The Central Limit Theorem
It might not be possible to list all the samples and compute all the possible sample means.
As the size of n increases, the number of possible samples increases too.
Therefore, it is necessary to develop general characteristics of the distribution of sample means that can be applied in any situation.
These characteristics are specified in the Central Limit Theorem.
It is a cornerstone for much of inferential statistics.

Central Limit Theorem
For any population with mean μ and standard deviation σ, the distribution of sample means for sample size n will have a mean of μ and a standard deviation of σ/√n, and will approach a normal distribution as n approaches infinity.
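The mean and standard deviation claims can be verified exactly on the small population from Figure 7.1. The sketch below assumes the 16 ordered samples of n = 2 drawn with replacement. It says nothing about the shape claim, which only holds as n grows.

```python
from itertools import product
from math import isclose, sqrt
from statistics import mean, pstdev

population = [2, 4, 6, 8]
n = 2

mu = mean(population)       # population mean
sigma = pstdev(population)  # population standard deviation

# The full distribution of sample means for n = 2.
sample_means = [sum(s) / n for s in product(population, repeat=n)]

mean_of_means = mean(sample_means)
sd_of_means = pstdev(sample_means)

print(f"mu = {mu}, mean of sample means = {mean_of_means}")
print(f"sigma / sqrt(n) = {sigma / sqrt(n):.4f}, sd of sample means = {sd_of_means:.4f}")
```

Both pairs of values agree: the sample means average to μ, and their standard deviation equals σ/√n.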

Central Limit Theorem
It describes the distribution of sample means for any population, no matter what shape, mean, or standard deviation.
The distribution of sample means "approaches" a normal distribution very rapidly.
It describes the distribution of sample means by identifying the three basic characteristics that describe any distribution: shape, central tendency, and variability.

Shape of the Distribution of Sample Means
The distribution of sample means tends to be a normal distribution.
It is almost perfectly normal if either of the following holds:
The population from which the samples are selected is a normal distribution.
The number of scores (n) in each sample is relatively large, around 30 or more.
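A small simulation can illustrate the second condition. The sketch below is not from the slides. It assumes a strongly skewed exponential population, a fixed random seed, and 5000 replications, all chosen only for illustration. It compares the skewness of individual scores with the skewness of means of n = 30 scores.

```python
import random
from statistics import fmean, pstdev

def skewness(values):
    """Population skewness: the average cubed z-score (0 for a symmetric shape)."""
    m = fmean(values)
    s = pstdev(values)
    return fmean(((v - m) / s) ** 3 for v in values)

rng = random.Random(0)  # fixed seed so the run is repeatable
n = 30                  # scores per sample
replications = 5000     # number of simulated samples

# Individual scores from a skewed (exponential) population.
raw_scores = [rng.expovariate(1.0) for _ in range(replications)]

# Means of samples of n scores from the same population.
sample_means = [
    fmean(rng.expovariate(1.0) for _ in range(n))
    for _ in range(replications)
]

skew_raw = skewness(raw_scores)
skew_means = skewness(sample_means)
print(f"skewness of single scores: {skew_raw:.2f}")
print(f"skewness of sample means:  {skew_means:.2f}")
```

The sample means should show much less skew than the raw scores, which is the sense in which their distribution is close to normal at n = 30.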

Mean of the Distribution of Sample Means: The Expected Value of X̄
The mean of the distribution of sample means is equal to μ (the population mean) and is called the expected value of X̄.

Standard Error of X̄
We have considered the shape and the central tendency of the distribution of sample means.
To completely describe this distribution, we need one more characteristic: variability.

Standard Error of X̄
We will be working with the standard deviation of the distribution of sample means.
It is called the standard error of X̄.
The standard error defines the standard, or typical, distance from the mean.

Remember, a sample is not expected to provide a perfectly accurate reflection of its population.
There will be some error between the sample and the population.

Standard Error of X̄
The standard deviation of the distribution of sample means is called the standard error of X̄.
The standard error measures the standard amount of difference between X̄ and μ due to chance.

Standard Error of X̄
Standard error = σ_X̄ = standard distance between X̄ and μ.
The symbol σ indicates that we are measuring a standard deviation, or a standard distance from the mean.
The subscript X̄ indicates that we are measuring the standard deviation for a distribution of sample means.

Standard Error
The standard error is valuable because it specifies precisely how well a sample mean estimates its population mean.
It tells you how much error to expect on average.
This is what lets us use the sample mean as an estimate of the population mean.

Standard Error
The magnitude of the standard error is determined by two factors:
The size of the sample: the larger the sample size (n), the more probable it is that the sample mean will be close to the population mean.
The standard deviation of the population from which the sample is selected.
standard error = σ_X̄ = σ/√n

Standard Error
When the sample size increases, the standard error decreases.
As n decreases, the standard error increases.
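The relationship between sample size and standard error can be tabulated directly from σ_X̄ = σ/√n. This is a minimal sketch. The value σ = 100 and the sample sizes are chosen only for illustration, and `standard_error` is a hypothetical helper name.

```python
from math import sqrt

def standard_error(sigma, n):
    """Standard deviation of the distribution of sample means: sigma / sqrt(n)."""
    return sigma / sqrt(n)

sigma = 100  # illustrative population standard deviation
for n in (1, 4, 25, 100):
    print(f"n = {n:3d}: standard error = {standard_error(sigma, n):.1f}")
```

With n = 1 the standard error equals σ itself. Quadrupling the sample size halves the standard error.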

Probability and the Distribution of Sample Means
The primary use of the distribution of sample means is to find the probability associated with any specific sample.
Remember that probability is equivalent to proportion.
Because the distribution of sample means presents the entire set of all possible X̄ values, we can use proportions of this distribution to determine probabilities.

Example 7.2
Population of SAT scores: μ = 500, σ = 100.
If you take a random sample of n = 25 students, what is the probability that the sample mean would be greater than X̄ = 540?

Restate the probability question as a proportion question:
Out of all the possible sample means, what proportion has values greater than 540?
"All the possible sample means" is the distribution of sample means.
The problem is to find a specific portion of this distribution.

What we know:
The distribution is normal because the population of SAT scores is normal.
The distribution has a mean of 500 because the population mean is μ = 500.
The distribution has a standard error of σ_X̄ = 20, because σ_X̄ = σ/√n = 100/√25 = 100/5 = 20.

Figure 7.3 A distribution of sample means

We are interested in sample means greater than 540 (the shaded area).
Next, find the z-score that defines the exact location of X̄ = 540.
The value 540 is located above the mean by 40 points.
This is 2 standard deviations (in this case, 2 standard errors) above the mean.
The z-score for X̄ = 540 is z = +2.00.

Because this distribution of sample means is normal, you can use the unit normal table to find the probability associated with z = +2.00.
The table indicates that 0.0228 of the distribution is located in the tail beyond z = +2.00.
Conclusion: it is very unlikely, p = 0.0228 (2.28%), to obtain a random sample of n = 25 students with an average SAT score greater than 540.
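The steps of Example 7.2 can be reproduced without a printed table. The sketch below uses Python's standard-library `statistics.NormalDist` in place of the unit normal table. It assumes μ = 500 and σ = 100, the values implied by the stated mean of 500 and standard error of 20 with n = 25.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 500, 100, 25  # population mean, population sd, sample size
sample_mean = 540            # the sample mean in question

standard_error = sigma / sqrt(n)          # 100 / 5 = 20
z = (sample_mean - mu) / standard_error   # (540 - 500) / 20 = 2

# Upper-tail area of the standard normal distribution beyond z.
p = 1 - NormalDist().cdf(z)

print(f"standard error = {standard_error:.0f}")
print(f"z = {z:+.2f}, p = {p:.4f}")  # z = +2.00, p = 0.0228
```

The tail area matches the table value of 2.28% quoted in the conclusion.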

Z-scores
It is possible to use a z-score to describe the position of any specific sample within the distribution of sample means.
The z-score tells exactly where a specific sample is located in relation to all the other possible samples that could have been obtained.

Figure 7.8 Showing standard error in a graph