Probability and Samples

Slides:



Advertisements
Similar presentations
Chapter 7: The Distribution of Sample Means
Advertisements

Probability and Samples: The Distribution of Sample Means
Reminders: Parameter – number that describes the population Statistic – number that is computed from the sample data Mean of population = µ Mean of sample.
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
5.2 The Sampling Distribution of a Sample Mean
Sampling Distributions
Suppose we are interested in the digits in people’s phone numbers. There is some population mean (μ) and standard deviation (σ) Now suppose we take a sample.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 6-4 Sampling Distributions and Estimators Created by.
Distribution of Sample Means 2011, 10, 20. Today ’ s Topics What is distribution of sample means?** Properties of distribution of sample means* How to.
Sampling Distributions
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
6-5 The Central Limit Theorem
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 5 Chicago School of Professional Psychology.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Chapter 7 Probability and Samples: The Distribution of Sample Means
Chapter 6 Normal Probability Distributions
Chapter 11: Random Sampling and Sampling Distributions
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
From Last week.
The Central Limit Theorem. 1. The random variable x has a distribution (which may or may not be normal) with mean and standard deviation. 2. Simple random.
Understanding the scores from Test 2 In-class exercise.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
Chapter 7 Probability and Samples: The Distribution of Sample Means
What if the Original Distribution Is Not Normal? Use the Central Limit Theorem.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
Statistics Workshop Tutorial 5 Sampling Distribution The Central Limit Theorem.
Statistics 300: Elementary Statistics Section 6-5.
Chapter 9 Probability. 2 More Statistical Notation  Chance is expressed as a percentage  Probability is expressed as a decimal  The symbol for probability.
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc.. Chap 7-1 Developing a Sampling Distribution Assume there is a population … Population size N=4.
Distributions of the Sample Mean
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Distribution of the Sample Mean (Central Limit Theorem)
Slide Slide 1 Section 6-5 The Central Limit Theorem.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Section 6-5 The Central Limit Theorem. THE CENTRAL LIMIT THEOREM Given: 1.The random variable x has a distribution (which may or may not be normal) with.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
The Central Limit Theorem 1. The random variable x has a distribution (which may or may not be normal) with mean and standard deviation. 2. Simple random.
AP Review #3: Continuous Probability (The Normal Distribution)
PSY 307 – Statistics for the Behavioral Sciences Chapter 9 – Sampling Distribution of the Mean.
Chapter 18: Sampling Distribution Models
§ 5.3 Normal Distributions: Finding Values. Probability and Normal Distributions If a random variable, x, is normally distributed, you can find the probability.
1 Chapter 5. Section 5-5. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION E LEMENTARY.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Chapter 7: The Distribution of Sample Means. Frequency of Scores Scores Frequency.
Chapter 18 - Part 2 Sampling Distribution Models for.
Introduction to Inference Sampling Distributions.
Chapter 18: The Central Limit Theorem Objective: To apply the Central Limit Theorem to the Normal Model CHS Statistics.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
Lecture 5 Introduction to Sampling Distributions.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
Distributions of Sample Means. z-scores for Samples  What do I mean by a “z-score” for a sample? This score would describe how a specific sample is.
Lesson Distribution of the Sample Mean. Objectives Understand the concept of a sampling distribution Describe the distribution of the sample mean.
m/sampling_dist/index.html.
Sampling Distributions
And distribution of sample means
Chapter 7 Probability and Samples
Central Limit Theorem Section 5-5
Sampling Distribution of a sample Means
Sampling Distributions and Estimators
Sec. 7-5: Central Limit Theorem
SAMPLING DISTRIBUTION. Probability and Samples Sampling Distributions Central Limit Theorem Standard Error Probability of Sample Means.

Chapter 18: Sampling Distribution Models
Probability and the Sampling Distribution
Chapter 7: The Distribution of Sample Means
The Central Limit Theorem
Presentation transcript:

Probability and Samples Sampling Distributions Central Limit Theorem Standard Error Probability of Sample Means

Inferential Statistics tomorrow and beyond Population Sample Probability last week and today

- getting a certain type of individual when we sample once When we take a sample from a population we can talk about the probability of - getting a certain type of individual when we sample once today last Thursday - getting a certain type of sample mean when n>1

Distribution of Individuals in a Population 10 20 30 40 50 60 1 2 3 frequency 4 5 6 raw score 70 p(X > 50) = ?

Distribution of Individuals in a Population 10 20 30 40 50 60 1 2 3 frequency 4 5 6 raw score 70 1 p(X > 50) = = 0.11 9

Distribution of Individuals in a Population 10 20 30 40 50 60 1 2 3 frequency 4 5 6 raw score 70 p(X > 30) = ?

Distribution of Individuals in a Population 10 20 30 40 50 60 1 2 3 frequency 4 5 6 raw score 70 6 p(X > 30) = = 0.66 9

Distribution of Individuals in a Population 6 normally distributed  = 40,  = 10 5 frequency 4 3 2 1 10 20 30 40 50 60 70 p(40 < X < 60) = ?

Distribution of Individuals in a Population 6 normally distributed  = 40,  = 10 5 frequency 4 3 2 1 10 20 30 40 50 60 70 p(40 < X < 60) = p(0 < Z < 2) = 47.7%

Distribution of Individuals in a Population 6 normally distributed  = 40,  = 10 5 frequency 4 3 2 raw score 1 10 20 30 40 50 60 70 p(X > 60) = ?

Distribution of Individuals in a Population 6 normally distributed  = 40,  = 10 5 frequency 4 3 2 raw score 1 10 20 30 40 50 60 70 p(X > 60) = p(Z > 2) = 2.3%

For the preceding calculations to be accurate, it is necessary that the sampling process be random. A random sample must satisfy two requirements: Each individual in the population has an equal chance of being selected. If more than one individual is to be selected, there must be constant probability for each and every selection (i.e. sampling with replacement).

Distribution of Sample Means A distribution of sample means is: the collection of sample means for all the possible random samples of a particular size (n) that can be obtained from a population.

Population 6 5 frequency 4 3 2 1 raw score 1 2 3 4 5 6 7 8 9

Distribution of Sample Means from Samples of Size n = 2 Sample # Scores Mean ( ) 1 2, 2 2 2,4 3 2,6 4 2,8 5 4,2 6 4,4 7 4,6 8 4,8 9 6,2 10 6,4 11 6,6 12 6,8 13 8,2 14 8,4 15 8.6 16 8.8

Distribution of Sample Means from Samples of Size n = 2 6 5 frequency 4 3 2 1 1 2 3 4 5 6 7 8 9 sample mean We can use the distribution of sample means to answer probability questions about sample means

Distribution of Sample Means from Samples of Size n = 2 6 5 frequency 4 3 2 1 1 2 3 4 5 6 7 8 9 sample mean p( > 7) = ?

Distribution of Sample Means from Samples of Size n = 2 6 5 frequency 4 3 2 1 1 2 3 4 5 6 7 8 9 sample mean 1 p( > 7) = = 6 % 16

Distribution of Individuals in Population 1 2 3 4 5 6 frequency raw score 7 8 9  = 5,  = 2.24 X = 5, X = 1.58 Distribution of Sample Means 6 5 frequency 4 3 2 1 1 2 3 4 5 6 7 8 9 sample mean

Distribution of Individuals 1 2 3 4 5 6 frequency raw score 7 8 9  = 5,  = 2.24 Distribution of Sample Means 6 X = 5, X = 1.58 5 frequency 4 3 2 p(X > 7) = 25% 1 1 2 3 4 5 6 7 8 9 sample mean p(X> 7) = 6% , for n=2

A key distinction Population Distribution – distribution of all individual scores in the population Sample Distribution – distribution of all the scores in your sample Sampling Distribution – distribution of all the possible sample means when taking samples of size n from the population. Also called “the distribution of sample means”.

Distribution of Individuals in Population 1 2 3 4 5 6 frequency raw score 7 8 9  = 5,  = 2.24 X = 5, X = 1.58 Distribution of Sample Means 6 5 frequency 4 3 2 1 1 2 3 4 5 6 7 8 9 sample mean

Things to Notice Distribution of Sample Means frequency 1 2 3 4 5 6 frequency 7 8 9 sample mean Distribution of Sample Means Things to Notice The sample means tend to pile up around the population mean. The distribution of sample means is approximately normal in shape, even though the population distribution was not. The distribution of sample means has less variability than does the population distribution.

What if we took a larger sample?

Distribution of Sample Means from Samples of Size n = 3 24 22 20 X = 5, X = 1.29 18 frequency 16 14 1 64 = 2 % p( X > 7) = 12 10 8 6 4 2 1 2 3 4 5 6 7 8 9 sample mean

Distribution of Sample Means As the sample gets bigger, the sampling distribution… stays centered at the population mean. becomes less variable. becomes more normal.

Central Limit Theorem For any population with mean  and standard deviation , the distribution of sample means for sample size n … will have a mean of  will have a standard deviation of will approach a normal distribution as n approaches infinity

the mean of the sampling distribution Notation the mean of the sampling distribution the standard deviation of sampling distribution (“standard error of the mean”) 

Standard Error The “standard error” of the mean is: The standard deviation of the distribution of sample means. The standard error measures the standard amount of difference between x-bar and  that is reasonable to expect simply by chance. SE =

Standard Error The Law of Large Numbers states: The larger the sample size, the smaller the standard error. This makes sense from the formula for standard error …

Distribution of Individuals in Population 1 2 3 4 5 6 frequency raw score 7 8 9  = 5,  = 2.24 X = 5, X = 1.58 Distribution of Sample Means 6 5 frequency 4 3 2 1 1 2 3 4 5 6 7 8 9 sample mean

Sampling Distribution (n = 3) 1 2 3 4 5 6 frequency 8 10 12 7 9 sample mean 14 16 18 20 22 24 X = 5 X = 1.29

Clarifying Formulas Distribution of Sample Means Population Sample notice

will have a standard deviation of Central Limit Theorem For any population with mean  and standard deviation , the distribution of sample means for sample size n … will have a mean of  will have a standard deviation of will approach a normal distribution as n approaches infinity What does this mean in practice?

Practical Rules Commonly Used: 1. For samples of size n larger than 30, the distribution of the sample means can be approximated reasonably well by a normal distribution. The approximation gets better as the sample size n becomes larger. 2. If the original population is itself normally distributed, then the sample means will be normally distributed for any sample size. small n large n normal population non-normal population

Probability and the Distribution of Sample Means The primary use of the distribution of sample means is to find the probability associated with any specific sample.

Probability and the Distribution of Sample Means Example: Given the population of women has normally distributed weights with a mean of 143 lbs and a standard deviation of 29 lbs, if one woman is randomly selected, find the probability that her weight is greater than 150 lbs. if 36 different women are randomly selected, find the probability that their mean weight is greater than 150 lbs.

Given the population of women has normally distributed weights with a mean of 143 lbs and a standard deviation of 29 lbs, if one woman is randomly selected, find the probability that her weight is greater than 150 lbs. 150  = 143 = 29 Population distribution z = 150-143 = 0.24 29 0.4052 0.24

Given the population of women has normally distributed weights with a mean of 143 lbs and a standard deviation of 29 lbs, if 36 different women are randomly selected, find the probability that their mean weight is greater than 150 lbs. 150  = 143 = 4.33 Sampling distribution z = 150-143 = 1.45 4.33 0.0735 1.45

Probability and the Distribution of Sample Means Example: Given the population of women has normally distributed weights with a mean of 143 lbs and a standard deviation of 29 lbs, if one woman is randomly selected, find the probability that her weight is greater than 150 lbs. if 36 different women are randomly selected, find the probability that their mean weight is greater than 150 lbs.

Practice Given a population of 400 automobile models, with a mean horsepower = 105 HP, and a standard deviation = 40 HP, Example: What is the standard error of the sample mean for a sample of size 1? What is the standard error of the sample mean for a sample of size 4? What is the standard error of the sample mean for a sample of size 25? 40 20 8

Practice Example: Given a population of 400 automobile models, with a mean horsepower = 105 HP, and a standard deviation = 40 HP, if one model is randomly selected from the population, find the probability that its horsepower is greater than 120. If 4 models are randomly selected from the population, find the probability that their mean horsepower is greater than 120 If 25 models are randomly selected from the population, find the probability that their mean horsepower is greater than 120 .35 .23 .03