Chapter 10 Sampling and Sampling Distributions

Slides:



Advertisements
Similar presentations
Mean, Proportion, CLT Bootstrap
Advertisements

Chapter 6 Sampling and Sampling Distributions
 These 100 seniors make up one possible sample. All seniors in Howard County make up the population.  The sample mean ( ) is and the sample standard.
Sampling Distributions and Sample Proportions
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 7 Introduction to Sampling Distributions
Chapter 7 Introduction to Sampling Distributions
Weibull exercise Lifetimes in years of an electric motor have a Weibull distribution with b = 3 and a = 5. Find a guarantee time, g, such that 95% of.
Chapter 7 Sampling Distributions
Chapter 6 Introduction to Sampling Distributions
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
Part III: Inference Topic 6 Sampling and Sampling Distributions
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
9.1 Sampling Distributions A parameter is a number that describes the population. A parameter is a fixed number, but in practice we do not know its value.
QUIZ CHAPTER Seven Psy302 Quantitative Methods. 1. A distribution of all sample means or sample variances that could be obtained in samples of a given.
Chapter 6 Sampling and Sampling Distributions
Copyright © 2010 Pearson Education, Inc. Slide
Chapter 7 Sampling Distribution
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Sampling Distributions
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
Sampling Distributions Chapter 7. The Concept of a Sampling Distribution Repeated samples of the same size are selected from the same population. Repeated.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling Distributions.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc.. Chap 7-1 Developing a Sampling Distribution Assume there is a population … Population size N=4.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling and Sampling Distributions.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 7-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 7: Sampling Distributions Section 7.1 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
5.5.3 Means and Variances for Linear Combinations
Basic Business Statistics
Lecture 5 Introduction to Sampling Distributions.
Sampling Distributions: Suppose I randomly select 100 seniors in Anne Arundel County and record each one’s GPA
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Sampling Distribution, Chp Know the difference between a parameter and a statistic.
Sampling Distributions Chapter 18. Sampling Distributions If we could take every possible sample of the same size (n) from a population, we would create.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Chapter 7, part D. VII. Sampling Distribution of The sampling distribution of is the probability distribution of all possible values of the sample proportion.
Variability. The differences between individuals in a population Measured by calculations such as Standard Error, Confidence Interval and Sampling Error.
Variability.
Chapter 7 Review.
Sampling Distributions
Introduction to Sampling Distributions
Sampling Distributions
Slides by JOHN LOUCKS St. Edward’s University.
Chapter 7 Sampling Distributions
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
CHAPTER 15 SUMMARY Chapter Specifics
Sampling Distribution of the Mean
Chapter 7 Sampling Distributions.
Chapter 7 Sampling and Sampling Distributions
Chapter 7 Sampling Distributions.
Presentation transcript:

Chapter 10 Sampling and Sampling Distributions 10.1 Random sampling 10.4 Stratified sampling 10.6 Sampling distribution 10.7 The standard error of the mean 10.8 The central limit theorem

10.1 Random sampling Example: At a parts depot the inventory shows 1000 parts in stock. What percentage of those are actually in stock? Let p = true % To estimate p, take a sample of n parts to check out in the supply room. Then How do we choose the parts to check?

Random sample A random sample is an insurance policy to protect against bias. A simple random sample gives each of the possible sample choices the same chance of being selected.

Random sampling In other words, for a finite population of N sample points: A sample of size n is random if any n sample points have a probability to be selected.

To generalize For an infinite population, or a finite population but sampling with replacement. A value is observed according to a probability distribution. A random sample of size n consists of observed values that are independent and have the same distributions.

10.2 &10.3 Skip

10.4 Stratified sample A simple random sample is not always the best option. Consider the following example. We want to estimate the average number of vireos per area. Conifer Desiduous

Then the population mean and sample mean are Question: if we use simple random sampling, is always a good estimator of ?

Answer: NO. Reasoning: A simple random sample of n=10 locations might all end up Conifer. We would be better off putting n1 samples in conifer and n2 samples in Desiduous.

Stratified sampling A stratified random sample breaks the population into strata and samples randomly within each stratum. In our previous example: Stratum 1 = Deciduous Stratum 2 = Conifer

Notations N1=number of units in stratum 1 N = N1+N2 = number of units in entire population n1=number of units sampled from stratum 1 n2=number of units sampled from stratum 2 n = n1+n2 = number of sampled units

Optimal allocation How do we decide n1 and n2 (when n is fixed)? N1=# of possible sampling locations in conifer N2=# in desiduous σ1=standard deviation of bird counts in conifer σ2=standard deviation of bird counts in desiduous Optimal allocation: (Problem 10.29) Sample more in strata, with --more units (area) --higher variability

Proportional allocation Proportional allocation doesn’t consider σ and makes the sample number from each stratum proportional to the size of the stratum.

Example of proportional allocation If N1=100 N2=300 n=40 Then

A comparison between simple random sampling and stratified sampling Example: Population: Weights of rocks 4, 6, 10, 12. So N=4 and m=8. Sample n=2. Possible results for simple random samples Sample Probability sample mean Probability 4 6 1/6 5 1/6 4 10 1/6 7 1/6 4 12 1/6 8 6 10 1/6 8 6 12 1/6 9 1/6 10 12 1/6 11 1/6 2/6

Example Continued Possible results for a stratified sample. Choose 1 from the smaller rocks and choose 1 from the larger rocks. Sample Probability sample mean Probability 4 10 1/4 7 1/4 4 12 1/4 8 6 10 1/4 8 6 12 1/4 9 1/4 1/2

A comparison between simple random sampling and stratified sampling SRS Stratified The stratified sample is more likely to be close to the true population value, m=8 here.

10.6 Sampling Distributions A statistic (e.g. sample mean) from a random sample or randomized experiment is a random variable and its probability distribution is a sampling distribution. The population distribution of a variable is the distribution of its values for all members of the population. The population distribution is also the probability distribution of the variable when choosing one subject at random from the population.

Approximating the Sampling Distribution Sampling Variability Approximating the Sampling Distribution Results of many random samples of size n=100, from a population where it is known that 60% of the people hate to shop for clothes, population proportion (parameter) p = .6) and sample proportion (statistic). Selecting many samples gives us an idea of the amount of variability between samples of a given size. Most samples of size n = 100 gave estimates close to .6, but some were far off. On average, they centered on .6, they are variable, but unbiased.

Sampling Variability (cont.) Approximating the Sampling Distribution Results of many random samples of size n=2500, from the same population, with population proportion (parameter) p = .6) and sample proportion (statistic). As the size of the sample increases the variability decreases and the degree of confidence increases. Larger samples are more precise (have less variability) AND are unbiased.

Sample size What advantage is there of taking a larger sample? Larger n? Taking a larger sample decreases the potential deviation of away from m. Let be the standard deviation of the sampling distribution of , then the larger the sample size is, the smaller is.

Unbiased estimators P.77 “Estimators having the desirable property that their values will on average equal the quantity they are supposed to estimate are said to be unbiased.” is an unbiased estimator of m. Another example of unbiased estimator is s2 for s2. Choosing non-random samples can introduce bias.

10.7 The Standard Error of the Mean If x is the mean of a sample of size n from a population having mean m and standard deviation s. The mean and standard deviation of x are: (standard error of the mean) for finite populations So, as n increases, decreases. If the sample size is multiplied by 4, the “standard deviation” (standard error of the mean) will be divided in half for infinite populations.

Standard Deviation vs. Standard Error of the Mean The standard error of the mean is smaller than the standard deviation by a factor of the square root of the sample size. The standard deviation describes the variability of individuals. The standard error of the mean describes the accuracy of means of a given size or the potential error in sample mean as a guess at m.

Examples Population size N=4, sample size n=2, 112 150 152 150 152 What is the standard error of the mean?

Examples Infinite Population P(3)=P(5)= P(7)=P(9)= 1/4 What is the standard error of the mean?

Example Governor’s Poll 1, 1, 1 0, 1, 0 0, … 0, … Estimate guess What is the standard error of the mean?

If a population has the N(m,s) distribution, then the sample mean x of n independent observations has the distribution. 10.8 Central Limit Theorem For ANY population with mean m and standard deviation s the sample mean of a random sample of size n is approximately when n is LARGE.

Central Limit Theorem If the population is normal or sample size is large, sample mean follows a normal distribution and follows a standard normal distribution.

The closer x’s distribution is to a normal distribution, the smaller n can be and have the sample mean nearly normal. If x is normal, then the sample mean is normal for any n, even n=1.

Central Limit Theorem At Work

Usually n=30 is big enough so that the sample mean is approximately normal unless the distribution of x is very asymmetrical. If x is not normal, there are often better procedures than using a normal approximation, but we won’t study those options.

Example X=ball bearing diameter X is normal distributed with m=1.0cm and s=0.02cm =mean diameter of n=25 Find out what is the probability that will be off by less than 0.01 from the true population mean.

Exercise The mean of a random sample of size n=100 is going to used to estimate the mean daily milk production of a very large herd of dairy cows. Given that the standard deviation of the population to be sampled is s=3.6 quarts, what can we assert about he probabilities that the error of this estimate will be more then 0.72 quart?