STT 350: SURVEY SAMPLING Dr. Cuixian Chen Chapter 4: Simple Random Sampling (SRS) Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow.

Slides:



Advertisements
Similar presentations
Introduction Simple Random Sampling Stratified Random Sampling
Advertisements

Estimation of Means and Proportions
Mean, Proportion, CLT Bootstrap
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Sampling: Final and Initial Sample Size Determination
POINT ESTIMATION AND INTERVAL ESTIMATION
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 9.1 Chapter 9 Sampling Distributions.
Copyright © 2009 Cengage Learning 9.1 Chapter 9 Sampling Distributions.
Class notes for ISE 201 San Jose State University
The standard error of the sample mean and confidence intervals
The standard error of the sample mean and confidence intervals
Point estimation, interval estimation
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Sampling Distributions
Introduction to Probability and Statistics Chapter 7 Sampling Distributions.
AP Statistics Section 10.2 A CI for Population Mean When is Unknown.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Chapter Sampling Distributions 8 © 2010 Pearson Prentice Hall. All rights reserved.
Sampling Distributions & Point Estimation. Questions What is a sampling distribution? What is the standard error? What is the principle of maximum likelihood?
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Chapter 5 Several Discrete Distributions General Objectives: Discrete random variables are used in many practical applications. These random variables.
Chapter 5 Sampling Distributions
Slide Slide 1 Chapter 8 Sampling Distributions Mean and Proportion.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
AP Statistics Chapter 9 Notes.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
PARAMETRIC STATISTICAL INFERENCE
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
AP Statistics Section 9.3A Sample Means. In section 9.2, we found that the sampling distribution of is approximately Normal with _____ and ___________.
Sampling W&W, Chapter 6. Rules for Expectation Examples Mean: E(X) =  xp(x) Variance: E(X-  ) 2 =  (x-  ) 2 p(x) Covariance: E(X-  x )(Y-  y ) =
LECTURE 3 SAMPLING THEORY EPSY 640 Texas A&M University.
Chapter 5 Section 1 – Part 2. Samples: Good & Bad What makes a sample bad? What type of “bad” sampling did we discuss previously?
STA Lecture 181 STA 291 Lecture 18 Exam II Next Tuesday 5-7pm Memorial Hall (Same place) Makeup Exam 7:15pm – 9:15pm Location TBA.
Chapter 4 Simple Random Sampling (SRS). SRS SRS – Every sample of size n drawn from a population of size N has the same chance of being selected. Use.
Biostatistics Unit 5 – Samples. Sampling distributions Sampling distributions are important in the understanding of statistical inference. Probability.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
6.1 Inference for a Single Proportion  Statistical confidence  Confidence intervals  How confidence intervals behave.
Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5.
Copyright © 2009 Cengage Learning 9.1 Chapter 9 Sampling Distributions ( 표본분포 )‏
Inference: Probabilities and Distributions Feb , 2012.
Chapter 9 Sampling Distributions Sir Naseer Shahzada.
1 CHAPTER 6 Sampling Distributions Homework: 1abcd,3acd,9,15,19,25,29,33,43 Sec 6.0: Introduction Parameter The "unknown" numerical values that is used.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 13 Sampling distributions
STT 350: SURVEY SAMPLING Dr. Cuixian Chen Chapter 2: Elements of the Sampling Problem Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow.
STT 350: SURVEY SAMPLING Dr. Cuixian Chen Chapter 3: Some basic concepts of statistics Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow.
STT 350: SURVEY SAMPLING Dr. Cuixian Chen Chapter 5: Stratified Random Samples Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 1 Chapter.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
AP STATS: WARM UP I think that we might need a bit more work with graphing a SAMPLING DISTRIBUTION. 1.) Roll your dice twice (i.e. the sample size is 2,
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Estimation and Confidence Intervals. Sampling and Estimates Why Use Sampling? 1. To contact the entire population is too time consuming. 2. The cost of.
Chapter 8 Confidence Interval Estimation Statistics For Managers 5 th Edition.
Chapter 5 Stratified Random Samples. What is a stratified random sample and how to get one Population is broken down into strata (or groups) in such a.
Ch 9 實習.
Keller: Stats for Mgmt & Econ, 7th Ed Sampling Distributions
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Virtual University of Pakistan
Inference: Conclusion with Confidence
Sampling Distributions
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
Keller: Stats for Mgmt & Econ, 7th Ed Sampling Distributions
Confidence intervals for the difference between two means: Independent samples Section 10.1.
STT 350: Survey sampling Dr. Cuixian Chen
Ch 9 實習.
Presentation transcript:

STT 350: SURVEY SAMPLING Dr. Cuixian Chen Chapter 4: Simple Random Sampling (SRS) Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 1 Chapter 4

SRS  SRS – Every sample of size n drawn from a population of size N has the same chance of being selected.  Use table of random numbers (A.2) or computer software.  Using the table:  Assign every sampling unit a digit  Use table of random numbers to select sample

Example  In a population of N = 450, select a sample of size 10 using the table of random digits.  Assume: Table A.2 and use the second column; if we drop the last two digits of each number. (Note: we neglect the repeated numbers  Without replacement)  Starting digit value_______  Ending digit value_______  Line number started at _______  Sample digits selected for sample:

Review: Section 3.3 Summarizing Info in Populations and Samples: The Finite Population Case  If the population is infinitely large, we can assume sampling without replacement (probabilities of selecting observations are independent)  However, if population is finite, then probabilities of selecting elements will change as more elements are selected (Example: rolling a die versus selecting cards from standard 52 card deck) Chapter 3Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 4

Review: section 3.3 Sampling distribution from infinite populations  For randomly selected samples from infinite populations, mathematical properties of expected value can be used to derive the facts that:  It can also be shown that the variance of the sample mean can be estimated unbiasedly by: Chapter 3Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 5

Estimating population average with SRS chosen w/o replacement in Finite Population (N)  We use (  y i /n) to estimate  ( is an unbiased estimator of  )  We use s 2 to estimate  2 (unbiased estimator)  From previous, we know that V( ) =  2 /n (infinite population….or extremely large)  If finite population, then V( ) = ( (N-n)/(N-1)) (  2 /n)  When we replace  2 by s 2, this becomes estimated variance of y-bar = (1-(n/N))(s 2 /n)

Bound on the error of estimation for with SRS chosen w/o replacement in Finite Population (N)  For C=95%, use 2 standard errors as our bound (think of MOE), we have 2sqrt( (1-(n/N))(s 2 /n)).  When can the finite population correction (fpc) be dropped? A good rule of thumb is when (1-n/N) > 0.95  Want data to be approximately normal (sometimes transformations can be used…..the log transformation is one of the most popular transformations)

Estimating population average with SRS chosen w/o replacement in Finite Population (N) Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 8

Eg4.2, page 85  a) Estimate mu;  b) For C=95%, find the bound on the error of estimation for mu;  c) Find 95% CI for mu. Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 9

EX4.16, page 104 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 10  a) Estimate mu;  b) For C=95%, find the bound on the error of estimation for mu;  c) Find 95% CI for mu.

Estimating population total using SRS chosen w/o replacement in Finite Population (N)  Since a SRS assumes all observations have an equally likely chance to be selected, we set  i to be  i = n/N  We use  -hat to estimate  ( =  y i /  i =N*y-bar is an unbiased estimator of  )  Therefore, for finite population, V(  ) = N 2 ( (N-n)/(N-1)) (  2 /n)  When we replace  2 by s 2, this becomes  estimated variance of  = N 2 (1-(n/N))(s 2 /n)

Estimating population total using SRS chosen w/o replacement in Finite Population (N)

Bound on the error of estimation  For C=95%, use 2 standard errors as our bound (think of MOE), we have 2sqrt( N 2 (1-(n/N))(s 2 /n))  Normality is still important here!! (transform if necessary….i.e. small sample size and skewed data)

Eg4.4, page 87 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 14  a) Estimate the Total  ;  b) For C=95%, find the bound on the error of estimation for  ;  c) Find 95% CI for the Total .

EX4.17, page 104 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 15  a) Estimate the Total  ;  b) For C=95%, find the bound on the error of estimation for  ;  c) Find 95% CI for the Total . From above

Selecting Sample Size for  using SRS chosen w/o replacement in Finite Population (N)  Use variance of y-bar: V(y-bar) = ( (N-n)/(N-1)) (  2 /n).  Set B = 2sqrt(V(y-bar)), which is B = 2sqrt(( (N-n)/(N-1)) (  2 /n) ) and solve for n. ….which yields n = (N  2 )/((N-1)D+  2 ) where D=B 2 /4  Since  2 is usually not known, estimate it with s 2 (or s is approximately range/4)

Selecting Sample Size for  using SRS chosen w/o replacement in Finite Population (N) Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 17

Eg4.5, page 89 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 18  a) Find sample size n.

EX , page 106 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 19  a) Based on the information from EX4.23, find sample size n for EX4.24.

Selecting Sample Size for  using SRS chosen w/o replacement in Finite Population (N)  Set B = 2sqrt(N 2 V(y-bar)), which is B = 2sqrt(N 2 ( (N-n)/(N-1)) (  2 /n) ) and solve for n ….which yields n = (N  2 )/((N-1)D+  2 ) where D=B 2 /(4N 2 )  Since  2 is usually not known, estimate it with s 2 (or s is approximately range/4)

Selecting Sample Size for  using SRS chosen w/o replacement in Finite Population (N) Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 21

Eg4.6, page 90 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 22  a) Find sample size n.

EX , page 106 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 23  a) Based on the information from EX4.27, find sample size n for EX4.28.

4.5 Estimation of a Population Proportion using SRS chosen w/o replacement in Finite Population (N)  Eg: Are you Hispanic or not? (YES / NO)  Define y i as 0 (if unit does not have quantity of interest) and y i =1 (if unit does have quantity of interest)  Then p-hat =  y i /n. (special case of sample mean)  p-hat is an unbiased estimator of p.  Estimated variance of p-hat (for infinite sample sizes) is p- hat*q-hat/n  Estimated variance of p-hat (for finite sample sizes) is (1- n/N)(p-hat*q-hat)/(n-1), where q-hat= 1-p-hat  Bound = 2*sqrt(Estimated variance of p-hat)  Problem 4.14

4.5 Estimation of a Population Proportion using SRS chosen w/o replacement in Finite Population (N)

EX4.14, page 104 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 26

To estimate sample size  n = Npq/( (N-1)D + pq ) where D = B 2 /4  If p is unknown, then we use p = 0.5  Normality is important here!!  Problem 4.15  Question: All the bounds that we have looked at so far assumes what level of confidence?

4.6 Comparing two Estimates  Comparing two means, or two totals or two proportions:  Quantity of interest is  hat1 -  hat2  Variance of quantity of interest is V(  hat1 ) + V(  hat2 ) – 2cov(  hat1,  hat2 ) ********NOTE: We will NOT be using finite population correction factor in this section!!  If statistics come from two independent samples, then cov(  hat1,  hat2 ) = 0  Problem 4.18

4.6 Comparing two Estimates  When comparing means, we consider only the independent-sample case because the dependent case becomes too complicated to handle at this level.  When comparing proportions, however, a commonly occurring dependent situation does have a rather simple solution. Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 29

4.6 Comparing two Estimates  When comparing means, we consider only the independent- sample case because the dependent case becomes too complicated to handle at this level.  When comparing proportions, however, a commonly occurring dependent situation does have a rather simple solution. Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 30

EX4.18, page 104: comparing means Chapter 4 31

Eg4.11, page 98 Chapter 4Elementary Survey Sampling, 7E, Scheaffer, Mendenhall, Ott and Gerow 32

Extra Examples  A question asked to high school students was if they lied to a teacher at least one during the past year. The information is presented below MaleFemale Lied at least once Yes No Find the estimated difference in proportion for those who lied at least once to the teacher during the past year by gender. Place a bound on this estimated difference.* *Source: Moore, McCabe and Craig

Extra Multinomial example  If statistics are from a multinomial distribution, then cov(  hat1,  hat2 ) = (-p 1 p 2 /n)  In a class with 30 students, the table below illustrates the breakdown of class: Freshmen10 Sophomore5 Junior7 Senior8 Estimate the difference in percent Freshmen and percent Junior and place a bound on this difference.