Formalizing the Concepts: Simple Random Sampling.

Slides:



Advertisements
Similar presentations
Introduction Simple Random Sampling Stratified Random Sampling
Advertisements

Estimation in Sampling
Statistics for Managers Using Microsoft® Excel 5th Edition
Economics 105: Statistics Review #1 due next Tuesday in class Go over GH 8 No GH’s due until next Thur! GH 9 and 10 due next Thur. Do go to lab this week.
MKTG 3342 Fall 2008 Professor Edward Fox
Chapter 7 Sampling Distributions
Taejin Jung, Ph.D. Week 8: Sampling Messages and People
MISUNDERSTOOD AND MISUSED
Dr. Chris L. S. Coryn Spring 2012
Sampling.
Why sample? Diversity in populations Practicality and cost.
Statistical Inference and Sampling Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Sampling What is it? –Drawing a conclusion about the entire population from selection of limited elements in a.
11 Populations and Samples.
7/2/2015 (c) 2001, Ron S. Kenett, Ph.D.1 Sampling for Estimation Instructor: Ron S. Kenett Course Website:
A new sampling method: stratified sampling
SAMPLING METHODS. Reasons for Sampling Samples can be studied more quickly than populations. A study of a sample is less expensive than studying an entire.
Understanding sample survey data
Sampling Designs and Sampling Procedures
Sample Design.
Chapter 7 Estimation: Single Population
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
Sampling January 9, Cardinal Rule of Sampling Never sample on the dependent variable! –Example: if you are interested in studying factors that lead.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Sampling: Theory and Methods
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
1 1 Slide Chapter 7 (b) – Point Estimation and Sampling Distributions Point estimation is a form of statistical inference. Point estimation is a form of.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Lecture 14 Dustin Lueker. 2  Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample.
Shooting right Sampling methods FETP India. Competency to be gained from this lecture Select a sample from a population to generate precise and valid.
Basic Sampling & Review of Statistics. Basic Sampling What is a sample?  Selection of a subset of elements from a larger group of objects Why use a sample?
Chapter 11 – 1 Chapter 7: Sampling and Sampling Distributions Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Population and Sampling
CHAPTER 12 DETERMINING THE SAMPLE PLAN. Important Topics of This Chapter Differences between population and sample. Sampling frame and frame error. Developing.
LECTURE 3 SAMPLING THEORY EPSY 640 Texas A&M University.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
STANDARD ERROR Standard error is the standard deviation of the means of different samples of population. Standard error of the mean S.E. is a measure.
DTC Quantitative Methods Survey Research Design/Sampling (Mostly a hangover from Week 1…) Thursday 17 th January 2013.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Lohr 2.2 a) Unit 1 is included in samples 1 and 3.  1 is therefore 1/8 + 1/8 = 1/4 Unit 2 is included in samples 2 and 4.  2 is therefore 1/4 + 3/8 =
Qualitative and quantitative sampling. Who are they Black/Blue/Green/Red Thin/Bold Smiling/Normal/Sad                        
7.1Sampling Methods 7.2Introduction to Sampling Distribution 7.0 Sampling and Sampling Distribution.
Sampling Methods and Sampling Distributions
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling and Sampling Distributions.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Data Collection & Sampling Dr. Guerette. Gathering Data Three ways a researcher collects data: Three ways a researcher collects data: By asking questions.
Chapter 10 Sampling: Theories, Designs and Plans.
LIS 570 Selecting a Sample.
 When every unit of the population is examined. This is known as Census method.  On the other hand when a small group selected as representatives of.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
PRESENTED BY- MEENAL SANTANI (039) SWATI LUTHRA (054)
Sampling Design and Procedure
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Lecture 13 Dustin Lueker. 2  Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U.
Sampling Why use sampling? Terms and definitions
Chapter 7 Sampling Distributions
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
STA 291 Spring 2008 Lecture 13 Dustin Lueker.
Presentation transcript:

Formalizing the Concepts: Simple Random Sampling

Purpose of sampling To study a sample of the population to acquire knowledge –by observing the units selected typified by households, persons, institutions, or physical objects – and making quantitative statements about the entire population

Purpose of sampling  Why sampling? Saves cost compared to full enumeration Saves cost compared to full enumeration Easier to control quality of sample Easier to control quality of sample More timely results from sample data More timely results from sample data Measurement can be destructive Measurement can be destructive

Unit of analysis  An object on which a measurement is taken  Most common units of analysis are persons, households, farms, and economic establishments Some concepts used in Sampling

Target population or universe  The complete collection of all the units of analysis to study.  Examples: population living in households in a country; students in primary schools Some concepts used in Sampling

Sampling frame  List of all the units of analysis whose characteristics are to be measured  Comprehensive, non-overlapping and must not contain irrelevant elements  Should be updated to ensure complete coverage  Examples: list of establishments; census; civil registration Some concepts used in Sampling

Parameter  Quantity computed from all N values in a population set  Typically, a descriptive measure of a population, such as mean, variance Poverty rate, average income, etc. Poverty rate, average income, etc.  Objective of sampling is to estimate parameters of a population Some concepts used in Sampling

 Estimator - mathematical formula or function using sample results to produce an estimate for the entire population  Estimate - numerical quantity computed from sample observations of a characteristic and intended to provide information about an unknown population value (parameter).  Examples: mean (average), total, proportion, ratio Estimation Some concepts used in Sampling

 When the mean of individual sample estimates equals the population parameter, then the estimator is unbiased  Formally, an estimator is unbiased if the expected value of the (sample) estimates is equal to the (population) parameter being estimated Unbiased estimator Some concepts used in Sampling

Random sampling  Also known as scientific sampling or probability sampling  Each unit has a non-zero and known probability of selection  Mathematical theory is available to assess the sampling error (the error caused by observing a sample instead of the whole population).

Random sampling techniques  Single stage, equal probability sampling Simple Random Sampling (SRS) Simple Random Sampling (SRS) Systematic sampling with equal probability Systematic sampling with equal probability  Stratified sampling  Multi-stages sampling In real life those techniques are usually combined in various ways – most sampling designs are complex

Single stage, equal probability sampling  Random selection of n “units” from a population of N units, so that each unit has an equal probability of selection N (population ) → n (sample) N (population ) → n (sample) Probability of selection (sampling fraction) = f = n/N Probability of selection (sampling fraction) = f = n/N Is the most basic form of probability sampling and provides the theoretical basis for more complicated techniques Random sampling techniques

Single stage, equal probability sampling (continued) 1. Simple Random Sampling. The investigator mixes up the whole target population before grabbing “n” units. 2. Systematic Random Sampling. The N units in the population are ranked 1 to N in some order (e.g., alphabetic). To select a sample of n units, calculate the step k ( k= N/n) and take a unit at random, from the 1st k units and then take every k th unit. Random sampling techniques

 Advantage self-weighting (simplifies the calculation of estimates and variances) self-weighting (simplifies the calculation of estimates and variances)  Disadvantages Sample frame may not be available Sample frame may not be available May entail high transportation costs May entail high transportation costs Single stage, equal probability sampling (continued) Random sampling techniques

Stratified sampling  The population is divided into mutually exclusive subgroups called strata.  Then a random sample is selected from each stratum. Random sampling techniques

Two-stage sampling  Units of analysis are divided into groups called Primary Sampling Units (PSUs)  A sample of PSUs is selected first  Then a sample of units is chosen in each of the selected PSUs Random sampling techniques This technique can be generalized (multi- stage sampling)

Random sampling  Estimates obtained from random samples can be accompanied by measures of the uncertainty associated with the estimate.  The uncertainty is measured by the standard error. Confidence intervals around the estimate can be calculated taking advantage of the Central Limit Theorem.

 The central limit theorem states that given a parameter with mean μ and variance σ², the sampling distribution of the mean approaches a normal distribution with mean μ and variance σ²/n  This is true even when the distribution of the parameter is not normal.  The normal distribution is widely used. Part of its appeal is that it is well behaved and mathematically tractable. Central limit theorem

Sample variance and standard error  Variance of the sample mean of an SRS of ‘n’ units for a population of size ‘N’:  e = standard error  Measure of sampling error. Depends on 3 factors: ( 1 - n/N ) = Finite Population Correction (fpc) ( 1 - n/N ) = Finite Population Correction (fpc) n = sample size n = sample size Var(X) = Population variance. Unknown, but can be estimated without bias by: Var(X) = Population variance. Unknown, but can be estimated without bias by:

Proportions  A proportion P (or prevalence) is equal to the mean of a dummy variable.  In this case Var(P) = P(1-P), and

 It is not sufficient to simple report the sample proportion obtained by Mr Green in the sample survey, we also need to give an indication of how accurate the estimate is.  Confidence intervals are used to indicate the accuracy of an estimate.  In other words, instead of estimating the parameter of interest by a single value, an interval of likely estimates is given. Confidence intervals

Confidence intervals (continued) where: t α = 1.28 for confidence level α = 80% t α = 1.64 for confidence level α = 90% t α = 1.96 for confidence level α = 95% t α = 2.58 for confidence level α = 99%

Confidence intervals In a sample of 1,000 electors, 280 of them (28 percent) say they will vote Green. Standard error is 1.42 percent.

Confidence intervals In a sample of 1,000 electors, 280 of them (28 percent) say they will vote Green. Standard error is 1.42 percent. Standard error 95 percent confidence interval: 28 ± percent confidence interval: 28 ±

The required sample size n is determined by The variability of the parameter Var(X) The variability of the parameter Var(X) But we don’t know it!But we don’t know it! The maximum margin of error E we are willing to accept The maximum margin of error E we are willing to accept How confident we want to be in that the error of our estimation will not exceed that maximum How confident we want to be in that the error of our estimation will not exceed that maximum For each confidence level α there is a coefficient t α For each confidence level α there is a coefficient t α The size of the population The size of the population But this is not very important!But this is not very important! For a proportion