Session 1: Introduction to Complex Survey Design

Slides:



Advertisements
Similar presentations
Basic Sampling Theory for Simple and Cluster Samples
Advertisements

SADC Course in Statistics Estimating population characteristics with simple random sampling (Session 06)
Introduction Simple Random Sampling Stratified Random Sampling
Formalizing the Concepts: DESIGN EFFECTS. Cluster effect where: ρ = intraclass correlation coefficient – measure of homogeneity within a cluster m = number.
Statistical Sampling.
Chapter 6 Introduction to Inferential Statistics
Estimating a Population Variance
Estimates and sampling errors for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Sampling: Final and Initial Sample Size Determination
Chapter 7 Introduction to Sampling Distributions
QBM117 Business Statistics Statistical Inference Sampling 1.
Why sample? Diversity in populations Practicality and cost.
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Sampling What is it? –Drawing a conclusion about the entire population from selection of limited elements in a.
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
Chapter 10: Estimating with Confidence
Understanding sample survey data
Sampling Designs Avery and Burkhart, Chapter 3 Source: J. Hollenbeck.
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
Standard error of estimate & Confidence interval.
Chapter 11 Simple Regression
Chapter 7 Estimation: Single Population
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Introduction to Statistical Inference Chapter 11 Announcement: Read chapter 12 to page 299.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Lecture 3 Forestry 3218 Avery and Burkhart, Chapter 3 Shiver and Borders, Chapter 2 Forest Mensuration II Lecture 3 Elementary Sampling Methods: Selective,
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
Basic Sampling & Review of Statistics. Basic Sampling What is a sample?  Selection of a subset of elements from a larger group of objects Why use a sample?
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Two THE DESIGN OF RESEARCH.
Highlighting Standards for Mathematical Practice in an Intro Stats Course by Fides Ushe Springfield College, MA.
AP STATISTICS LESSON 10 – 1 ( DAY 3 ) CHOOSING THE SAMPLE SIZE.
DTC Quantitative Methods Survey Research Design/Sampling (Mostly a hangover from Week 1…) Thursday 17 th January 2013.
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
1 Systematic Sampling (SYS) Up to now, we have only considered one design: SRS of size n from a population of size N New design: SYS DEFN: A 1-in-k systematic.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Neuman and Robson Ch. 7 Qualitative and Quantitative Sampling.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Chapter 7 Statistical Inference: Estimating a Population Mean.
Example Random samples of size n =2 are drawn from a finite population that consists of the numbers 2, 4, 6 and 8 without replacement. a-) Calculate the.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
SAMPLE DESIGN: HOW MANY WILL BE IN THE SAMPLE—SAMPLE SIZE ADJUSTMENTS?
Survey and Sampling Methods Session 9. Introduction Nonprobability Sampling and Bias Stratified Random Sampling Cluster Sampling Systematic Sampling Nonresponse.
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
INFO 271B LECTURE 9 COYE CHESHIRE Sampling. Agenda Info 271B 2 Non-probability Sampling Probability Sampling Probability Distributions.
Measuring change in sample survey data. Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different.
Point Estimates. Remember….. Population  It is the set of all objects being studied Sample  It is a subset of the population.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Random Sampling Error and Sample Size are Related
Example Random samples of size n =2 are drawn from a finite population that consists of the numbers 2, 4, 6 and 8 without replacement. a-) Calculate the.
Advanced Quantitative Techniques
ESTIMATION.
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Introduction to Sampling Distributions
Developing the Sampling Plan
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Estimation of Sampling Errors, CV, Confidence Intervals
Calculating Probabilities for Any Normal Variable
Sample vs Population (true mean) (sample mean) (sample variance)
Chapter 7 Lecture 3 Section: 7.5.
Presentation transcript:

Session 1: Introduction to Complex Survey Design Anthony Rafferty University of Manchester anthony.rafferty@manchester.ac.uk

Overview of Day Session 1: Basic Concepts Session 2: Implications of Complex design for Estimation (design effects) and model and design-based approaches (MCS) Workshop 1: Clustering Workshop 2: Stratification Workshop 3: Further Topics Basic concept: revise some basic ideas in statistics in particular around estimation theory that are important to understanding complex Sampling Then look at issues surroudning clustering and stratification of samples Second session ian plewis go on to discuss thfe implications of complex design and model based and design based approaches Millenium cohort study In the afternoon focus of the day is how to in STata Further topics: secondary sampling units, brief comparison of model and design based approaches, and of different methods of estimation

Random sampling A simple random sample without replacement (SRS) Every potential subset of n units has an equal probability of being selected as the sample. To ensure representative sample and guard against selection bias. Systematic sample to proxy SRS. Random start point and fixed interval across sampling frame Sampling Frame: Postal Address File (PAF); Land and Property Services Agency’s (LPSA) (Northern Ireland) -before we go on to consider complex design some revision- 20 names took a random sample from name list; by doing something similar to pulling names from hat; but selection probability independent of whoever selected before. -for national surveys bit more difficult. For a start need a bigger hat. So systematic -put names on participants list in random order then work out interval -from examples see need sampling frame (here list) uniquely identify population elements - In national surveys no lists of people, but we can sample address uses postal code address file

Use of Samples Samples are used to estimate population parameters (e.g. mean height, income). Simple random sampling allows us to use standard methods for point and variance estimates (standard Stata commands assume SRS) However, ESDS Government surveys frequently include ‘complex’ sample designs features (e.g. clustering & stratification). Why sample: expensive to interview everybody+ unnecessary. Take a sample to estimate a population estimate. Main focus of today is how complex survey design effect variance estimation and how to implement using stata

Precision & Bias of Estimates Sampling error refers to the variation in estimates calculated from different samples Variance; Standard Errors; Confidence Intervals Smaller confidence interval=more precise estimate Stratification and clustering influence ‘correct’ calculation: Design-based; model-based approaches Bias: Difference between estimator's value based on sample and the true value of the population parameter being estimated. An estimate is said to be biased if the centre of its distribution does not coincide with the population value. Important aspect of SRS concerns estimation of sampling error If we took different surveys get different results; Use variance in our survey; how much people on average differ from the mean to estimate sample variance. And in turn use standard errors to estimate confidence intervals :were we resampling we are confident that parameter would Be in intervals 95 % of the time. 1.96 X S.E

Precise Imprecise Bulls eye is true population parameter dots are different estimate produced from taking successive samples

Important Factors affecting precision/ sampling error Sample size Sample design (stratification and clustering) Key message: standard statistical procedures for estimating variances and so forth assume SRS. Complex sample design like clustering and stratification makes our sample deviate from SRS assumptions. Therefore have to ajust our method and calculations to account for this .

Design Effect Measure of effects of clustering and stratification on standard errors/ confidence intervals The design effect (Deff) is the relative size of the design based variance to the Simple random sample variance: = Var (.)design/ Var (.) srs The square root of the design effect gives the design factor (deft). This puts things back into the scale of the standard errors Way we consider effect of complex sample design is on how it differs compared to SRS . In a simple random sample, the confidence interval is defined as the mean +/- 1.96 of the standard error. For surveys with complex design features such as clustering and stratification, these figures need to be multipled by the design factor (Deft) which takes into account the effect of such design features on sampling error.

Interpretation We will consider how to incorporate complex design and calculate design effects Interpretation of Deft Deft = 1: No Effect of sample design on standard error Deft>1: Sample design inflates the standard error of the estimate Deft<1: Sample design increases efficiency (reduces s.e.) of estimate.

Clustering Rather than taking a random sample of the overall population, breaking down of a population first into a higher level characteristic or unit than the population elements, such as by geographical unit. A random sample of these units is randomly selected then population elements are drawn from the selected units. (Lohr, p 24.) ok so we have talked so far about clustering and strat without defining what it is Important to have some basic clue about sampling and design effects to understand why people stratify or cluster there survey

Why Cluster? Geographically concentrating interviews is cost-effective On the downside: ‘positive’ effect on Deft: Increases standard error relative to equivalent simple random sample In a simple random sample, each person in a population would have an equal selection probability, whereas in a clustered sample, the selection probability is influenced by the clustering (e.g. where postal code sectors are PSUs, people in the population who are not in the selected postcode sectors do not have a chance of being selected at the next stage once the sample of post codes is obtained). One consequence of this is that clustering samples tends to increase the standard error of survey estimates relative to a simple random sample, increasing the design effects. For example, clustering might lead to postcode sectors being selected that are very different from the population in general. The size of the increase of the design effect is dependant on: i) the sample size per cluster, and ii) the homogeneity of the clusters. As cluster sizes increase, standard errors tend to decrease. The homogeneity of the cluster is measured by the inter-cluster correlation coefficient (ICC, or rho). If individuals in a cluster have more in common than with people in general then the rho will be bigger than zero. If all indivudal within a cluster are identical yet there is some between cluster variation, then r will be equal to 1.? (handouts). As rho increase so does standard errors. If people are all alike in a cluster, but different from people in other clusters, then there is a greater risk of drawing a sample that is different from the overall population.

One-stage cluster sample Random sample of a population unit is selected, such as postal code sectors, then all population elements (e.g. individuals) in the sample of clusters are taken as the element level sample. All elements in selected PSU units included in sample

Survey of income individual level in london

Primary Sampling Units (PSUs) Current example post code districts are PSUs. We select 8 PSUS then interview everybody in them In a simple random sample, each person in a population would have an equal selection probability, whereas in a clustered sample, the selection probability is influenced by the clustering (e.g. where postal code sectors are PSUs, people in the population who are not in the selected postcode sectors do not have a chance of being selected at the next stage once the sample of post codes is obtained). One consequence of this is that clustering samples tends to increase the standard error of survey estimates relative to a simple random sample, increasing the design effects. For example, clustering might lead to postcode sectors being selected that are very different from the population in general. The size of the increase of the design effect is dependant on: i) the sample size per cluster, and ii) the homogeneity of the clusters. As cluster sizes increase, standard errors tend to decrease. The homogeneity of the cluster is measured by the inter-cluster correlation coefficient (ICC, or rho). If individuals in a cluster have more in common than with people in general then the rho will be bigger than zero. If all indivudal within a cluster are identical yet there is some between cluster variation, then r will be equal to 1.? (handouts). As rho increase so does standard errors. If people are all alike in a cluster, but different from people in other clusters, then there is a greater risk of drawing a sample that is different from the overall population.

Know your postcode! M13 9PL M13 is post code district M13 9 is postcode sector (used for PSUs) M13 9 PL is delivery point (used to identify addresses). Map example higher geography for PSUs then usually used.

Two-stage Example Primary Sampling Unit (PSU) Secondary Sampling Unit (SSU) At the first stage, a random sample of PSUs (e.g. postcode sectors) is selected. In the second stage, a random sample of SSUs (e.g.households) is selected within postal code sectors. Individuals (‘population elements’) are then selected within households. In this design, adults are clustered in households, and households are clustered in postcode sectors.

This idea is illustrated in Figure, which shows how a 25% two-stage sample is selected by first selecting 5 out of 10 PSUs (PAFSs), and then selecting 5 out 10 households within each of the selected PSUs

Stratification In stratified sampling, the population is partitioned into groups (strata) based on variables, such as regions, or car ownership (yes/no), and a sample is selected by some design (e.g. randomly) within each strata. Prior knowledge about some characteristics of sampling units in the population is thus used to control the number of units sampled within each stratum

Stratification Implicit/ explicit Proportionate/disproportionate Explicit: group population by strata then random sample in different stratum Implicit: Sort sampling frame by stratification variable(s) and take random starting point then fixed intervals Proportionate/disproportionate Disproportionate: Ethnicity booster samples; Child booster samples Weighting adjusts for disproportionate sample

Take 50 per cent random sample Sample frame of 4 M M F F Take 50 per cent random sample Could end up with two men or two women We know sample frame is 50/50 male/female

So stratify (explicity) M M F F Separate men and women…

M M F F

Then randomly select 1 M 1 F M M F F SAMPLE

Sample: M F

Or implicit stratification Order sample frame in a line: women, then men (but order of two men and two women is random): F F M M

implicit stratification Starting point random (as women order random) F F M M

implicit stratification Work out interval to achieve 50 per cent sample across sampling frame F F M M

implicit stratification Sample F M

Why Stratify? Enforce representativeness Small efficiency gain (proportionate stratification). i.e. reduces standard errors/negative effect on Deft. Efficiency gain in disproportionate stratification depends on level of variance in disproportionate stratum. +/-

Problem with sample?

Explicit stratum

Explicit stratum PSUs

Common Stratifiers Government Office Region; Local Authority Car ownership; % according to UK census in non-manual occupations Level of disadvantage Can combine different strata variables and implicit and explicit stratification

Post-stratification Weighting method that adjusts for any differences between the survey data and the population in terms of a few key population variables (e.g. age and sex). The aim is to reduce any bias due to sampling error and non-response effects.

Example: Health Survey of England 2007 The sample for the HSE was drawn in two stages. At the first stage a random sample of primary sampling units (PSUs), based on postcode sectors, was selected. Within each selected PSU, a random sample of postal addresses (known as delivery points) was then drawn. The list of PSUs in England was ordered by local authority and, within each local authority, by the percentage of households in the 2001 Census with a head of household in a non-manual occupation (NS-SEC groups 1-3).

HSE07 The sample of PSUs was then selected by sampling from the list at fixed intervals from a random starting point. 900 PSUs were selected with probability proportional to the total number of addresses within them. Once selected the 900 PSUs were randomly allocated to one of two groups; 720 PSUs were allocated to the core sample and 180 PSUs were allocated to the additional child boost sample. The core PSUs contained sampled addresses for both the core and child boost sample, the additional child boost PSUs only.

What is the design? One-stage/two stage? Disproportionate/proportionate stratification? Implicit/explicit stratification?

Acknowledgements Thanks to Leen Van de Casteele (CCSR/Social Statistics) for guidance and help on topics, and lecture materials from Dep. of Social Statistics (Manchester) Masters course module on Advanced Survey Methods. http://www.socialsciences.manchester.ac.uk/disciplines/socialstatistics/index.html

Appendix: Samples Simple random sample with replacement (SRSWR), one unit is randomly selected from the population then put back into the population and a second unit is drawn. The procedure is repeated until the desired n units are obtained, and due to replacement there may be duplicates in the sample (i.e. the same unit may be used twice).

ge cluster_b = cluster list cluster_b area if cluster_b == 66 list cluster_b area if cluster_b == 137 replace cluster_b = 65 in 5438 replace cluster_b = 65 in 11007/11030 replace cluster_b = 136 in 11765/11768 replace cluster_b = 136 in 12637/12644

Workshop 1: Clustering Declaring the complex sample design features of you survey to Stata using the svyset command. Primary sampling units and weights Summary statistics and cross-tabulations Conducting sub-population analysis correctly. Basic modelling and estimating design effects using svyset: Effects of clustering on standard errors and on the statistical significance of findings.

Intra-cluster correlation The homogeneity of the cluster is measured by the intra-cluster correlation coefficient (ICC, or rho). If individuals in a cluster have more in common than with people in general then the rho will be bigger than zero. As rho increase so does standard errors. If people are all alike in a cluster, but different from people in other clusters, then there is a greater risk of drawing a sample that is different from the overall population. Rho is used to calculate design effects

Workshop 2: Stratification The effects of stratification on standard errors. How to inspect and, if necessary, prepare a stratification variable for inclusion in your analysis using the svydes command.

Workshop 3: Further Topics Incorporating (or ignoring) multi-stage design (e.g. secondary sampling unit features) into your analysis and the ‘ultimate cluster method’. Comparing linearization and replicate methods. Brief comparison of model-based and design-based approaches We are using the 06 HSE data.

SSU Finite population correction (fpc) (1-n/N) used because in small samples, the greater our sample size the more information we have about the population the smaller the variances. For samples taken from extremely large populations, the fpc will approximate to 1, whereas other samples, the size of the sample determine the precision of the estimator (sse Lohr, pg. 33).

Linearization A linear estimator is a linear function of the sample observations. In simple random samples many estimators are linear estimators where the sample size n is fixed. However, in cluster sampling, situations arise where the sample size cannot be assumed fixed across different clusters, for example, in one-stage clustering where the sizes of clusters Bi vary. (see Lehtonen and Pahkinen, 1994, pg 138.). The taylor series linearization method provides a linear approximation of the non-linear estimators, to which linear estimator formulas for estimating variances are applied.

Replicate Methods In replication approaches, a defined number (K) of subsets (replicate samples) of the full sample are selected, and the estimation procedure is repeated for each subsample and the variance calculated. Information on the sum of these subset samples is used to estimate variances. Statistics of interest are calculated for each subsample (‘replicate statistics’) and then the variability between these subsample replicate statistics are used to estimate the variance of the full sample statistic.

Replicate Methods Because the formulation of the probability samples is based upon the complex design, unbiased, design corrected variance estimates can be derived.

Replicate Methods Replication techniques include ‘Balanced Repeated Replication (BRR)’ (designed for where there are only two sampling units pre strata) and ‘Jackknife’. In the Jackknife technique, the replicate samples are obtained by successively omitting one sampling unit fro each stratum and performing estimates Bootstrap is also used (currently not built in to svyset command suite).

Jackknife In unstratified surveys one PSU at a time is deleted and the others reweighted to keep the same total weight (known as the JK1 jackknife). For stratified designs the jackknife removes one PSU at a time, but reweights only the other PSUs in the same stratum. Jackknife can be used where there are more than two PSUs per stratum.

Wrapping up (1) PSU arguably (in most cases) most important aspect of complex sample design to take into account- don’t forget weighting though on disproportionately stratified samples. Many Government surveys deposited without PSU. No PSU? Contact ESDS Government

Model vs design-based approaches model based more efficient (smaller standard errors) however assumptions on population structure, wrong model, wrong results stratification is included as covariate, however we may be interested in an estimate unadjusted by the stratification variable(s) design based arguably is easier for the inexperienced although there may be circumstances where your sampling units are of substantive interest

Textbooks & Resources ESRC Research Methods Programme Resource: http://www2.napier.ac.uk/depts/fhls/peas/index.htm Lohr, S. (1999) “Sampling Design and Analysis”, Pacific Grove: Duxbury. Lehtonen, R. and Pahkinen, E. (1994) “Practical Methods for the Design and Analysis of Complex Surveys, New York, John Wiley.

The important last slide… Helpdesk: govsurveys@esds.ac.uk (0161)275 1980 Website: http://www.esds.ac.uk/government Jisc list: esds-govsurveys@jiscmail.ac.uk