Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,

Slides:



Advertisements
Similar presentations
Sampling and Sample Size Calculation
Advertisements

for epidemiological studies
17th EPIET introductory course Lazareto, Menorca, Spain
Introduction Simple Random Sampling Stratified Random Sampling
Friday, May 7, Descriptive Research Week 8 Lecture 2.
AP Statistics C5 D2 HW: p.287 #25 – 30 Obj: to understand types of samples and possible errors Do Now: How do you think you collect data?
MISUNDERSTOOD AND MISUSED
Dr. Chris L. S. Coryn Spring 2012
Who and How And How to Mess It up
Sampling.
Sampling-big picture Want to estimate a characteristic of population (population parameter). Estimate a corresponding sample statistic Sample must be representative.
Sampling and Sample Size Determination
Why sample? Diversity in populations Practicality and cost.
Chapter 11 Sampling Design. Chapter 11 Sampling Design.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Sampling What is it? –Drawing a conclusion about the entire population from selection of limited elements in a.
11 Populations and Samples.
Formalizing the Concepts: Simple Random Sampling.
Course Content Introduction to the Research Process
Sampling Designs and Sampling Procedures
Key terms in Sampling Sample: A fraction or portion of the population of interest e.g. consumers, brands, companies, products, etc Population: All the.
Sample Design.
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
Sampling January 9, Cardinal Rule of Sampling Never sample on the dependent variable! –Example: if you are interested in studying factors that lead.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Sampling: Theory and Methods
Sampling Techniques LEARNING OBJECTIVES : After studying this module, participants will be able to : 1. Identify and define the population to be studied.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
 Collecting Quantitative  Data  By: Zainab Aidroos.
Shooting right Sampling methods FETP India. Competency to be gained from this lecture Select a sample from a population to generate precise and valid.
Basic Sampling & Review of Statistics. Basic Sampling What is a sample?  Selection of a subset of elements from a larger group of objects Why use a sample?
Chapter 11 – 1 Chapter 7: Sampling and Sampling Distributions Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions.
CHAPTER 12 DETERMINING THE SAMPLE PLAN. Important Topics of This Chapter Differences between population and sample. Sampling frame and frame error. Developing.
1 Hair, Babin, Money & Samouel, Essentials of Business Research, Wiley, Learning Objectives: 1.Understand the key principles in sampling. 2.Appreciate.
Sampling “Sampling is the process of choosing sample which is a group of people, items and objects. That are taken from population for measurement and.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
SAMPLING TECHNIQUES. Definitions Statistical inference: is a conclusion concerning a population of observations (or units) made on the bases of the results.
Lecture 4. Sampling is the process of selecting a small number of elements from a larger defined target group of elements such that the information gathered.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Business Project Nicos Rodosthenous PhD 04/11/ /11/20141Dr Nicos Rodosthenous.
Learning Objectives Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Sampling Methods, Sample Size, and Study Power
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Chapter 10 Sampling: Theories, Designs and Plans.
LIS 570 Selecting a Sample.
Ch. 11 SAMPLING. Sampling Sampling is the process of selecting a sufficient number of elements from the population.
 When every unit of the population is examined. This is known as Census method.  On the other hand when a small group selected as representatives of.
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
SAMPLING Why sample? Practical consideration – limited budget, convenience, simplicity. Generalizability –representativeness, desire to establish the broadest.
Population vs. Sample. Population: a set which includes all measurements of interest to the researcher (The collection of all responses, measurements,
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
Selecting a Sample. outline Difference between sampling in quantitative & qualitative research.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Types of method Quantitative: – Questionnaires – Experimental designs Qualitative: – Interviews – Focus groups – Observation Triangulation.
Sampling Concepts Nursing Research. Population  Population the group you are ultimately interested in knowing more about “entire aggregation of cases.
Sampling Design and Procedure
Formulation of the Research Methods A. Selecting the Appropriate Design B. Selecting the Subjects C. Selecting Measurement Methods & Techniques D. Selecting.
Graduate School of Business Leadership
Sampling.
Meeting-6 SAMPLING DESIGN
Sampling Design.
Chapter 7 Sampling Distributions
Presentation transcript:

Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait, Institut de veille sanitaire Modified: Denise Antona, EPIET 2003

Objectives of presentation Definition of sampling Why do we use samples? Concept of representativeness Main methods of sampling Sampling error Sample size calculation

Definition of sampling Procedure by which some members of a given population are selected as representatives of the entire population

Definition of sampling terms Sampling unit –Subject under observation on which information is collected Sampling fraction –Ratio between the sample size and the population size Sampling frame –Any list of all the sampling units in the population Sampling scheme –Method of selecting sampling units from sampling frame

Why do we use samples ? Get information from large populations –At minimal cost –At maximum speed –At increased accuracy –Using enhanced tools

Sampling Precision Cost

What we need to know Concepts –Representativeness –Sampling methods –Choice of the right design Calculations –Sampling error –Design effect –Sample size

Sampling and representativeness Sample Target Population Sampling Population Target Population  Sampling Population  Sample

Representativeness Person Demographic characteristics (age, sex…) Exposure/susceptibility Place (ex : u rban vs. rural) Time Seasonality Day of the week Time of the day Ensure representativeness before starting, confirm once completed !!!!!!

Types of samples Non-probability samples Probability samples

Non probability samples Quotas Sample reflects population structure Time/resources constraints Convenience samples (purposive units) Biased Best or worst scenario Probability of being chosen : unknown

Probability samples Random sampling Each subject has a known probability of being chosen Reduces possibility of selection bias Allows application of statistical theory to results

Sampling error No sample is the exact mirror image of the population Magnitude of error can be measured in probability samples Expressed by standard error –of mean, proportion, differences, etc Function of –amount of variability in measuring factor of interest –sample size

Methods used in probability samples Simple random sampling Systematic sampling Stratified sampling Multistage sampling Multiphase sampling Cluster sampling

Quality of an estimate Precision & validity No precision Random error ! Precision but no validity Systematic error (Bias) !

Simple random sampling Principle –Equal chance of drawing each unit Procedure –Number all units –Randomly draw units

Simple random sampling Advantages –Simple –Sampling error easily measured Disadvantages –Need complete list of units –Does not always achieve best representativeness –Units may be scattered

Example: evaluate the prevalence of tooth decay among the 1200 children attending a school List of children attending the school Children numerated from 1 to 1200 Sample size = 100 children Random sampling of 100 numbers between 1 and 1200 How to randomly select? Simple random sampling

Table of random numbers

EPITABLE: random number listing

Systematic sampling N = 1200, and n = 60  sampling fraction = 1200/60 = 20 List persons from 1 to 1200 Randomly select a number between 1 and 20 (ex : 8)  1 st person selected = the 8 th on the list  2 nd person = = the 28 th etc.....

Systematic sampling

……..

Systematic sampling

Stratified sampling Principle : –Classify population into internally homogeneous subgroups (strata) –Draw sample in each strata –Combine results of all strata

Stratified sampling Advantages –More precise if variable associated with strata –All subgroups represented, allowing separate conclusions about each of them Disadvantages –Sampling error difficult to measure –Loss of precision if very small numbers sampled in individual strata

Example: Stratified sampling Determine vaccination coverage in a country One sample drawn in each region Estimates calculated for each stratum Each stratum weighted to obtain estimate for country (average)

Multiple stage sampling Principle = consecutive samplings example : sampling unit = household –1 rst stage : drawing areas or blocks –2 nd stage : drawing buildings, houses –3 rd stage : drawing households

Cluster sampling Principle –Random sample of groups (“clusters”) of units –In selected clusters, all units or proportion (sample) of units included

Example: Cluster sampling Section 4 Section 5 Section 3 Section 2Section 1

Cluster sampling Advantages –Simple as complete list of sampling units within population not required –Less travel/resources required Disadvantages –Imprecise if clusters homogeneous and therefore sample variation greater than population variation (large design effect) –Sampling error difficult to measure

EPI cluster sampling To evaluate vaccination coverage: Without list of persons Total population of villages Randomly choose 30 clusters 30 cluster of 7 children each= 210 children

Drawing the clusters You need : –Map of the region –Distribution of population (by villages or area) –Age distribution (population m :3%) ABCDEFGHIJABCDEFGHIJ 12-23Pop.Village

Distribution of the clusters ABCDEFGHIJABCDEFGHIJ Total population = 9820 Compute cumulated population

Distribution of the clusters Then compute sampling fraction : K= = 327 Draw a random number (between 1 and 327) Example: 62 Start from the village including “62” and draw the clusters adding the sampling fraction ABCDEFGHIJABCDEFGHIJ I I I I I I I I I I I I I I I I I I

Drawing households and children On the spot Go to the center of the village, choose direction (random) Number the houses in this direction  Ex: 21 Draw random number (between 1 and 21) to identify the first house to visit From this house progress until finding the 7 children ( itinerary rules fixed beforehand)

Design effect Global variance p(1-p) Var srs = n Cluster variance p= global proportion pi= proportion in each stratum n= number of subjects k= number of strata Σ (pi-p)² Var clus = k(k-1) Design effect = Var srs Var clust srs= simple random sampling

EPITABLE: Calculating design effect

Selecting a sampling method Population to be studied –Size/geographical distribution –Heterogeneity with respect to variable Level of precision required Resources available Importance of having a precise estimate of the sampling error

Steps in estimating sample size Identify major study variable Determine type of estimate (%, mean, ratio,...) Indicate expected frequency of factor of interest Decide on desired precision of the estimate Decide on acceptable risk that estimate will fall outside its real population value Adjust for estimated design effect Adjust for expected response rate (Adjust for population size? In case of small size population only)

Sample size formula in descriptive survey z: alpha risk express in z-score p: expected prevalence q: 1 - p d: absolute precision g: design effect z² * p * q 1.96²*0.15*0.85 n = = 544 d²0.03² Cluster sampling z² * p * q 2*1.96²*0.15*0.85 n = g* = 1088 d² 0.03² Simple random / systematic sampling

EPITABLE: cluster sample size calculation

Place of sampling in descriptive surveys Define objectives Define resources available Identify study population Identify variables to study Define precision required Establish plan of analysis (questionnaire) Create sampling frame Select sample Pilot data collection Collect data Analyse data Communicate results Use results

Conclusions Probability samples are the best Beware of … –refusals –absentees –“do not know”

Conclusions If in doubt… Call a statistician !!!!