Basic Sampling Concepts

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

1 Session 10 Sampling Weights: an appreciation. 2 To provide you with an overview of the role of sampling weights in estimating population parameters.
SADC Course in Statistics Estimating population characteristics with simple random sampling (Session 06)
Data collection for epidemiological statistics
Overview of Sampling Methods II
SADC Course in Statistics Further ideas concerning confidence intervals (Session 06)
Assumptions underlying regression analysis
Estimation in Stratified Random Sampling
SADC Course in Statistics Basic principles of hypothesis tests (Session 08)
SADC Course in Statistics (Session 20)
SADC Course in Statistics The binomial distribution (Session 06)
SADC Course in Statistics Sampling weights: an appreciation (Sessions 19)
SADC Course in Statistics Samples and Populations (Session 02)
SADC Course in Statistics Sample size determinations (Session 11)
SADC Course in Statistics Sampling design using the Paddy game (Sessions 15&16)
SADC Course in Statistics Multi-stage sampling (Sessions 13&14)
SADC Course in Statistics Assessing data critically Module B1 Session 17.
SADC Course in Statistics Session 4 & 5 Producing Good Tables.
SADC Course in Statistics Graphical summaries for quantitative data Module I3: Sessions 2 and 3.
SADC Course in Statistics Choosing appropriate methods for data collection.
SADC Course in Statistics Revision using CAST (Session 04)
SADC Course in Statistics Introduction to Statistical Inference (Session 03)
SADC Course in Statistics (Session 09)
Preparing & presenting demographic information: 1
SADC Course in Statistics Hierarchies of Units and non-traditional sampling approaches (Session 18)
SADC Course in Statistics Appreciating General Forms of Longitudinal Data (Session 06)
SADC Course in Statistics Overview of Sampling Methods I (Session 03)
SADC Course in Statistics General approaches to sample size determinations (Session 12)
SADC Course in Statistics To the Woods discussion (Sessions 10)
Objectives and data needs
SADC Course in Statistics Reporting on the web site Module I4, Sessions 14 and 15.
SADC Course in Statistics Setting the scene (Session 01)
1 Table design Module 3 Session 2. 2 Objectives of this session By the end of this session, you will be able to: appreciate the different type of objectives.
SADC Course in Statistics Comparing Means from Paired Samples (Session 13)
SADC Course in Statistics Revision on tests for proportions using CAST (Session 18)
SADC Course in Statistics Joint distributions (Session 05)
SADC Course in Statistics Laws of Probability (Session 02)
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Chapter Five Data Collection and Sampling.
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 10: Sampling and Sampling Distributions
SADC Course in Statistics Introduction and Study Objectives (Session 01)
© 2002 Prentice-Hall, Inc.Chap 1-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 1 Introduction and Data Collection.
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
Basic Business Statistics (8th Edition)
Chapter 4 Selecting a Sample Gay, Mills, and Airasian
Sampling Designs and Sampling Procedures
Sampling Theory and Surveys GV917. Introduction to Sampling In statistics the population refers to the total universe of objects being studied. Examples.
Sample Design.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Chapter 5 Selecting a Sample Gay, Mills, and Airasian 10th Edition
WHO ARE KENYANS PREPARED TO VOTE FOR AS OF NOW? SPEC BAROMETER RESULTS November 9th 2007.
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
Chapter 7: Sampling and Sampling Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
Basic Business Statistics
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Survey Training Pack Session 7 – Basic Principles of Sampling.
Chapter 7 Sampling Distributions
Basic Sampling Concepts
Presentation transcript:

Basic Sampling Concepts SADC Course in Statistics Basic Sampling Concepts (Session 02)

Learning Objectives By the end of this session, you will be able to describe what is meant by sample, target population, sampled (study) population, sampling frame, sampling units explain what is meant by a representative sample discuss the importance of sampling to ensure survey results are generalisable – use of probability based samples describe key issues central to the sampling process

Population The population is the entire group of all entities about which we want information, e.g. All children below 16 years of age ~ to assess their nutritional levels All farmers growing maize ~ to learn about total maize production in the country All schools in a country ~ to learn about educational achievements Suppose you want to estimate the proportion of adults in rural areas with no primary education. What do you think is your population?

More on Populations Population sizes may be finite or infinite. We will allow the possibility of finite populations in this module. Describing the population clearly is necessary before sampling is considered. Why do you think this is so? Need also to distinguish between the target population, as defined above, and the study (or sampled) population. Latter excludes the inaccessible members of the target group e.g. temporary international migrant workers.

Samples A sample is a subset of units drawn from the population. This module is largely on sampling procedures. Sampling units refer to the entities on which measurements are made during a survey During planning (and implementation) of a survey, check the extent to which the target population can be (or has been) covered. Recognise that survey conclusions apply only to the population (study popn) which has a chance of inclusion in the sample.

Issues related to sampling units Part of the planning of the sampling process includes: Identifying the sampling unit. Is it a village, household, farm, school, etc? Recognising occurrence of units at different levels. In most surveys, sampling is done at different levels, e.g. district, then village, then household, then household member. Need to be specific about the selection of sampling units and sampling effort at each stage and specify this in the sampling protocol.

More on sampling units Observing units over time Unequal sized units In longitudinal surveys, or monitoring and evaluation surveys, should the same units be chosen for measurement each year? Unequal sized units Are all units, whose measurements are being summarised, of the same size? If not, e.g. with business surveys, should measurements be weighted according to size? Number of units Is the sample large enough to enable an analysis that addresses the study objectives?

Sampling frames The sampling frame is a list of all sampling units, for example list of villages in a region list of households within a village list of schools in the district Why do we need a sampling frame? Tool essential for objectively selecting a sample of units from the population of all units Different frames are needed if sampling at different hierarchical levels May develop relevant sub-frames at each stage as sampling progresses down the hierarchy

Representativeness The primary aim in a survey is to make inferences about the target population For this, need to ensure population is well represented in the sample Bring in qualitative aspects of the population to ensure there is adequate representation of divisions of the population, e.g. rural/urban, different wealth categories, geographical coverage, etc May be achieved by sampling from these different sub-groups so that all necessary factors likely to influence survey results are represented In doing so, ensure selection is such that variability of sampling units within each sub-division is captured, i.e. not just 1 unit per group.

Non-representativeness? Example: in a survey of Kenya, urban Nairobi is represented by a random sample of size 10, which, by chance, includes eight women, five being office workers and four of those employed in Ministry offices. Both the two men, by chance, are unemployed. Example: a random sample of two areas in Malawi happened to come up with urban Mzuzu and urban Zomba [2 of the 6 largest towns] In both cases, these (honest) random samples were of very small size, and by chance produced quite untypical results.

Generalisability Also key to survey success is to ensure the sampling is such that survey results can be generalised to the study population. Generalisability requires taking probability-based samples during the sampling process. Probability sampling is the general term for methods where sample selection is objectively-based on known chances of inclusion in the sample. If the probabilities are known and non-zero, they don’t have to be equal: corrections can be made at the data analysis stage.

Non-generalisability Example – a researcher selected her sample from a district, including one farm each with all possible combinations of (i) child-/female-/male-headed; (ii) Christian/Muslim; (iii) dambo/hillside land; (iv) purely vegetable-/mixed/purely maize-growing Some cases very hard to find e.g. there were few Muslim, child-headed, purely maize dambo farms. Sample of 36 does not relate to population proportions: not enough cases to be able to represent the commoner combinations Researcher’s selections not objective/random and cases only came from one smallish area, maybe untypical. Cannot generalise to country!

Summary of issues to consider when sampling Have the objectives been clearly specified? Has the target population been clearly defined and the possibility of survey results being applicable to a different ‘study’ population been recognised? What is the geographical coverage? What factors are likely to influence survey results – have they been considered in the sampling? What should be the sampling unit(s) for fieldwork?

Summary of issues to consider when sampling - continued Will the sample results lead to generalisable conclusions? Will the proposed sampling plan be possible within time and budget limitations? Is the sampling procedure practically feasible? Will the adopted sampling scheme provide results that address survey objectives with appropriate measures of precision? Developing a good sampling plan requires time and effort! Full plan should be well documented with full justification!

References Pettersson H. (2004) “Design of master sampling frames and master samples for household surveys in developing countries”. Chapter V of the UN Publication An Analysis of Operating Characteristics of Household Surveys in Developing and Transition Countries: Survey Costs, Design Effects and Non-Sampling Errors. Pp.71-94. Available at http://unstats.un.org/unsd/hhsurveys/index.htm De Vaus, A.D. (2001) “Research Design in Social Research”. Sage Publications, London - for discussion on generalisability Bechhofer, F. & Paterson, L. (2000) “Principles of research design in the social sciences”. Routledge, London – for a discussion on representativeness

Some practical work follows …