The Roots of Total Survey Design Lars Lyberg Stockholm University QMMS Seminar Leinsweiler, Nov 7-9, 2010.

Slides:



Advertisements
Similar presentations
Paul Smith Office for National Statistics
Advertisements

SADC Course in Statistics General approaches to sample size determinations (Session 12)
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
STATISTICS FOR MANAGERS LECTURE 2: SURVEY DESIGN.
Copyright EM LYON Par accord du CFC Cession et reproduction interdites Research in Entrepreneurship- The problem of unobserved heterogeneity Frédéric Delmar.
Introduction to Research Methodology
Who and How And How to Mess It up
Sample size computations Petter Mostad
Determining the Size of
Sampling.
Chapter 7 Multicollinearity. What is in this Chapter? In Chapter 4 we stated that one of the assumptions in the basic regression model is that the explanatory.
Documentation and survey quality. Introduction.
ISSUES RELATED TO SAMPLING Why Sample? Probability vs. Non-Probability Samples Population of Interest Sampling Frame.
11 Populations and Samples.
Chapter 9 Multicollinearity
Sampling Designs and Techniques
Course Content Introduction to the Research Process
BA 427 – Assurance and Attestation Services
DQOs and the Development of MQOs Carl V. Gogolak USDOE Environmental Measurements Lab.
Determining the Size of
GS/PPAL Section N Research Methods and Information Systems A QUANTITATIVE RESEARCH PROJECT - (1)DATA COLLECTION (2)DATA DESCRIPTION (3)DATA ANALYSIS.
Copyright © 2007 Pearson Education Canada 1 Chapter 12: Audit Sampling Concepts.
Key terms in Sampling Sample: A fraction or portion of the population of interest e.g. consumers, brands, companies, products, etc Population: All the.
Optimal Adaptive Survey Design Lars Lyberg, Frauke Kreuter, and James Wagner ITSEW 2010 Stowe, VT, USA, June 16.
Chapter 4 Principles of Quantitative Research. Answering Questions  Quantitative Research attempts to answer questions by ascribing importance (significance)
Arun Srivastava. Types of Non-sampling Errors Specification errors, Coverage errors, Measurement or response errors, Non-response errors and Processing.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Chapter 24 Survey Methods and Sampling Techniques
Determining Sample Size
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
Portfolio Management Lecture: 26 Course Code: MBF702.
Science What is “Safety” Freedom from danger Safety is the condition of being protected against failure, breakage, error, accidents, or harm. (Protection.
Volunteer Angler Data Collection and Methods of Inference Kristen Olson University of Nebraska-Lincoln February 2,
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Charteredaccountants.com.au/training Fundamentals of Auditing in 2007 Chartered Accountants Audit Conference ASA 530 – Audit Sampling and Other Means of.
Crop area estimates with area frames in the presence of measurement errors Elisabetta Carfagna University of Bologna Department.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Chapter Twelve Census: Population canvass - not really a “sample” Asking the entire population Budget Available: A valid factor – how much can we.
Research in Business. Introduction to Research Research is simply the process of finding solution to a problem after a thorough study and analysis of.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Jeroen Pannekoek - Statistics Netherlands Work Session on Statistical Data Editing Oslo, Norway, 24 September 2012 Topic (I) Selective and macro editing.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Audit Sampling: An Overview and Application to Tests of Controls
Gile Sampling1 Sampling. Fundamental principles. Daniel Gile
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Understanding Sampling
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
Paul P. Biemer RTI International Lars E. Lyberg Statistics Sweden I ntroduction to S urvey Q uality.
Part III – Gathering Data
Auditing: The Art and Science of Assurance Engagements Chapter 13: Audit Sampling Concepts Copyright © 2011 Pearson Canada Inc.
Engineering Economic Analysis - 9th Edition Newnan/Eschenbach/Lavelle Copyright 2004 by Oxford University Press, Inc.1 Engineering Economic Analysis 9th.
Practical Survey Design Strategies for Minimizing MSE Lars Lyberg and Bo Sundgren Statistics Sweden
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
Organization of statistical investigation. Medical Statistics Commonly the word statistics means the arranging of data into charts, tables, and graphs.
Chapter 3 Surveys and Sampling © 2010 Pearson Education 1.
Portfolio Management Unit – III Session No. 19 Topic: Capital Market Expectations Unit – III Session No. 19 Topic: Capital Market Expectations.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
Slide 7.1 Saunders, Lewis and Thornhill, Research Methods for Business Students, 5 th Edition, © Mark Saunders, Philip Lewis and Adrian Thornhill 2009.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
RESEARCH METHODS Lecture 28. TYPES OF PROBABILITY SAMPLING Requires more work than nonrandom sampling. Researcher must identify sampling elements. Necessary.
Sampling Design and Procedure
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Lecture 5.  It is done to ensure the questions asked would generate the data that would answer the research questions n research objectives  The respondents.
Sampling.
Professor S K Dubey,VSM Amity School of Business
The Golden Age of Survey Research
Chapter 12 Power Analysis.
Research Problem: The research problem starts with clearly identifying the problem you want to study and considering what possible methods will affect.
Statistical Thinking and Applications
Presentation transcript:

The Roots of Total Survey Design Lars Lyberg Stockholm University QMMS Seminar Leinsweiler, Nov 7-9, 2010

Early thinkers Hansen and colleagues, U.S. Bureau of the Census Deming, U.S. Bureau of the Census and consultant Kish, University of Michigan Dalenius, Statistics Sweden and Stockholm University

What were they thinking about? Nonsampling errors Balancing errors and costs Design criteria The limitations of sampling theory Standards Similarities between survey implementation and the assembly line

4 Deming (1944) On Errors in Surveys American Sociological Review! First listing of sources of problems, beyond sampling, facing surveys The 13 factors

Demings 13 factors The 13 factors that affect the usefulness of a survey -To point out the need for directing effort toward all of them in the planning process with a view to usefulness and funds available -To point out the futility of concentrating on only one or two of them -To point out the need for theories of bias and variability that correlate accumulated experience

6

Their difficult position They had to promote Neymans theory But his theory basically assumes very small nonsampling errors They were in a first-things-first situation They promoted vigorous controls hopefully leading to small biases They discussed what a Bayesian approach might offer

Lines of thought I There is as yet no universally accepted survey design formula that provides a solution to the design problem (Dalenius 1967) Thats why textbooks devote little space to design Important to control specific error sources

Lines of thought II The U.S. Bureau of the Census is a statistical factory. The main product is statistical tables (Deming and Geoffrey 1941) Concentration on QC of error sources, evaluation, and survey models Disentangling the design process

Lines of thought III Hansen-Hurwitz-Pritzker 1967 Take all error sources into account Minimize all biases and select a minimum-variance scheme so that Var becomes an approximation of (a decent) MSE The zero defects movement that later became Six Sigma Dalenius 1969 Total survey design

The design process Criterion of effectiveness: Minimum MSE per unit of cost while meeting other requirements such as timeliness of results (not just minimum variance) Good survey design calls for reasonably effective control of the accuracy through appropriate specifications for survey procedures and adequate control of the operations, i.e. proper design of the total system

Mean squared error (MSE) MSE=Var+B 2 +(Relevance error) 2 +Interaction MSE Z (y)=E(y-Y) 2 +(Y-X) 2 +(X-Z) 2 +2(Y-X)(X-Z) Z is the ideal goal, X is the defined goal, and y is the actual result Hansen-Hurwitz-Pritzker call them requirements (Z), specifications (X) and operations (y)

Design issues X-Z is crucial in the design situation Do we want an approximate solution to the right problem or an exact solution to the wrong problem?

The design approach (Dalenius and Hansen et al) Specify the ideal goal Z Analyze the survey situation (financial, methodological and information resources) Construct a small number of alternative designs Evaluate the alternatives by reference to associated MSE and costs

The design approach (contd) Make a decision Use one of the alternatives Use a modification of one of them Do not conduct the survey Develop the administrative design Feasibility The signal system A self-contained design document (tree) Plan B

What does this tell us? All error sources should be taken into account There is very little process talk such as the need for CQI However, the common situation was: no process view, no controls Concern about costs and effectiveness of all these controls The user is a somewhat distant player

The user The user was hiding under terms such as subject matter problem, study purpose or the four key functions of a statistical system (reporting, analytic, consulting, research) Tukey 1949 But there were federal statistics users conferences in the U.S. from Dalenius provides more than 200 references on users in a 1967 ISI paper

Who identifies the requirements? Usually seen as one fictive person An official An administrator A statistician acting as a subject-matter specialist Requirements define the population, types of measurement, time dimensions and statistics needed

The designers role vis-à-vis the requirements To critique the suggested requirements To suggest QC procedures, construct dummy tables to check the decision- making and perform sensitivity analysis To act as the devils advocate and discuss specific result interpretations with the user

Kishs contributions The neo-Bayesian view Appreciates the literature by Schlaifer, Ericson, Edwards, Lindman and Savage on Bayesian methods in survey sampling and psychometrics For instance, judgment estimates of measurement biases may be combined with sampling variances to construct more realistic estimates of the total survey error

More from Kish Experiments and sample surveys might not be sufficient. Other investigations collecting data with considerable care and control but without randomization and probability sampling might be necessary.

Kishs view on design Multipurpose is great from an economical point of view. If one principal statistic can be identified that alone can decide the design If a small number of principal statistics can be identified a reasonable design compromise is possible If statistics are too disparate a joint design might not be possible

Kish on economic design Requires joint consideration of sampling and nonsampling errors Sometimes demands prior or pilot studies of sufficient size Requires information about unit variance Emphasizes a small total error Appreciates the fact that a reduction of one source might increase total error

Examples of decisions Frame needs updating? Reference period? Acceptable respondent rules? Number of callbacks? Allocation of callbacks? How much and what kind of editing? Mix of modes?

Kish summed up Get a good balance between different error sources We need to know how error structures behave under different design alternatives Relevant information should be recorded during implementation (paradata) Many practical constraints The multipurpose nature calls for a compromise

Hansen, Dalenius and colleagues on standards General standards Measurable survey plan, self-contained plan, replications should generate similar results, cost- efficient, sufficiently simple plan Standards for error control Relevance control, control of accuracy (should be dominated by variance terms) Minimum performance standards Check that standards yield the results expected

Hansen, Dalenius and colleagues summed up One should be guided by common sense, experience and theory Design and execution is a management and systems analysis problem A survey is an economic production process Survey goals must be identified Standards must be dynamic End the practice that sampling error is viewed as the total error They predicted the CASM movement

More from Hansen, Dalenius and colleagues The examination of design alternatives is costly and time-consuming There is a risk of overcontrol and inadequate control. Consequences of large errors must guide any relaxation but they dont talk about CQI One might have to compromise relevance to get controllable measurements or abstain from the survey Keep bias near zero and allow variance at expected levels

What happened? Still no design formula General design principles exist for some areas Still a concentration on some error sources more than others CASM happened We got standards The TSE paradigm accepted but has some promotional problems Many of the early thoughts were just that, very little practice, but still useful