Bias in Survey Sampling. Bias Due to Unrepresentative Samples A good sample is representative. This means that each sample point represents the attributes.

Slides:



Advertisements
Similar presentations
How to survey data without adding bias.
Advertisements

Sampling Partially Adapted from The Research Methods Knowledge Base, William Trochim (2006). & Methods for Social Researchers in Developing Counries, The.
OCR Nationals Level 3 Unit 3. March 2012 M Morison Know what is meant by ‘bias’ in a study Understand that bias needs to be eliminated from a study so.
Drawing Samples in “Observational Studies” Sample vs. the Population How to Draw a Random Sample What Determines the “Margin of Error” of a Poll?
1 Sampling Partially Adapted from The Research Methods Knowledge Base, William Trochim (2006). & Methods for Social Researchers in Developing Counries,
About BIAS…. Bias A systematic error in measuring the estimateA systematic error in measuring the estimate favors certain outcomesfavors certain outcomes.
Chapter 12 Sample Surveys
LT 4.1—Sampling and Surveys Day 3 Notes--Bias
Determining the Size of
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 7 Sampling, Significance Levels, and Hypothesis Testing Three scientific traditions critical.
Dear Readers, If you had it to do all over again, would you have children? Ann Landers Ann Landers posed the question to the readers of her advice column.
Copyright © 2011 Pearson Education, Inc. Samples and Surveys Chapter 13.
Chapter 12: AP Statistics
Voluntary Sampling  Voluntary samples are surveys where members of a population are invited to respond voluntarily.  Responses are not mandatory.
1.3 Experimental Design Designing a Statistical Study
C1, L2, S1 Design Method of Data Collection Surveys and Polls Experimentation Observational Studies.
4.2 Statistics Notes What are Good Ways and Bad Ways to Sample?
Research Strategies, Part 2
 Sampling Design Unit 5. Do frog fairy tale p.89 Do frog fairy tale p.89.
Political Science 30: Political Inquiry Drawing a Good Sample.
7. Logic of Sampling Jin-Wan Seo, Professor Dept. of Public Administration, University of Incheon.
Chapter 12 Designing Good Samples. Doubting the Holocaust? An opinion poll conducted in 1992 for the American Jewish Committee asked: Does it seem possible.
DATA COLLECTION METHODS Sampling
Section 1-4. Objectives Understand the practical and ethical concerns that arise when conducting a study.
Sampling is the other method of getting data, along with experimentation. It involves looking at a sample from a population with the hope of making inferences.
Chapter 12 Notes Surveys, Sampling, & Bias Examine a Part of the Whole: We’d like to know about an entire population of individuals, but examining all.
Pitfalls of Surveys. The Literary Digest Poll 1936 US Presidential Election Alf Landon (R) vs. Franklin D. Roosevelt (D)
Sampling Design Notes Pre-College Math.
Section 2-1 Samples, Good and Bad. Remember: We select a sample in order to get information about some population (entire group of individuals about which.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Agresti/Franklin Statistics, 1 of 56 Chapter 4 Gathering data Learn …. How to gather “good” data About Experiments and Observational Studies.
Sampling Methods.
one variable What is univariate data? We are going to discuss the properties of one variable at a time, like height, time, tempurature etc.
CHAPTER 8: Producing Data Sampling ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
A Survey is a study of one or more characteristics of a group. A Survey is a study of one or more characteristics of a group.
PUT THE FOLLOWING TEST SCORES IN ORDER AND GROUP THEM INTO FOUR GROUPS; Question of day.
Bias in Sampling. Definitions Bias = where the results of the sample are not representative of the population Three sources of Bias in Sampling –Sampling.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
SECTION 4.1. INFERENCE The purpose of a sample is to give us information about a larger population. The process of drawing conclusions about a population.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 7 Sampling, Significance Levels, and Hypothesis Testing Three scientific traditions.
 An observational study observes individuals and measures variable of interest but does not attempt to influence the responses.  Often fails due to.
Bias!!. Journal Topic Why do we take Samples? Fill in the blank: A good sample is of a population. Define Bias We take samples in order to reduce the.
1)Find a distribution of 5 positive whole numbers with the following properties: a)Mean = 4 b)Median = 3 c)Mode = 3 2)How many distributions can you find.
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
Types of Samples. Samples Population – the main group or set from which observations or opinions can be made Sample – a small part of a population from.
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
STATS IN THE REAL WORLD Statistics, Representation, and Interpretation.
one variable What is univariate data? the major purpose of univariate analysis is to describe one variable central tendency: mean, mode, median variability:
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 13 Samples and Surveys.
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Ten percent of U. S. households contain 5 or more people
MATH Section 6.1. Sampling: Terms: Population – each element (or person) from the set of observations that can be made Sample – a subset of the.
Math III U9D5 Warm-up: 1. Decide which method of data collection you would use to collect data for the study (observational study, experiment, simulation,
For the same confidence level, narrower confidence intervals can be achieved by using larger sample sizes.
THE EFFECT OF SAMPLING BIAS ON BIG DATA BY USING THE READERS DIGEST POLL OF THE 1936 ELECTION AS A CASE STUDY, WE EXAMINE HOW THE SAMPLE OF DATA USED AFFECTS.
LOOKING AT SOME BASICS Can you tell the difference?
Bias In Statistics BY: JACKIE SARTORELLI & ANDREW CROSBY.
What type of data is it? QualitativeQuantitative Deals with numbers. Data that can be measured. Length, height, area, volume, weight, speed, time, temperature,
Sources of Bias 1. Voluntary response 2. Undercoverage 3. Nonresponse
Bias and Error What can go wrong?.
Bias On-Level Statistics.
Honors Statistics Chapter 12 Part 2
Inference for Sampling
MA151 Lecture 2: Sampling methods
Unit 1 Day 4 Biased or Unbiased in Sampling
Sampling and Study Design
Bias in Studies Section 1.4.
What do Samples Tell Us Variability and Bias.
COLLECTING STATISTICAL DATA
Presentation transcript:

Bias in Survey Sampling

Bias Due to Unrepresentative Samples A good sample is representative. This means that each sample point represents the attributes of a known number of population elements. A good sample is representative. This means that each sample point represents the attributes of a known number of population elements.samplepopulationsamplepopulation Bias often occurs when the survey sample does not accurately represent the population. The bias that results from an unrepresentative sample is called selection bias. Bias often occurs when the survey sample does not accurately represent the population. The bias that results from an unrepresentative sample is called selection bias.

Selection Bias Under coverage occurs when some members of the population are inadequately represented in the sample. A classic example of under coverage is the Literary Digest voter survey, which predicted that Alfred Landon would beat Franklin Roosevelt in the 1936 presidential election. The survey sample suffered from under coverage of low-income voters, who tended to be Democrats. Under coverage occurs when some members of the population are inadequately represented in the sample. A classic example of under coverage is the Literary Digest voter survey, which predicted that Alfred Landon would beat Franklin Roosevelt in the 1936 presidential election. The survey sample suffered from under coverage of low-income voters, who tended to be Democrats.

How did this happen? The survey relied on a convenience sample, drawn from telephone directories and car registration lists. In 1936, people who owned cars and telephones tended to be more affluent. Under-coverage is often a problem with convenience samples. The survey relied on a convenience sample, drawn from telephone directories and car registration lists. In 1936, people who owned cars and telephones tended to be more affluent. Under-coverage is often a problem with convenience samples.convenience sampleconvenience sample

Non-Response bias Sometimes, individuals chosen for the sample are unwilling or unable to participate in the survey. Non-response bias is the bias that results when respondents differ in meaningful ways from non-respondents. Sometimes, individuals chosen for the sample are unwilling or unable to participate in the survey. Non-response bias is the bias that results when respondents differ in meaningful ways from non-respondents.

The Literary Digest experience illustrates a common problem with mail surveys. Response rate is often low, making mail surveys vulnerable to non-response bias. The Literary Digest experience illustrates a common problem with mail surveys. Response rate is often low, making mail surveys vulnerable to non-response bias.

Voluntary Response Bias Voluntary response bias occurs when sample members are self-selected volunteers, as in voluntary samples. An example would be call-in radio shows that solicit audience participation in surveys on controversial topics (abortion, affirmative action, gun control, etc.). The resulting sample tends to over represent individuals who have strong opinions. Voluntary response bias occurs when sample members are self-selected volunteers, as in voluntary samples. An example would be call-in radio shows that solicit audience participation in surveys on controversial topics (abortion, affirmative action, gun control, etc.). The resulting sample tends to over represent individuals who have strong opinions.voluntary samplesvoluntary samples

Random Sampling is a procedure for sampling from a population in which (a) the selection of a sample unit is based on chance and (b) every element of the population has a known, non-zero probability of being selected. Random sampling helps produce representative samples by eliminating voluntary response bias and guarding against under coverage bias. All probability sampling methods rely on random sampling. is a procedure for sampling from a population in which (a) the selection of a sample unit is based on chance and (b) every element of the population has a known, non-zero probability of being selected. Random sampling helps produce representative samples by eliminating voluntary response bias and guarding against under coverage bias. All probability sampling methods rely on random sampling.

Bias Due to Measurement Error A poor measurement process can also lead to bias. In survey research, the measurement process includes the environment in which the survey is conducted, the way that questions are asked, and the state of the survey respondent. A poor measurement process can also lead to bias. In survey research, the measurement process includes the environment in which the survey is conducted, the way that questions are asked, and the state of the survey respondent.

Leading questions The wording of the question may be loaded in some way to unduly favor one response over another. For example, a satisfaction survey may ask the respondent to indicate where she is satisfied, dissatisfied, or very dissatisfied. By giving the respondent one response option to express satisfaction and two response options to express dissatisfaction, this survey question is biased toward getting a dissatisfied response. The wording of the question may be loaded in some way to unduly favor one response over another. For example, a satisfaction survey may ask the respondent to indicate where she is satisfied, dissatisfied, or very dissatisfied. By giving the respondent one response option to express satisfaction and two response options to express dissatisfaction, this survey question is biased toward getting a dissatisfied response.

Social desirability Most people like to present themselves in a favorable light, so they will be reluctant to admit to unsavory attitudes or illegal activities in a survey, particularly if survey results are not confidential. Instead, their responses may be biased toward what they believe is socially desirable. Most people like to present themselves in a favorable light, so they will be reluctant to admit to unsavory attitudes or illegal activities in a survey, particularly if survey results are not confidential. Instead, their responses may be biased toward what they believe is socially desirable.

Sampling Error and Survey Bias A survey produces a sample statistic, which is used to estimate a population parameter. If you repeated a survey many times, using different samples each time, you would get a different sample statistic with each replication. And each of the different sample statistics would be an estimate for the same population parameter. A survey produces a sample statistic, which is used to estimate a population parameter. If you repeated a survey many times, using different samples each time, you would get a different sample statistic with each replication. And each of the different sample statistics would be an estimate for the same population parameter.

If the statistic is unbiased, the average of all the statistics from all possible samples will equal the true population parameter; even though any individual statistic may differ from the population parameter. The variability among statistics from different samples is called sampling error. If the statistic is unbiased, the average of all the statistics from all possible samples will equal the true population parameter; even though any individual statistic may differ from the population parameter. The variability among statistics from different samples is called sampling error.

Increasing the sample size tends to reduce the sampling error; that is, it makes the sample statistic less variable. However, increasing sample size does not affect survey bias. A large sample size cannot correct for the methodological problems (under coverage, non-response bias, etc.) that produce survey bias. Increasing the sample size tends to reduce the sampling error; that is, it makes the sample statistic less variable. However, increasing sample size does not affect survey bias. A large sample size cannot correct for the methodological problems (under coverage, non-response bias, etc.) that produce survey bias.

The Literary Digest example discussed above illustrates this point. The sample size was very large - over 2 million surveys were completed; but the large sample size could not overcome problems with the sample – under coverage and non-response bias. The Literary Digest example discussed above illustrates this point. The sample size was very large - over 2 million surveys were completed; but the large sample size could not overcome problems with the sample – under coverage and non-response bias.

Test Your Understanding Which of the following statements are true? I. Random sampling is a good way to reduce response bias. II. To guard against bias from under coverage, use a convenience sample. III. Increasing the sample size tends to reduce survey bias. IV. To guard against non-response bias, use a mail-in survey. (A) I only (B) II only (C) III only (D) IV only (E) None of the above.

And the answer is…. The correct answer is (E). None of the statements is true. The correct answer is (E). None of the statements is true. Random sampling provides strong protection against bias from under coverage bias and voluntary response bias; but it is not effective against response bias. Random sampling provides strong protection against bias from under coverage bias and voluntary response bias; but it is not effective against response bias. Random samplingunder coveragevoluntary response bias response bias Random samplingunder coveragevoluntary response bias response bias

A convenience sample does not protect against under coverage bias; in fact, it sometimes causes under coverage bias. A convenience sample does not protect against under coverage bias; in fact, it sometimes causes under coverage bias.convenience sampleconvenience sample Increasing sample size does not affect survey bias. Increasing sample size does not affect survey bias.bias Using a mail-in survey does not prevent non-response bias. In fact, mail-in surveys are quite vulnerable to non-response bias. Using a mail-in survey does not prevent non-response bias. In fact, mail-in surveys are quite vulnerable to non-response bias. non-response bias non-response bias