Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 4 Designing Studies 4.1Samples and Surveys 4.2Experiments 4.3Using Studies Wisely.

Similar presentations


Presentation on theme: "Chapter 4 Designing Studies 4.1Samples and Surveys 4.2Experiments 4.3Using Studies Wisely."— Presentation transcript:

1 Chapter 4 Designing Studies 4.1Samples and Surveys 4.2Experiments 4.3Using Studies Wisely

2 Section 4.1 Samples and Surveys After this section, you should be able to… IDENTIFY the population and sample in a sample survey IDENTIFY voluntary response samples and convenience samples DESCRIBE how to use a table of random digits to select a simple random sample (SRS) DESCRIBE simple random samples, stratified random samples, and cluster samples EXPLAIN how undercoverage, nonresponse, and question wording can lead to bias in a sample survey Learning Objectives

3 Sampling and Surveys Population and Sample The distinction between population and sample is basic to statistics. To make sense of anysample result, you must know whatpopulation the sample represents Definition: The population in a statistical study is the entire group of individuals about which we want information. A sample is the part of the population from which we actually collect information. We use information from a sample to draw conclusions about the entire population. Population Sample Collect data from a representative Sample... Make an Inference about the Population.

4 Sampling & Surveys Population: the entire group of individuals about which we want information. Sample: the part of the population from which we actually collect information. We use information from a sample to draw conclusions about the entire population. Sample survey examples: - Gallup Poll (1500 weekly) - Current Population Survey (60,000 households)

5 Sampling & Surveys Sample survey: - what population do we want to describe. - what do we want to measure. - decide how to choose a sample from a population.

6 Sampling and Surveys The Idea of a Sample Survey We often draw conclusions about a whole population on the basis of a sample. Choosing a sample from a large, varied population is not that easy. Step 1: Define the population we want to describe. Step 2: Say exactly what we want to measure. A “sample survey” is a study that uses an organized plan to choose a sample that represents some specific population. Step 3: Decide how to choose a sample from the population. Step 1: Define the population we want to describe. Step 2: Say exactly what we want to measure. A “sample survey” is a study that uses an organized plan to choose a sample that represents some specific population. Step 3: Decide how to choose a sample from the population.

7 Sampling and Surveys How to Sample Badly How can we choose a sample that we can trust to represent the population? There are a number ofdifferent methods to select samples. Definition: Choosing individuals who are easiest to reach results in a convenience sample. Definition: The design of a statistical study shows bias if it systematically favors certain outcomes. Convenience samples often produce unrepresentative data…why?

8 Sampling & Surveys AP Exam Tip: If you are asked to describe how the design of a study leads to bias, you are expected to identify the direction of the bias.

9 Sampling and Surveys How to Sample Badly Convenience samples are almost guaranteed toshow bias. So are voluntary response samples, in which people decide whether to join the samplein response to an open invitation. Definition: A voluntary response sample consists of people who choose themselves by responding to a general appeal. Voluntary response samples show bias because people with strong opinions (often in the same direction) are most likely to respond.

10 Sampling and Surveys How to Sample Well: Random Sampling The statistician’s remedy is to allow impersonal chanceto choose the sample. A sample chosen by chancerules out both favoritism by the sampler and self-selection by respondents. Random sampling, the use of chance to select a sample, is the central principle of statistical sampling. Definition: A simple random sample (SRS) of size n consists of n individuals from the population chosen in such a way that every set of n individuals has an equal chance to be the sample actually selected. In practice, people use random numbers generated by a computer or calculator to choose samples. If you don’t have technology handy, you can use a table of random digits.

11 Sampling & Surveys Graphing calculators have the ability to select random numbers. The first question to ask about any sample is whether it was chosen at random. Other alternatives to an SRS involves sampling important groups (called strata) within the population separately. Then these separate “subsamples” are combined to form one Stratified random sample: first classify the population into groups of similar individuals, called strata. Then choose a separate SRS in each stratum and combine these SRSs to form the full sample.

12 Sampling and Surveys How to Choose an SRS Definition: A table of random digits is a long string of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 with these properties: Each entry in the table is equally likely to be any of the 10 digits 0 - 9. The entries are independent of each other. That is, knowledge of one part of the table gives no information about any other part.

13 Sampling and Surveys Other Sampling Methods The basic idea of sampling is straightforward: take anSRS from the population and use your sample results togain information about the population. Sometimes thereare statistical advantages to using more complexsampling methods. One common alternative to an SRS involves samplingimportant groups (called strata) within the populationseparately. These “sub-samples” are combined to formone stratified random sample. Definition: To select a stratified random sample, first classify the population into groups of similar individuals, called strata. Then choose a separate SRS in each stratum and combine these SRSs to form the full sample.

14 Sampling and Surveys Other Sampling Methods Although a stratified random sample cansometimes give more precise information about apopulation than an SRS, both sampling methodsare hard to use when populations are large andspread out over a wide area. In that situation, we’d prefer a method that selectsgroups of individuals that are “near” one another. Definition: To take a cluster sample, first divide the population into smaller groups. Ideally, these clusters should mirror the characteristics of the population. Then choose an SRS of the clusters. All individuals in the chosen clusters are included in the sample.

15 Sampling and Surveys Other Sampling Methods Although a stratified random sample cansometimes give more precise information about apopulation than an SRS, both sampling methodsare hard to use when populations are large andspread out over a wide area. In that situation, we’d prefer a method that selectsgroups of individuals that are “near” one another. Definition: To take a cluster sample, first divide the population into smaller groups. Ideally, these clusters should mirror the characteristics of the population. Then choose an SRS of the clusters. All individuals in the chosen clusters are included in the sample.

16 Sampling & Surveys Cluster samples and clusters: divide the population into smaller groups. These clusters should mirror the characteristics of the population. Then choose an SRS of the clusters. All individuals in the chosen clusters are included in the sample. Example: large high school; 200 students; 8 homerooms with 25 students each. Cluster samples are used for practical reasons. They do not offer the statistical advantage of better information but are chosen for ease or convenience.

17 Sampling and Surveys Example: Sampling at a School Assembly Describe how you would use the followingsampling methods to select 80 students tocomplete a survey. (a) Simple Random Sample (b) Stratified Random Sample (c) Cluster Sample

18 Sampling & Surveys Understand the difference between clusters and strata. - Each stratum contains similar individuals with large differences between strata. - Each cluster should look like the population but on a smaller scale. Most large-scale sample surveys use multi-stage samples that combine two or more sampling methods.

19 Sampling & Surveys The purpose of a sample is to give us information about a larger population. The process of drawing conclusions about a population is to called inference because we infer information about the population from what we know about the sample. We rely on random sampling to eliminate bias. Why: 1. The results obey the laws of probability that govern chance behavior. 2. The laws of probability allow trustworthy inference about the population.

20 Sampling and Surveys Inference for Sampling The purpose of a sample is to give us information about alarger population. The process of drawing conclusions about a population onthe basis of sample data is called inference. Why should we rely on random sampling? 1)To eliminate bias in selecting samples from the list of available individuals. 2)The laws of probability allow trustworthy inference about the population Results from random samples come with a margin of error that sets bounds on the size of the likely error. Larger random samples give better information about the population than smaller samples.

21

22 Sampling & Surveys Even a large random sample may give results that differs from the truth. This “sample variability” is described by the margin of error. Some sample surveys can introduce bias that makes a survey meaningless. Good sampling techniques can reduce sources of error. Sampling errors: the margin of error tells us how much sampling variability to expect and we can control this by choosing the size of our random sample. - sampling frame: list of individuals from which we draw our sample.

23 Sampling and Surveys Sample Surveys: What Can Go Wrong? Most sample surveys are affected by errors in addition tosampling variability. Good sampling technique includes the art of reducing allsources of error. Definition Undercoverage occurs when some groups in the population are left out of the process of choosing the sample. Nonresponse occurs when an individual chosen for the sample can’t be contacted or refuses to participate. A systematic pattern of incorrect responses in a sample survey leads to response bias. The wording of questions is the most important influence on the answers given to a sample survey.

24 Sampling & Surveys Response bias: when someone gives an incorrect response. Example: visit a dentist last 6 months. Wording of questions: an important influence on the answers given on a survey. Order matters in asking questions.

25 Experiments A survey gathers information without disturbing the population. An observational study watches the behavior and/or interactions between participants. Observational Study: observes individuals and measures variables of interest but does not attempt to influence the responses. This study is a poor way to gauge the effect that changes in one variable has on another. Experiment: deliberately imposes some treatment on individuals to measure their responses. When we want to understand cause & effect, experiments are the only source of convincing data.

26 Experiments Response variable: measures an outcome of a study. Explanatory variable: explains or influences changes in a response variable. Lurking variables: a variable that is not among the explanatory or response variables in a study that may influence the response variable. Example: heart attack experiment of women who took hormones. LV – age, health conditions, number of visits to doctor. Confounding variables: when two variables are associated in such a way that their effects on a response variable cannot be distinguished from each other. Example: age of women

27 Experiments AP Exam Tip: If you are asked to identify a possible confounding variable in a given setting, you are expected to explain how the variable you choose 1. is associated with the explanatory variable and 2. affects the response variable.

28 Experiments Treatment: a specific condition applied to the individuals in an experiment. If an experiment has several explanatory variables, a treatment is a combination of specific values of these variables. Experimental units: the smallest collection of individuals to which treatments are applied. When the units are human beings, they are often called subjects. Experiments can give good evidence for causation. Explanatory variables in an experiment are called factors.

29 Experiments Simple experiment design: Experimental units  Treatment  Measure Response However, in field experiments, we need to make sure to reduce or eliminate confounding variables. One solution is comparative experiment or random assignment. Random assignment: experimental units are assigned to treatments at random, using some sort of chance process. Completely randomized design: the treatments are assigned to all the experimental units completely by chance.

30 Experiments Control group: used to provide a baseline comparing the effects of the other treatments. Many experiments contain a control group that receives an inactive treatment (placebo). Placebo effect is when subjects respond to a dummy treatment (confounding variable). The strength of the placebo effect is a strong argument for randomized comparative experiments. Some experimental designs don’t include a control group. This is when the researchers simply want to compare the effects of several treatments.

31 Experiments AP EXAM TIP: If you are asked to describe the design of an experiment on the AP exam, you won’t get full credit if you just draw a diagram. You are expected to describe verbally how the treatments are assigned to the experimental units and clearly state what will be measured and compared.

32 Experiments Three Principles of Experimental Design 1.Control – lurking variables that might affect the response. Use a comparative design. 2.Random assignment - use random design to assign experimental units to treatments. This helps balance the effects of lurking variables. 3.Replication – use enough experimental units in each group so that any differences in the effects of the treatments can be distinguished from chance differences between the groups.

33 Experiments Statistical significance: An observed effect so large that it would rarely occur by chance is called statistically significant. Can an “unlucky” random assignment lead to confounding? No! The design of the experiment takes care of this by randomly assigning subjects to 2 treatment groups. The “unlucky” random assignment are taken into account in determining statistical significance. Whenever possible, experiments with human subjects should be double-blind. Double-blind: in an experiment, neither the subjects nor those who interact with them and measure the response variable know which treatment a subject received.

34 Experiments Block (a form of control): a group of experimental units that are known before the experiment to be similar in some way that is expected to affect the response to the treatments. Randomized block design: the random assignment of experimental units to treatments is carried out separately within each block. Moral: control what you can, block on what can’t control, and randomize to create comparable groups!


Download ppt "Chapter 4 Designing Studies 4.1Samples and Surveys 4.2Experiments 4.3Using Studies Wisely."

Similar presentations


Ads by Google