Presentation is loading. Please wait.

Presentation is loading. Please wait.

Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 1 of 28 Chapter 1 Section 3 Other Effective Sampling.

Similar presentations


Presentation on theme: "Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 1 of 28 Chapter 1 Section 3 Other Effective Sampling."— Presentation transcript:

1 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 1 of 28 Chapter 1 Section 3 Other Effective Sampling Methods

2 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 2 of 28 Chapter 1 – Section 3 ●Learning objectives  Obtain a stratified sample  Obtain a systematic sample  Obtain a cluster sample 1 2 3

3 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 3 of 28 Chapter 1 – Section 3 ●Learning objectives  Obtain a stratified sample  Obtain a systematic sample  Obtain a cluster sample 1 2 3

4 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 4 of 28 Chapter 1 – Section 3 ●There are other effective ways to collect data  Stratified sampling  Systematic sampling  Cluster sampling ●Each of these is particularly appropriate in certain specific circumstances

5 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 5 of 28 Chapter 1 – Section 3 ●A stratified sample is obtained when we choose a simple random sample from subgroups of a population  This is appropriate when the population is made up of nonoverlapping (distinct) groups called strata ●A stratified sample is obtained when we choose a simple random sample from subgroups of a population  This is appropriate when the population is made up of nonoverlapping (distinct) groups called strata  Within each strata, the individuals are likely to have a common attribute ●A stratified sample is obtained when we choose a simple random sample from subgroups of a population  This is appropriate when the population is made up of nonoverlapping (distinct) groups called strata  Within each strata, the individuals are likely to have a common attribute  Between the stratas, the individuals are likely to have different common attributes

6 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 6 of 28 Chapter 1 – Section 3 ●Example – polling a population about a political issue  It is reasonable to divide up the population into Democrats, Republicans, and Independents  It is reasonable to believe that the opinions of individuals within each party are the same  It is reasonable to believe that the opinions differ from group to group ●Therefore it makes sense to consider each strata separately

7 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 7 of 28 Chapter 1 – Section 3 ●Example – a poll about safety within a university ●Three identified strata  Resident students  Commuter students  Faculty and staff ●Example – a poll about safety within a university ●Three identified strata  Resident students  Commuter students  Faculty and staff ●It is reasonable to assume that the opinions within each group are similar ●It is reasonable to assume that the opinions between each group are different

8 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 8 of 28 Chapter 1 – Section 3 ●Assume that the sizes of the strata are  Resident students – 5,000  Commuter students – 4,000  Faculty and staff – 1,000 ●If we wish to obtain a sample of size n = 100 that reflects the same relative proportions, we would want to choose  50 resident students  40 commuter students  10 faculty and staff

9 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 9 of 28 Chapter 1 – Section 3 ●For each strata  Choose 50 out of 5,000 resident students with a simple random sample  Choose 40 out of 4,000 commuter students with a simple random sample  Choose 10 out of 1,000 faculty and staff with a simple random sample ●This provides us with a stratified sample that reflects the actual proportions of our strata within the population

10 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 10 of 28 Chapter 1 – Section 3 ●Learning objectives  Obtain a stratified sample  Obtain a systematic sample  Obtain a cluster sample 1 2 3

11 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 11 of 28 Chapter 1 – Section 3 ●A systematic sample is obtained when we choose every k th individual in a population ●The first individual selected corresponds to a random number between 1 and k ●Systematic sampling is appropriate  When we do not have a frame  When we do not have a list of all the individuals in a population

12 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 12 of 28 Chapter 1 – Section 3 ●Example – polling customers about satisfaction with service ●We do not have a list of customers arriving that day ●We do not even know how many customers will arrive that day ●Simple random sampling (and stratified sampling) cannot be implemented

13 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 13 of 28 Chapter 1 – Section 3 ●Assume that  We want to choose a sample of 40 customers  We believe that there will be about 350 customers ●Assume that  We want to choose a sample of 40 customers  We believe that there will be about 350 customers ●Values of k  k = 7 is reasonable because it is likely that enough customers will arrive to reach the 40 target  k = 2 is not reasonable because we will only interview the very early customers  k = 20 is not reasonable because it is unlikely that enough customers will arrive to reach the 40 target

14 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 14 of 28 Chapter 1 – Section 3 ●Using the value k = 7  We choose a random number between 1 and 7, assume that it is 4  We interview the 4 th customer that day ●Using the value k = 7  We choose a random number between 1 and 7, assume that it is 4  We interview the 4 th customer that day  We interview the 11 th customer that day (11 = 4 + 7) ●Using the value k = 7  We choose a random number between 1 and 7, assume that it is 4  We interview the 4 th customer that day  We interview the 11 th customer that day (11 = 4 + 7)  We interview the 18 th customer that day (18 = 11 + 7) ●Using the value k = 7  We choose a random number between 1 and 7, assume that it is 4  We interview the 4 th customer that day  We interview the 11 th customer that day (11 = 4 + 7)  We interview the 18 th customer that day (18 = 11 + 7)  We interview the 25 th customer that day (25 = 18 + 7) ●Using the value k = 7  We choose a random number between 1 and 7, assume that it is 4  We interview the 4 th customer that day  We interview the 11 th customer that day (11 = 4 + 7)  We interview the 18 th customer that day (18 = 11 + 7)  We interview the 25 th customer that day (25 = 18 + 7) ●We continue to interview customers until we reach our target of 40 in the sample

15 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 15 of 28 Chapter 1 – Section 3 ●This requires approximately 280 customers  280 is small enough so that we will have enough customers arriving (we expect around 350)  280 is large enough to cover the majority of the expected 350 customers ●The choice of k is difficult if have no idea of the total number of customers ●Sometimes some values of k are not appropriate (for example k = 10 when the individuals arrive as male, female, male, female, male, …)

16 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 16 of 28 Chapter 1 – Section 3 ●Learning objectives  Obtain a stratified sample  Obtain a systematic sample  Obtain a cluster sample 1 2 3

17 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 17 of 28 Chapter 1 – Section 3 ●A cluster sample is obtained when we choose a random set of groups and then select all individuals within those groups ●We can obtain a sample of size 50 by choosing 10 groups of 5 ●Cluster sampling is appropriate when it is very time consuming or expensive to choose the individuals one at a time

18 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 18 of 28 Chapter 1 – Section 3 ●Example – testing the fill of bottles  It is time consuming to pull individual bottles  It is expensive to waste an entire cartons of 12 bottles to just test one bottle ●If we would like to test 240 bottles, we could  Randomly select 20 cartons  Test all 12 bottles within each carton ●This reduces the time and expense required

19 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 19 of 28 Chapter 1 – Section 3 ●A convenience sample is obtained when we choose individuals in an easy, or convenient way ●Self-selecting samples are examples of convenience sampling  Individuals who respond to television or radio announcements ●A convenience sample is obtained when we choose individuals in an easy, or convenient way ●Self-selecting samples are examples of convenience sampling  Individuals who respond to television or radio announcements ●“Just asking around” is an example of convenience sampling  Individuals who are known to the pollster

20 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 20 of 28 Chapter 1 – Section 3 ●Convenience sampling has little statistical validity  The design is poor  The results are suspect ●However, there are times when convenience sampling could be useful as a rough guess  If none of your co-workers are concerned about a particular issue, then it is possible that the set of all employees would not be concerned about that issue

21 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 21 of 28 Chapter 1 – Section 3 ●A multistage sample is obtained using a combination of  Simple random sampling  Stratified sampling  Systematic sampling  Cluster sampling ●Many large scale samples (the US census in noncensus years) use multistage sampling

22 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 22 of 28 Chapter 1 – Section 3 ●Example – choosing 3 rd grade students ●The following method combines cluster sampling with simple random sampling ●We want a sample of 240 3 rd grade students  We randomly select 20 elementary schools  We perform a simple random sample within each school to choose 12 students

23 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 23 of 28 Chapter 1 – Section 3 ●The sample size is very important in statistical analysis ●Certain sample sizes are required to reach certain conclusions ●This will be covered in later chapters

24 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 24 of 28 Summary: Chapter 1 – Section 3 ●There are other sampling methods that are particularly useful in certain situations  Stratified sampling to cover the different strata  Systematic sampling when the frame is unknown  Cluster sampling to reduce the time and expense required  Multistage sampling for effective large scale samples ●The choice of sampling methods depends on the structure of the population and the goals of the analyst

25 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 25 of 28 Summary: Chapter 1 – Section 3

26 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 26 of 28 Summary: Chapter 1 – Section 3

27 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 27 of 28 Summary: Chapter 1 – Section 3

28 Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 28 of 28 Summary: Chapter 1 – Section 3


Download ppt "Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 1 of 28 Chapter 1 Section 3 Other Effective Sampling."

Similar presentations


Ads by Google