Determining the Size of

Slides:



Advertisements
Similar presentations
© 2011 Pearson Education, Inc
Advertisements

Statistics Versus Parameters
Estimation in Sampling
Sampling: Final and Initial Sample Size Determination
Math 161 Spring 2008 What Is a Confidence Interval?
Chapter 8 Estimating Single Population Parameters
Class notes for ISE 201 San Jose State University
Determining the Size of
Why Differences are Important
Sampling.
Sampling Distributions
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
Sampling Distributions
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Determining the Size of a Sample
Sample size. Ch 132 Sample Size Formula Standard sample size formula for estimating a percentage:
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
AM Recitation 2/10/11.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Dr Mohammad Hossein Fallahzade Determining the Size of a Sample In the name of God.
Determining Sample Size
Determining the Sampling Plan
Sampling: Theory and Methods
Essentials of Marketing Research
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination CHAPTER Eleven.
Confidence Interval Estimation
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
Estimates and Sample Sizes Lecture – 7.4
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Chapter Twelve Census: Population canvass - not really a “sample” Asking the entire population Budget Available: A valid factor – how much can we.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
PARAMETRIC STATISTICAL INFERENCE
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Sample Size Determination CHAPTER thirteen.
Vegas Baby A trip to Vegas is just a sample of a random variable (i.e. 100 card games, 100 slot plays or 100 video poker games) Which is more likely? Win.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Chapter 18: Sampling Distribution Models
SAMPLING Obtaining a Sample From a Population. A CENSUS is a sample of an entire Population…. They are seldom done because….. 1. Impossible 2. Unprofitable.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
1 MARKETING RESEARCH Week 5 Session A IBMS Term 2,
Determining the Size of a Sample 1 Copyright © 2014 Pearson Education, Inc.
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Learning Objective Chapter 12 Sample Size Determination Copyright © 2000 South-Western College Publishing Co. CHAPTER twelve Sample Size Determination.
 The point estimators of population parameters ( and in our case) are random variables and they follow a normal distribution. Their expected values are.
Section 7-3 Estimating a Population Mean: σ Known.
S-005 Sampling. Some basic terminology Population –Larger group (the research population) –Sometimes can be quite small Sample –A subset of the population.
CONFIDENCE INTERVALS.
Sampling Fundamentals 2 Sampling Process Identify Target Population Select Sampling Procedure Determine Sampling Frame Determine Sample Size.
SAMPLING Obtaining a Sample From a Population. A population is all the people or objects of interest in a study.
1 Mean Analysis. 2 Introduction l If we use sample mean (the mean of the sample) to approximate the population mean (the mean of the population), errors.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.
Chapter Eleven Sample Size Determination Chapter Eleven.
What statistical tests have we learned so far? Descriptive statistics (chp. 12) –Mean, median, mode –Frequency of each response (frequencies), range, standard.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2010 Pearson Education, Inc. Slide
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Sampling: Distribution of the Sample Mean (Sigma Known) o If a population follows the normal distribution o Population is represented by X 1,X 2,…,X N.
Chapter 9 Estimation and Confidence Intervals. Our Objectives Define a point estimate. Define level of confidence. Construct a confidence interval for.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Sampling and Sampling Distribution
Sample Size Determination
Determining the Size of a Sample
Presentation transcript:

Determining the Size of Chapter 13 Determining the Size of a Sample

Sample Accuracy Sample accuracy: refers to how close a random sample’s statistic (e.g. mean, variance, proportion) is to the population’s value it represents (mean, variance, proportion) Important points: Sample size is NOT related to representativeness … you could sample 20,000 persons walking by a street corner and the results would still not represent the city; however, an n of 100 could be “right on.”

Sample Accuracy Important points: Sample size, however, IS related to accuracy. How close the sample statistic is to the actual population parameter (e.g. sample mean vs. population mean) is a function of sample size.

Sample Size AXIOMS To properly understand how to determine sample size, it helps to understand the following AXIOMS…

Sample Size Axioms The only perfectly accurate sample is a census. A probability sample will always have some inaccuracy (sample error). The larger a probability sample is, the more accurate it is (less sample error). Probability sample accuracy (error) can be calculated with a simple formula, and expressed as a + % value.

Sample Size Axioms…cont. You can take any finding in the survey, replicate the survey with the same probability sample plan & size, and you will be “very likely” to find the same result within the + range of the original findings. In almost all cases, the accuracy (sample error) of a probability sample is independent of the size of the population.

Sample Size Axioms…cont. A probability sample can be a very tiny percentage of the population size and still be very accurate (have little sample error). The size of the probability sample depends on the client’s desired accuracy (acceptable sample error) balanced against the cost of data collection for that sample size.

The Confidence Interval Method of Determining Sample Size There is only one method of determining sample size that allows the researcher to PREDETERMINE the accuracy of the sample results… The Confidence Interval Method of Determining Sample Size

The Confidence Interval Method of Determining Sample Size Notion of Confidence Interval Confidence interval: range whose endpoints define a certain percentage of the responses to a question Central limit theorem: a theory that holds that values taken from repeated samples of a survey within a population would look like a normal curve. The mean of all sample means is the mean of the population.

The Confidence Interval Method of Determining Sample Size Confidence interval approach: applies the concepts of accuracy, variability, and confidence interval to create a “correct” sample size Two types of error: Nonsampling error: pertains to all sources of error other than sample selection method and sample size (Discuss in Chapter 14) Sampling error: involves sample selection and sample size…this is the error that we are controlling through formulas Sample error formula:

The Confidence Interval Method of Determining Sample Size The relationship between sample size and sample error:

The Confidence Interval Method of Determining Sample Size - Proportions Variability Variability: refers to how similar or dissimilar responses are to a given question P (%): share that “have” or “are” or “will do” etc. Q (%): 100%-P%, share of “have nots” or “are nots” or “won’t dos” etc. N.B.: The more variability in the population being studied, the larger the sample size needed to achieve stated accuracy level.

With Nominal data (i.e. Yes, No), we can conceptualize answer variability with bar charts…the highest variability is 50/50

The Central Limit Theorem allows us to use the logic of the Normal Curve Distribution Since 95% of samples drawn from a population will fall within + 1.96 x Sample error (this logic is based upon our understanding of the normal curve) we can make the following statement: ….

If we conducted our study over and over, e. g If we conducted our study over and over, e.g.1,000 times, we would expect our result to fall within a known range (+ 1.96 s.d.’s of the mean). Based upon this, there are 95 chances in 100 that the true value of the universe statistic (proportion, share, mean) falls within this range!

The Confidence Interval Method of Determining Sample Size Normal Distribution 1.96 X s.d. defines the endpoints for 95% of the distribution

We also know that, given the amount of variability in the population, the sample size affects the size of the confidence interval; as n goes down the interval widens (more “sloppy”)

So, what have we learned thus far? There is a relationship among: the level of confidence we desire that our results be repeated within some known range if we were to conduct the study again, and… the variability (in responses) in the population and… the amount of acceptable sample error (desired accuracy) we wish to have and… the size of the sample.

Sample Size Formula The formula requires that we (a.)specify the amount of confidence we wish to have, (b.) estimate the variance in the population, and (c.) specify the level of desired accuracy we want. When we specify the above, the formula tells us what sample size we need to use….n

Sample Size Formula - Proportion The sample size formula for estimating a proportion (also called a percentage or share):

Practical Considerations in Sample Size Determination How to estimate variability (p and q shares) in the population Expect the worst case (p=50%; q=50%) Estimate variability: results of previous studies or conduct a pilot study

Practical Considerations in Sample Size Determination How to determine the amount of desired sample error Researchers should work with managers to make this decision. How much error is the manager willing to tolerate (less error = more accuracy)? Convention is + 5% The more important the decision, the less should be the acceptable level of the sample error

Practical Considerations in Sample Size Determination How to decide on the level of confidence desired Researchers should work with managers to make this decision. The higher the desired confidence level, the larger the sample size needed Convention is 95% confidence level (z=1.96 which is + 1.96 s.d.’s ) The more important the decision, the more likely the manager will want more confidence. For example, a 99% confidence level has a z=2.58.

Example: Estimating a Percentage (proportion or share) in the Population What is the Required Sample Size? Five years ago a survey showed that 42% of consumers were aware of the company’s brand (Consumers were either “aware” or “not aware”) After an intense ad campaign, management will conduct another survey. They want to be 95% confident (95 chances in 100) that the survey estimate will be within + 5% of the true share of “aware” consumers in the population. What is n?

Estimating a Percentage: What is n? Z=1.96 (95% confidence) p=42% (p, q and e must be in the same units) q=100% - p%=58% e= + 5% What is n?

N=374 What does this mean? It means that if we use a sample size of 374, after the survey, we can say the following of the results: (Assume results show that 55% are aware) “Our most likely estimate of the percentage of consumers that are “aware” of our brand name is 55%. In addition, we are 95% confident that the true share of “aware” customers in the population falls between 52.25% and 57.75%.” Note that: ( + .05 x 55% = + 2.75%) !!!!

Estimating a Mean This requires a different formula Z is determined the same way (1.96 or 2.58) e is expressed in terms of the units we are estimating, i.e. if we are measuring attitudes on a 1-7 scale, we may want our error to be no more than + .5 scale units. If we are estimating dollars being paid for a product, we may want our error to be no more than + $3.00. S is a little more difficult to estimate, but must be in same units as e.

Estimating “s” in the Formula to Determine the Sample Size Required to Estimate a Mean Since we are estimating a mean, we can assume that our data are either interval or ratio. When we have interval or ratio data, the standard deviation of the sample, s, may be used as a measure of variance. How to estimate s? Use standard deviation of the sample from a previous study on the target population Conduct a pilot study of a few members of the target population and calculate s

Example: Estimating the Mean of a Population What is the required sample size, n? Management wants to know customers’ level of satisfaction with their service. They propose conducting a survey and asking for satisfaction on a scale from 1 to 10 (since there are 10 possible answers, the range = 10). Management wants to be 99% confident in the results (99 chances in 100 that true value is captured) and they do not want the allowed error to be more than + .5 scale points. What is n?

What is n? S = 1.7 (from a pilot study), Z = 2.58 (99% confidence), and e = .5 scale points What is n? It is 77. Assume the survey average score was 7.3, what does this “tell us?” A 10 is very satisfied and a 1 is not satisfied at all. Answer: “Our most likely estimate of the level of consumer satisfaction is 7.3 on a 10-point scale. In addition, we are 99% confident that the true level of satisfaction in our consumer population falls between 6.8 and 7.8 on the scale.”

Other Methods of Sample Size Determination Arbitrary “percentage rule of thumb” sample size: Arbitrary sample size approaches rely on erroneous rules of thumb (e.g. “n must be at least 5% of the population”). Arbitrary sample sizes are simple and easy to apply, but they are neither efficient nor economical. (e.g. Using the “5 percent rule,” if the universe is 12 million, n = 600,000 – a very large and costly result)

Other Methods of Sample Size Determination…cont. Conventional sample size specification Conventional approach follows some “convention” or number believed somehow to be the right sample size (e.g. 1,000 – 1,200 used for national opinion polls w/+ 3% error) Using conventional sample size can result in a sample that may be too large or too small. Conventional sample sizes ignore the special circumstances of the survey at hand.

Other Methods of Sample Size Determination…cont. Statistical analysis requirements of sample size specification Sometimes the researcher’s desire to use particular statistical technique influences sample size. As cross comparisons go up cell sizes go up and n goes up. Cost basis of sample size specification Using the “all you can afford” method, instead of the value of the information to be gained from the survey being the primary consideration in sample size determination, the sample size is based on budget factors.

Special Sample Size Determination Situations Sample Size Using Nonprobability Sampling When using nonprobability sampling, sample size is unrelated to accuracy, so cost-benefit considerations must be used