Determining Sample Size

Slides:



Advertisements
Similar presentations
Sample size estimation
Advertisements

Estimating a Population Proportion
© 2011 Pearson Education, Inc
Estimation in Sampling
1 Introduction to Inference Confidence Intervals William P. Wattles, Ph.D. Psychology 302.
Statistical Issues in Research Planning and Evaluation
Math 161 Spring 2008 What Is a Confidence Interval?
Estimation from Samples Find a likely range of values for a population parameter (e.g. average, %) Find a likely range of values for a population parameter.
Determining the Size of
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 10 th Edition.
Sampling.
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
Introduction to Formal Statistical Inference
Chapter 11 Sampling Design. Chapter 11 Sampling Design.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
SAMPLE SIZE DETERMINATION AND SAMPLING TECHNIQUES
BCOR 1020 Business Statistics
Determining the Size of a Sample
Statistics for Managers Using Microsoft® Excel 7th Edition
BA 427 – Assurance and Attestation Services
Determining the Size of
Review of normal distribution. Exercise Solution.
AM Recitation 2/10/11.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Sampling: Theory and Methods
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Essentials of Marketing Research
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
Estimation of Statistical Parameters
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Chapter Twelve Census: Population canvass - not really a “sample” Asking the entire population Budget Available: A valid factor – how much can we.
PARAMETRIC STATISTICAL INFERENCE
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Sample Size Determination CHAPTER thirteen.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
1 MARKETING RESEARCH Week 5 Session A IBMS Term 2,
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Two THE DESIGN OF RESEARCH.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Sample Size –sigma known Note: We can use the following formula to determine the sample size necessary to discover the “true” mean value from a population.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
1 Estimation of Population Mean Dr. T. T. Kachwala.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
9-1 Copyright © 2016 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
IM911 DTC Quantitative Research Methods Statistical Inference I: Sampling distributions Thursday 4 th February 2016.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Chapter Eleven Sample Size Determination Chapter Eleven.
Chapter 7: The Distribution of Sample Means
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9 Introduction to the t Statistic
The Statistical Imagination Chapter 7. Using Probability Theory to Produce Sampling Distributions.
Virtual University of Pakistan
Sampling and Sampling Distribution
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
3. The X and Y samples are independent of one another.
Sampling Distributions and Estimation
Chapter 8: Inference for Proportions
CONCEPTS OF ESTIMATION
Week Three Review.
Estimating Population Parameters Based on a Sample
Presentation transcript:

Determining Sample Size

Determining Sample Size: Basic principles Determining the sample size depends on many things. It requires much more thought than any theoretical discussion portrays. What is being sampled? How the sample is taken? The cost of the sampling The timing of when an answer is needed The situation and reliability necessary The consequences of making a wrong decision The theory

Sample Representativeness vs Sample Accuracy How representative a sample of the population is determined by the sampling method or plan Sample accuracy refers to how close the statistic, generated from the sample data, is to the true population value the statistic is estimating Sample accuracy is related to the size of the sample

Central Limit Theorem The Central Limit Theorem states that if we take many samples, of at least size 30, then the means from these samples will form a normal distribution It is this THEORETICAL SAMPLING DISTRIBUTION (a NORMAL curve) that allows us to use statistical inference

Methods to Determine Sample Size Arbitrary Approach (10% of the population) Conventional Approach (What others have done ) Cost Basis Approach (How many does budget allow?) Statistical Analysis Approach (Adequate for subgroup analysis) Confidence Interval Approach (Allows us to predetermine how precise our estimates are)

Determining Sample Size What size sample do I need? The answer to this question is influenced by a number of factors, including the purpose of the study, population size, the risk of selecting a "bad" sample, and the allowable sampling error Three criteria will need to be specified to determine the appropriate sample size: the level of precision the level of confidence or risk the degree of variability in the attributes being measured

Precision and Confidence refers to how close the sample estimate is to the true population characteristic depends on the variablity in the sampling distribution of the mean, i.e the standard error indicates the confidence interval within which the population mean can be estimated Confidence reflects the level of certainty that the sample estimates will actually hold true for the population bias is absent from the data accuracy is reflected by the confidence level

The Level Of Precision Sometimes called sampling error, is the range in which the true value of the population is estimated to be This range is often expressed in percentage points, (e.g., ±5 percent) If a researcher finds that 60% of farmers in the sample have adopted a recommended practice with a precision rate of ±5%, then he or she can conclude that between 55% and 65% of farmers in the population have adopted the practice

The Confidence Level The confidence or risk level is based on ideas encompassed under the Central Limit Theorem when a population is repeatedly sampled, the average value of the attribute obtained by those samples is equal to the true population value The values obtained by these samples are distributed normally about the true value, with some samples having a higher value and some obtaining a lower score than the true population value In a normal distribution, approximately 95% of the sample values are within two standard deviations of the true population value (e.g., mean).

The Confidence Level This means that, if a 95% confidence level is selected, 95 out of 100 samples will have the true population value within the range of precision specified earlier There is always a chance that the sample you obtain does not represent the true population value. Such samples with extreme values are represented by the shaded areas in Figure 1 This risk is reduced for 99% confidence levels and increased for 90% (or lower) confidence levels.

The Confidence Level (Figure 1)

Roscoe’s Rules of Thumb for Determining Sample Size Sample sizes larger than 30 and smaller than 500 are appropriate for most research Minimum sample size of 30 for each sub-category is usually necessary In multivariate research, the sample size should be several times as large as the number of variables in the study For simple experimental research, successful research is possible with samples as small as 10 to 20

Degree Of Variability The degree of variability in the attributes being measured refers to the distribution of attributes in the population The more heterogeneous a population, the larger the sample size required to obtain a given level of precision The less variable (more homogeneous) a population, the smaller the sample size

Degree Of Variability Note that a proportion of 50% indicates a greater level of variability than either 20% or 80%. This is because 20% and 80% indicate that a large majority do not or do, respectively, have the attribute of interest Because a proportion of 0.5 indicates the maximum variability in a population, it is often used in determining a more conservative sample size, that is, the sample size may be larger than if the true variability of the population attribute were used

Strategies for Determining Sample Size Using A Census For Small Populations Use the entire population as the sample Although cost considerations make this impossible for large populations, a census is attractive for small populations (e.g., 200 or less) A census eliminates sampling error and provides data on all the individuals in the population Virtually the entire population would have to be sampled in small populations to achieve a desirable level of precision.

Strategies for Determining Sample Size Using A Sample Size Of A Similar Study Use the same sample size as those of studies similar to the one you plan Without reviewing the procedures employed in these studies you may run the risk of repeating errors that were made in determining the sample size for another study However, a review of the literature in your discipline can provide guidance about "typical" sample sizes which are used

Strategies for Determining Sample Size Using Published Tables Rely on published tables which provide the sample size for a given set of criteria First, these sample sizes reflect the number of obtained responses, and not necessarily the number of surveys mailed or interviews planned Second, the sample sizes presume that the attributes being measured are distributed normally or nearly so If this assumption cannot be met, then the entire population may need to be surveyed.

Using Formulas To Calculate A Sample Size Formula For Calculating A Sample For Proportions For populations that are large, Cochran (1963:75) developed the Equation 1 to yield a representative sample for proportions

Using Formulas To Calculate A Sample Size Which is valid where n0 is the sample size Z2is the abscissa of the normal curve that cuts off an area at the tails (1 - equals the desired confidence level, e.g., 95%) e is the desired level of precision, p is the estimated proportion of an attribute that is present in the population, q is 1-p The value for Z is found in statistical tables which contain the area under the normal curve

Using Formulas To Calculate A Sample Size - Example To illustrate, suppose we wish to evaluate a state-wide Extension program in which farmers were encouraged to adopt a new practice Assume there is a large population but that we do not know the variability in the proportion that will adopt the practice Therefore, assume p=.5 (maximum variability) Furthermore, suppose we desire a 95% confidence level and ±5% precision.

Using Formulas To Calculate A Sample Size - Example

Using Formulas To Calculate A Sample Size - Finite Population Correction For Proportions If the population is small then the sample size can be reduced slightly This is because a given sample size provides proportionately more information for a small population than for a large population. The sample size (n0) can be adjusted using Equation 2 below Where n is the sample size and N is the population size.

Using Formulas To Calculate A Sample Size - Finite Population Correction For Proportions Suppose our evaluation of farmers' adoption of the new practice only affected 2,000 farmers The sample size that would now be 323 farmers As you can see, this adjustment (called the finite population correction) can substantially reduce the necessary sample size for small populations

Using Formulas To Calculate A Sample Size - A Simplified Formula For Proportions Yamane (1967:886) provides a simplified formula to calculate sample sizes This formula uses a 95% confidence level and P = .5 are assumed

Using Formulas To Calculate A Sample Size – Other Considerations Another consideration with sample size is the number needed for the data analysis If descriptive statistics are to be used, e.g., mean, frequencies, then nearly any sample size will suffice On the other hand, a good size sample, e.g., 200-500, is needed for multiple regression, analysis of covariance, or log-linear analysis, which might be performed for more rigorous state impact evaluations

Using Formulas To Calculate A Sample Size – Other Considerations The sample size formulas provide the number of responses that need to be obtained Many researchers commonly add 10% to the sample size to compensate for persons that the researcher is unable to contact The sample size also is often increased by 30% to compensate for nonresponse Thus, the number of mailed surveys or planned interviews can be substantially larger than the number required for a desired level of confidence and precision

Using Formulas To Calculate A Sample Size – Reference Glenn D. Israel, Department of Agricultural Education and Communication, and extension specialist, Program Evaluation and Organizational Development, Institute of Food and Agricultural Sciences (IFAS), University of Florida, Gainesville, June 2003