Sample Size Estimation

Slides:



Advertisements
Similar presentations
Statistical Decision Making
Advertisements

Estimation of Sample Size
Confidence Interval and Hypothesis Testing for:
Data Analysis Statistics. Inferential statistics.
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Chapter Seventeen HYPOTHESIS TESTING
PSY 307 – Statistics for the Behavioral Sciences
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Sample size computations Petter Mostad
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Basic Elements of Testing Hypothesis Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology Director, Data Coordinating Center College.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
BS704 Class 7 Hypothesis Testing Procedures
Inferences About Process Quality
Data Analysis Statistics. Inferential statistics.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Today Concepts underlying inferential statistics
5-3 Inference on the Means of Two Populations, Variances Unknown
The t Tests Independent Samples.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 14 Inferential Data Analysis
Sample Size Determination Ziad Taib March 7, 2014.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Sample size calculation
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
AM Recitation 2/10/11.
Difference Two Groups 1. Content Experimental Research Methods: Prospective Randomization, Manipulation Control Research designs Validity Construct Internal.
Hypothesis Testing – Examples and Case Studies
Copyright © 2012 by Nelson Education Limited. Chapter 8 Hypothesis Testing II: The Two-Sample Case 8-1.
Design and Analysis of Clinical Study 4. Sample Size Determination Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
More About Significance Tests
Design and Analysis of Clinical Study 8. Cross-sectional Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Chap 9-1 Two-Sample Tests. Chap 9-2 Two Sample Tests Population Means, Independent Samples Means, Related Samples Population Variances Group 1 vs. independent.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
The binomial applied: absolute and relative risks, chi-square.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Chapter 10 The t Test for Two Independent Samples
© 2004 Prentice-Hall, Inc.Chap 9-1 Basic Business Statistics (9 th Edition) Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
© Copyright McGraw-Hill 2004
More Contingency Tables & Paired Categorical Data Lecture 8.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
366_8. Estimation: Chapter 8 Suppose we observe something in a random sample how confident are we in saying our observation is an accurate reflection.
Psych 230 Psychological Measurement and Statistics Pedro Wolf October 21, 2009.
Chapter 13 Understanding research results: statistical inference.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Sample Size Needed to Achieve High Confidence (Means)
Hypothesis Testing and Statistical Significance
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Logic of Hypothesis Testing
Parameter Estimation.
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Narrative Reviews Limitations: Subjectivity inherent:
Presentation transcript:

Sample Size Estimation Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

The classical hypothesis testing Define a null hypothesis (H0) and a null hypothesis (H1) Collect data (D) Estimate p-value = P(D | H0) If p-value > a, accept H0; if p-value < a, reject H0

Diagnosis and statistical reasoning Disease status Present Absent Test result +ve True +ve False +ve (sensitivity) -ve False -ve True -ve (Specificity) Significance Difference is Present Absent (Ho not true) (Ho is true) Test result Reject Ho No error Type I err. 1-b a Accept Ho Type II err. No error b 1-a a : significance level 1-b : power

Study design issues Setting Participants: inclusion / exclusion criteria Design: survey, factorial, etc Measurements: outcome, covariates Analysis Sample size / power issues

Sample size issues How many judges / consumers? Practical and statistical issues Ethical issues Unnecessarily large number of judges may be deemed unethical Too small a sample may also be unethical as the study can’t show anything.

Practical difference vs statistical significance Outcome Group A Group B Improved 9 18 No improved 21 12 Total 30 30 % improved 30% 60% Outcome Group A Group B Improved 6 12 No improved 14 8 Total 20 20 % improved 30% 60% Chi-square: 5.4; P < 0.05 “Statistically significant” Chi-square: 3.3; P > 0.05 “Statistically insignificant”

Effect of sample size on preference proportion Number of judges % preferred A (p) Variance of p Std Dev of p Z test for p=0.5 P-value 20 0.65 0.0114 0.1067 1.41 0.160 30 0.0076 0.0871 1.72 0.085 40 0.0057 0.0754 1.99 0.047 50 0.0046 0.0675 2.22 0.026

Effect of sample size: a simulation True mean: 100 True SD: 15 True mean: 100 True SD: 35 Sample size Est. M SD Est. M SD 10 98.0 11.0 108.9 32.2 50 100.4 13.6 95.3 41.4 100 101.3 14.4 99.1 35.5 200 99.9 15.2 100.3 33.2 500 99.8 15.3 98.9 33.8 1000 99.5 15.1 99.9 35.0 2000 99.7 15.0 99.9 34.7 10000 100.1 15.0 99.9 35.0 100000 100.0 15.0 100.0 35.0

What are required for sample size estimation? Parameter (or outcome) of major interest Magnitude of difference in the parameter Variability of the parameter Bound of errors (type I and type II error rates)

Parameter of interest Type of measurement of primary interest: Continuous or categorical outcome Examples: Proportion: proportion (or probability) of preference for a product Hedotic scale: 0-10 Nominal scale

Variability of the parameter of interest If the parameter is a continuous variable: What is the standard deviation (SD) ? If the parameter is a categorical variable: SD can be estimated from the proportion/probability.

Magnitude of difference of interest Distinction between practical and statistical relevance. Examples: Probability of preference: 85% vs 50% Tasting scores: difference between products by 1 SD.

The Normal distribution 0.95 0.95 Z2 Z1 -1.96 1.96 1.64 0.05 0.025 0.025 Prob. Z1 Z2 0.80 0.84 1.28 0.90 1.28 1.64 0.95 1.64 1.96 0.99 2.33 2.81

The Normal deviates Alpha Za Za/2 0.20 0.84 1.28 0.10 1.28 1.64 c (One-sided) (Two-sided) 0.20 0.84 1.28 0.10 1.28 1.64 0.05 1.64 1.96 0.01 2.33 2.81 Power Z1-b 0.80 0.84 0.90 1.28 0.95 1.64 0.99 2.33

Study design and outcome Continuous measurement Categorical outcome Correlation Single population Two populations

Single group

Sample size for estimating a population mean How close to the true mean Confidence around the sample mean Type I error. N = (Za/2)2 s2 / d2 s: standard deviation d: the accuracy of estimate (how close to the true mean). Za/2: A Normal deviate reflects the type I error. Example: we want to estimate the average weight in a population, and we want the error of estimation to be less than 2 kg of the true mean, with a probability of 95% (e.g., error rate of 5%). N = (1.96)2 s2 / 22

Effect of standard deviation Std Dev (s) Sample size 10 96 12 138 14 188 16 246 18 311 20 384

Sample size for estimating a population proportion How close to the true proportion Confidence around the sample proportion. Type I error. N = (Za/2)2 p(1-p) / d2 p: proportion to be estimated. d: the accuracy of estimate (how close to the true proportion). Za/2: A Normal deviate reflects the type I error. Example: The proportion of preference for product A is around 80%. We want to estimate the preference p in a community within 5% with 95% confidence interval. N = (1.96)2 (0.8)(0.2) / 0.052 = 246 consumers.

Effect of accuracy N = (1.96)2 (0.3)(0.7) / 0.022 = 2017 subjects. Example: The proportion of preference in the general population is around 30%. We want to estimate the prevalence p in a community within 2% with 95% confidence interval. N = (1.96)2 (0.3)(0.7) / 0.022 = 2017 subjects.

Sample size for estimating a correlation coeffcient In observational studies which involve estimate a correlation (r) between two variables of interest, say, X and Y, a typical hypothesis is of the form: Ho: r = 0 vs H1: r not equal to 0. The test statistic is of the Fisher's z transformation, which can be written as: Where n is the sample size and r is the observed correlation coefficient. It can be shown that t is normally distributed with mean 0 and unit variance, and the sample size to detect a statistical significance of t can be derived as:

Sample size for estimating r: example Example: According to the literature, the correlation between salt intake and systolic blood pressure is around 0.3. A study is conducted to test the correlation in a population, with the significance level of 1% and power of 90%. The sample size for such a study can be estimated as follows: A sample size of at least 87 subjects is required for the study.

Sample size for difference between two means Hypotheses: Ho: m1 = m2 vs. Ha: m1 = m2 + d Let n1 and n2 be the sample sizes for group 1 and 2, respectively; N = n1 + n2 ; r = n1 / n2 ; s: standard deviation of the variable of interest. Then, the total sample size is given by: If we let Z = d / s be the “effect size”, then: If n1 = n2 , power = 0.80, alpha = 0.05, then (Za + Z1-b)2 = (1.96 + 1.28)2 = 10.5, then the equation is reduced to: Where Za and Z1-b are Normal deviates

Two-group comparisons

Sample size for two means vs.“effect size” For a power of 80%, significance level of 5%

Sample size for difference between 2 proportions Hypotheses: Ho: p1 = p2 vs. Ha: p1 = p2 + d . Let p1 and p2 be the sample proportions (e.g. estimates of p1 and p2) for group 1 and group 2. Then, the sample size to test the hypothesis is: Where: n = sample size for each group ; p = (p1 + p2) / 2 ; Za and Z1-b are Normal deviates A better (more conservative) suggestion for sample size is:

Sample size for difference between 2 prevalence For most diseases, the prevalence in the general population is small (e.g. 1 per 1000 subjects). Therefore, a difference formulation is required. Let p1 and p2 be the prevalence for population 1 and population 2. Then, the sample size to test the hypothesis is: Where: n = sample size for each group; Za and Z1-b are Normal deviates.

Sample size for two proportions: example Example: The preference for product A is expected to be 70%, and for product B 60%. A study is planned to show the difference at the significance level of 1% and power of 90%. The sample size can be calculated as follows: p1 = 0.6; p2 = 0.7; p = (0.6 + 0.7)/2 = 0.65; Z0.01 = 2.81; Z1-0.9 = 1.28. The sample size required for each group should be: Adjusted / conservative sample size is:

Sample size for two proportions vs. effect size Difference from p1 by: 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 P1 0.1 424 131 67 41 28 19 14 10 0.2 625 173 82 47 30 20 14 9 0.3 759 198 89 50 30 19 13 8 0.4 825 206 89 47 28 17 . . 0.5 825 198 82 41 22 . . . 0.6 759 173 67 31 . . . . 0.7 625 131 45 . . . . . 0.8 424 73 . . . . . . Note: these values are “unadjusted” sample sizes

Sample size for estimating an odds ratio In case-control study the data are usually summarized by an odds ratio (OR), rather then difference between two proportions. If p1 and p2 are the proportions of cases and controls, respectively, exposed to a risk factor, then: If we know the proportion of exposure in the general population (p), the total sample size N for estimating an OR is: Where r = n1 / n2 is the ratio of sample sizes for group 1 and group2; p is the prevalence of exposure in the controls; and OR is the hypothetical odds ratio. If n1 = n2 (so that r = 1) then the fomula is reduced to:

Sample size for an odds ratio: example Example: The prevalence of vertebral fracture in a population is 25%. It is interested to estimate the effect of smoking on the fracture, with an odds ratio of 2, at the significance level of 5% (one-sided test) and power of 80%. The total sample size for the study can be estimated by:

Sample size for 2 correlation coefficients In detecting a relevant difference between two correlation coefficients r1 and r2 obtained from two independent samples of sizes n1 and n2, respectively, we need to firstly transform these coefficients into z value as follows: The total sample size N required to detect the difference between two correlation coefficients r1 and r2, with a significance level of a and power 1-b, can be estimated by: Where Za and Z1-b are Normal deviates

Sample size for two r’s: example The sample size required to detect the difference between r1 = 0.8 and r2 = 0.4 with the significance level of 5% (two-tailed) and power of 80% can be solved as follows: z1 = 0.5 ln ((1+0.4) / (1-0.4)) = 0.424 z1 = 0.5 ln ((1+0.8) / (1-0.8)) = 1.098 46 subjects is needed in each group.

Some comments The formulae presented are theoretical. They are all based on the assumption of Normal distribution. The estimator [of sample size] has its own variability. The calculated sample size is only an approximation. Non-response must be allowed for in the calculation.

Computer programs Software program for sample size and power evaluation PS (Power and Sample size), from Vanderbilt Medical Center. This can be obtained from me by sending email to (t.nguyen@garvan.org.au). Free. On-line calculator: http://ebook.stat.ucla.edu/calculators/powercalc/ References: Florey CD. Sample size for beginners. BMJ 1993 May 1;306(6886):1181-4 Day SJ, Graham DF. Sample size and power for comparing two or more treatment groups in clinical trials. BMJ 1989 Sep 9;299(6700):663-5. Miller DK, Homan SM. Graphical aid for determining power of clinical trials involving two groups. BMJ 1988 Sep 10;297(6649):672-6 Campbell MJ, Julious SA, Altman DG. Estimating sample sizes for binary, ordered categorical, and continuous outcomes in two group comparisons. BMJ 1995 Oct 28;311(7013):1145-8. Sahai H, Khurshid A. Formulae and tables for the determination of sample sizes and power in clinical trials for testing differences in proportions for the two-sample design: a review. Stat Med 1996 Jan 15;15(1):1-21. Kieser M, Hauschke D. Approximate sample sizes for testing hypotheses about the ratio and difference of two means. J Biopharm Stat 1999 Nov;9(4):641-50.