Sample Size & Power Estimation Computing for Research April 9, 2013.

Slides:



Advertisements
Similar presentations
High Resolution studies
Advertisements

The t Test for Two Independent Samples
Introduction to Hypothesis Testing
Name: Date: Read temperatures on a thermometer Independent / Some adult support / A lot of adult support
Overview of Lecture Partitioning Evaluating the Null Hypothesis ANOVA
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Multiple-choice question
7.1 confidence Intervals for the Mean When SD is Known
Intro to Statistics Part2 Arier Lee University of Auckland.
Chapter 17/18 Hypothesis Testing
Hypothesis Test II: t tests
You will need Your text Your calculator
Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis.
Independent Measures T-Test Quantitative Methods in HPELS 440:210.
HYPOTHESIS TESTING. Purpose The purpose of hypothesis testing is to help the researcher or administrator in reaching a decision concerning a population.
CHAPTER 16 Life Tables.
II. Potential Errors In Epidemiologic Studies Random Error Dr. Sherine Shawky.
Hypothesis Tests: Two Independent Samples
Study Size Planning for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
Understanding p-values Annie Herbert Medical Statistician Research and Development Support Unit
Comparing Two Population Parameters
Hypothesis Testing and Sample Size Calculation Po Chyou, Ph. D. Director, BBC.
McGraw-Hill, Bluman, 7th ed., Chapter 9
Statistics for the Social Sciences Psychology 340 Spring 2005 Using t-tests.
Fractions Simplify: 36/48 = 36/48 = ¾ 125/225 = 125/225 = 25/45 = 5/9
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Testing Hypotheses About Proportions
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Testing the Difference between Proportions Section 11.3.
Sample size estimation
Designing Clinical Research Studies An overview S.F. O’Brien.
Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
Departments of Medicine and Biostatistics
Statistical Decision Making
Estimation of Sample Size
Sample Size Estimation
AP Statistics – Chapter 9 Test Review
Topic 6: Introduction to Hypothesis Testing
Sample Size Determination In the Context of Hypothesis Testing
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.
Sample Size Determination
The t Tests Independent Samples.
Sample Size Determination Ziad Taib March 7, 2014.
Mid-semester feedback In-class exercise. Chapter 8 Introduction to Hypothesis Testing.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 7: Gathering Evidence for Practice.
CI - 1 Cure Rate Models and Adjuvant Trial Design for ECOG Melanoma Studies in the Past, Present, and Future Joseph Ibrahim, PhD Harvard School of Public.
1 Clinical Investigation and Outcomes Research Statistical Issues in Designing Clinical Research Marcia A. Testa, MPH, PhD Department of Biostatistics.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
Statistical Power and Sample Size Calculations Drug Development Statistics & Data Management July 2014 Cathryn Lewis Professor of Genetic Epidemiology.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
Academic Research Academic Research Dr Kishor Bhanushali M
CHAPTER 9 Testing a Claim
1 Keaven Anderson, Ph.D. Amy Ko, MPH Nancy Liu, Ph.D. Yevgen Tymofyeyev, Ph.D. Merck Research Laboratories June 9, 2010 Information-Based Sample Size Re-estimation.
How to do Power & Sample Size Calculations Part 1 **************** GCRC Research-Skills Workshop October 18, 2007 William D. Dupont Department of Biostatistics.
Chapter ?? 7 Statistical Issues in Research Planning and Evaluation C H A P T E R.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
SECTION 1 TEST OF A SINGLE PROPORTION
Core Research Competencies:
Applied Biostatistics: Lecture 4
BIOST 513 Discussion Section - Week 10
Hypothesis Testing: One Sample Cases
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
Chapter 9 Hypothesis Testing.
Sample size.
Interpreting Epidemiologic Results.
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Professor Dept. of Family & Community Medicine College.
Statistical Power.
Presentation transcript:

Sample Size & Power Estimation Computing for Research April 9, 2013

General Comments Can consume much of a collaborative biostatistician’s time Really only relevant in the context of hypothesis testing and in estimation of precision If there are multiple Aims within a proposal, make sure that each is properly powered. It can be helpful to perform computations in two or more different software programs.

More General Comments Can be somewhat of an art form Before proposing a sample size, get a sense from the other investigators what sample sizes are even feasible (know resource limitations). Make sure you understand the hypotheses that are to be tested. Make sure you understand the study design.

More General Comments A well-written sample size estimation section in a grant can convince the reviewers that you know what you’re doing. A poorly-written sample size estimation section in a grant can convince the reviewers that you don’t know what you’re doing. Sometimes PIs will calculate a sample size on their own. Double check these, and make sure their rationale is sound. Don’t be afraid to ask how they arrived at their estimate.

Understanding the Term “Effect Size” In a very general sense, this is the magnitude of the summary statistic you plan to use for your hypothesis test – Difference in means – Difference in proportions – Odds ratio, Risk ratio – Correlation

Understanding the Term “Effect Size” Often this refers to Cohen’s D: – Small: 0.2 – Medium: 0.5 – Large: >0.8 An effect size of 1 is equivalent of a 1 standard deviation unit difference between groups. Can be helpful when trying to justify a sample size when little pilot data exist. Ex. “With 20 subjects per group, we’ll be able to detect an effect size of 0.9 (i.e. a large effect) with 80% power, assuming 2-sided hypothesis testing and an alpha level of 0.05.”

Software Free (online, downloadable) – careful! Moderately priced Expensive

Sample Size Survey Results* 14 Faculty – PhD 7 Faculty – RA 9 Students * paul nietert’s results from 2011

Sample size software used

Examples GPower R Stata Power and Precision Power (Simon two-stage design) Simulation (simple independent sample T-test example)

Two sample t-test A scientist wants to compare tumor size at 12 weeks in two groups of mice. Based on preliminary data, the expects the average tumor size to be 400mm2 in the control and states that a 50% decrease in mean tumor size would be relevant. Based on the preliminary data, the SD of tumor size is estimated to be 150 mm2 What is the effect size? Assume 80% power, alpha of What is the required sample size?

. sampsi , sd(150) Estimated sample size for two-sample comparison of means Test Ho: m1 = m2, where m1 is the mean in population 1 and m2 is the mean in population 2 Assumptions: alpha = (two-sided) power = m1 = 400 m2 = 200 sd1 = 150 sd2 = 150 n2/n1 = 1.00 Estimated required sample sizes: n1 = 12 n2 = 12

nNumber of observations (per group) deltaTrue difference in means sdStandard deviation sig.levelSignificance level (Type I error probability) powerPower of test (1 minus Type II error probability) typeType of t test alternativeOne- or two-sided test strictUse strict interpretation in two-sided case R: Power calculations for one and two sample t tests Description Compute power of test, or determine parameters to obtain target power. power.t.test(n = NULL, delta = NULL, sd = 1, sig.level = 0.05, power = NULL,type = c("two.sample", "one.sample", "paired"), alternative = c("two.sided", "one.sided"), strict = FALSE)

One sample test of proportion A clinical trial is being planned in a cancer patient population. The standard of care response rate is The new treatment would be considered worth further study if the response rate were What is the needed sample size for a one arm trial to detect this response rate with 90% power using a one-sided alpha of 0.10?

. sampsi , onesample Estimated sample size for one-sample comparison of proportion to hypothesized value Test Ho: p = , where p is the proportion in the population Assumptions: alpha = (two-sided) power = alternative p = Estimated required sample size: n = 50

Simon two-stage design A study design appropriate for a binary endpoint that is quick to evaluate in a single arm study. Ethical it is appropriate to consider early stopping for futility. It is quite rare to NOT include interim analyses to consider stopping for ethical reasons. Simon, Controlled Clinical Trials, 1989.

Two-Stage Designs What if by the 15 th patient you’ve seen no responses? Is it worth proceeding? Maybe you should have considered a design with an early stopping rule Two-stage designs: Stage 1: enroll N 1 patients X 1 or more respond Stage 2: Enroll an additional N 2 patients Stop trial Fewer than X 1 respond

Example An investigator wishes to investigate a 2-way interaction between 2 risk factors (RF) for a disease. – The prevalence of RF1 and RF2 is 20% and 30%, respectively. 5% have both RF1 and RF2. – The baseline rate of developing disease within a year is known to be 10% (no RFs). – The RR of developing disease within 1 year associated with each of the RFs is 1.5, but the hypothesis is that if both RF1 and RF2 are present, the RR is 5.0. – How many subjects are needed to detect this interaction effect?

Example (Cont.): Calculations Solve for Prevalence Estimates RF2 RF1+-Total +5%20% - Total30%100%

Example (Cont.): Calculations Solve for Prevalence Estimates RF2 RF1+-Total +5%15%20% -25%55%80% Total30%70%100%

Example (Cont.): Calculations Solve for RR Estimates RF2 RF1+-Total +5% (Risk = 50%) 15%20% -25%55% (Risk = 10%) 80% Total30%70%100%

Example (Cont.): Calculations Solve for RR Estimates RF2 RF1+-Total +5% (Risk = 50%) 15%20% (Risk = 15%) -25%55% (Risk = 10%) 80% Total30% (Risk = 15%) 70%100%

Example (Cont.): Calculations Solve for RR Estimates RF2 RF1+-Total +5% (Risk = 50%) 15% (Risk = 3.33%) 20% (Risk = 15%) -25% (Risk = 8%) 55% (Risk = 10%) 80% Total30% (Risk = 15%) 70%100%

Example (Cont.): Calculations Solve for RR Estimates RF2 RF1+-Total +50 (Risk = 50%) (n=25 of 50 Diseased) 150 (Risk = 3.33%) (n=5 of 150 Diseased) 200 (Risk = 15%) (n=30 of 200 Diseased) -250 (Risk = 8%) (n=20 of 250 Diseased) 550 (Risk = 10%) (n=55 of 550 Diseased) 800 Total300 (Risk = 15%) (n=45 of 300 Diseased)

R Simulation Code