Sample Size Determination Ziad Taib March 7, 2014.

Slides:

Advertisements

Similar presentations

Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis.

Advertisements

Estimation of Means and Proportions

Sample size estimation

Lecture 3 Outline: Thurs, Sept 11 Chapters Probability model for 2-group randomized experiment Randomization test p-value Probability model for.

LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.

EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.

Elementary hypothesis testing

Sample size computations Petter Mostad

9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.

BCOR 1020 Business Statistics Lecture 22 – April 10, 2008.

Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.

Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.

Inferences About Process Quality

Chapter 9 Hypothesis Testing.

BCOR 1020 Business Statistics

5-3 Inference on the Means of Two Populations, Variances Unknown

Sample Size Determination

Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.

Power and Sample Size Part II Elizabeth Garrett-Mayer, PhD Assistant Professor of Oncology & Biostatistics.

Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.

AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.

AM Recitation 2/10/11.

McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.

Overview of Statistical Hypothesis Testing: The z-Test

Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.

Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,

Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.

Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.

Fundamentals of Hypothesis Testing: One-Sample Tests

The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.

Inference for a Single Population Proportion (p).

1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.

1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.

Statistical Power and Sample Size Calculations Drug Development Statistics & Data Management July 2014 Cathryn Lewis Professor of Genetic Epidemiology.

Comparing Two Proportions

Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.

Chapter 9: Testing Hypotheses

PARAMETRIC STATISTICAL INFERENCE

9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.

Inferential Statistics 2 Maarten Buis January 11, 2006.

Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.

Chapter 22: Comparing Two Proportions

Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.

통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.

RDPStatistical Methods in Scientific Research - Lecture 41 Lecture 4 Sample size determination 4.1 Criteria for sample size determination 4.2 Finding the.

McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.

Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry

1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.

Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.

Statistical Analysis II Lan Kong Associate Professor Division of Biostatistics and Bioinformatics Department of Public Health Sciences December 15, 2015.

© Copyright McGraw-Hill 2004

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.

Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 26 Chapter 11 Section 1 Inference about Two Means: Dependent Samples.

Sample Size Determination

Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,

Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.

BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.

Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.

Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,

HYPOTHESIS TESTING.

Sample Size Determination

BIOST 513 Discussion Section - Week 10

Two-Sample Hypothesis Testing

How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.

Chapter 8: Inference for Proportions

Hypothesis Testing: Hypotheses

9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE

Chapter 9 Hypothesis Testing.

Type I and Type II Errors

Presentation transcript:

Sample Size Determination Ziad Taib March 7, 2014

(from ICH-E9) “ The number of subjects in a clinical study should always be large enough to provide a reliable answer to the question(s) addressed.” “The sample size is usually determined by the primary objective of the trial.” “ Sample size calculation should be explicitly mentioned in the protocol.”

Power and sample size Suppose we want to test if a drug is better than a placebo, or if a higher dose is better than a lower dose. Sample size: How many patients should we include in our clinical trial, to give ourselves a good chance of detecting any effects of the drug? Power: Assuming that the drug has an effect, what is the probability that our clinical trial will give a significant result?

Sample Size and Power Sample size is contingent on design, analysis plan, and outcome With the wrong sample size, you will either –Not be able to make conclusions because the study is “underpowered” –Waste time and money because your study is larger than it needed to be to answer the question of interest

Sample Size and Power With wrong sample size, you might have problems interpreting your result: –Did I not find a significant result because the treatment does not work, or because my sample size is too small? –Did the treatment REALLY work, or is the effect I saw too small to warrant further consideration of this treatment? –Issue of CLINICAL versus STATISTICAL significance

Sample Size and Power Sample size ALWAYS requires the investigator to make some assumptions –How much better do you expect the experimental therapy group to perform than the standard therapy groups? –How much variability do we expect in measurements? –What would be a clinically relevant improvement? The statistician CANNOT tell what these numbers should be It is the responsibility of the clinical investigators to define these parameters

Errors with Hypothesis Testing Ho: E=C (ineffective) H1: E≠C (effective)

Type I Error Concluding for alternative hypothesis while null- hypothesis is true (false positive) Probability of Type I error = significance level or  level Value needs to be specified in the protocol and should be small Explicit guidance from regulatory authorities, e.g One-sided or two-sided ? H o : ineffective vs. H a : effective

Type II Error Concluding for null-hypothesis while alternative hypothesis is true (false negative) Probability of Type II error =  level Value to be chosen by the sponsor, and should be small No explicit guidance from regulatory authorities, e.g. 0.20, or 0.10 Power =1 -  = 1 - Type II error rate = P(reject H 0 when H a is true) =P(concluding that the drug is effective when it is) Typical values 0.80 or 0.90

Sample Size Calculation Following items should be specified –a primary variable –the statistical test method –the null hypothesis; the alternative hypothesis; the study design –the Type I error –the Type II error –way how to deal with treatment withdrawals

Three common settings 1.Continuous outcome: e.g., number of units of blood transfused, CD4 cell counts, LDL-C. 1.Binary outcome: e.g., response vs. no response, disease vs. no disease 1.Time-to-event outcome: e.g., time to progression, time to death.

Continuous outcomes Easiest to discuss Sample size depends on –Δ: difference under the null hypothesis –α: type 1 error –β type 2 error –σ: standard deviation –r: ratio of number of patients in the two groups (usually r = 1)

One-sample Assume that we wish to study a hypothesis related to a sample of independent and identically distributed normal random variables with mean  and standard deviation . Assuming  is known, a (1-  )100% confidence interval for  is given by The maximum error in estimating  is which is a function of n. If we want to pre specify E, then n can be chosen according to

This takes care of the problem of determining the sample size required to attain a certain precision in estimating  using a confidence interval at a certain confidence level. Since a confidence interval is equivalent to a test, the above procedure can also be used to test a null hypothesis. Suppose thus that we wish to test the hypothesis –H 0 : μ 1 = μ 0 –H a : μ a > μ 0 with a significance level . For a specific alternative H 1 : μ a = μ 0 +  where  >0, the power of the test is given by

Power(sample size)

H 0 : μ 1 – μ 2 = 0 H 1 : μ 1 – μ 2 ≠ 0 With known variances  1 and  2, the power against a specific alternative μ 1 = μ 2 +  is given by Two samples

is normal so In the one sided case we have Power(sample size)

Variants of two-sample formula For testing the difference in two means, with (equal) variance and equal allocation to each arm: With Unequal allocation to each arm, where n2 = n1 2 2 If variance unknown, replace by t-distribution

0 We want to test H 0 : P = P 0 H a : P >P 1 with a certain power (1-  ) and with a certain significance level (  ). The required sample of size is One proportion 0 

We want to test H 0 : P 1 = P 2 H a : P 1 ≠ P 2 with a certain power (1-  ) and with a certain significance level (  ). The required sample of size is Two proportion ≠

Analysis of variance We want to to test H 0 :  1 =  2 = …=  k H a : at least one  i is not zero Take: The last equation is difficult to solve but there are tables. Power(sample size)

StudySize

Survival analysis Assume we plan a RCT aiming at comparing a new treatment (drug) with an old one (control). Such a comparison can be made in terms of the hazard ratio or some function of this. Let d stand for the drug and c for control In the sequel we describe the sample size required to demonstrate that the drug is better than the control treatment. This will be made under some assumptions in two steps

Step 1: specify the number of events needed Step 2: specify the number of patients needed to obtain the “the right” number of events. Step 1 In what follows we will use the notation below to give a formula specifying the number of events needed to have a certain asymptotic power in many different cases like The log-rank test – the score test – parametric tests based on the exponential distribution – etc.

It is not clear how  should estimated. The difficulty lies in the fact that this depends on the exact situation at hand. We illustrate that through some examples

EXAMPLE 1 Assume that our observations follow exp( ) under the old drug (control) and exp( exp  ) under the new drug. Then the ratio of medians of the two distributions will be To be able to discover a 50% increase in the median we take

1.It is possible to dress a table specifying  as a function of the remaing parameters when  =0.05 and p=q and the test is two sided. 2.Observe that and lead to the same number of events. 3.The largest power is obtained when p=q. 4.When p and q are not equal divide the table value by 4pq. 5.For a one sided test, only C  needs to be changed from 1.96 to 1.64.

power The number of events needed for a certain power Under equal allocation.

Crossover Designs In this type of studies we are interested in testing a null hypothesis of the form –H 0 :  T =  P against –H a :  T  P. Period j=1 Arm k=1 Period j=2 Period j=1 Arm k=2 Arm k=1 T 11 T 12 T 22 T 21 Placebo Drug ≠ Crossover design

A general model we can use for the response of the i th individual in the k th arm at the j th period is where S and e are independent random effects with means zero and standard deviations  and  S. The above hypotheses can be tested using the statistic T d, which can also be used to determine the power of the test against a certain hypothesis of the type

Assuming equal sample sizes in both arms, and that  d can be estimated from old studies we have where  stands for the clinically meaningful difference. (*) can now be sued to determine n. (*)

Software For common situations, software is available Software available for purchase –NQuery –StudySize –PASS –Power and Precision –Etc….. Online free software available on web search