Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Sample size estimation
LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Inference Sampling distributions Hypothesis testing.
Introduction to Hypothesis Testing Chapter 8. Applying what we know: inferential statistics z-scores + probability distribution of sample means HYPOTHESIS.
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
1 Test a hypothesis about a mean Formulate hypothesis about mean, e.g., mean starting income for graduates from WSU is $25,000. Get random sample, say.
Behavioural Science II Week 1, Semester 2, 2002
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Hypothesis Testing: Type II Error and Power.
Research Curriculum Session III – Estimating Sample Size and Power Jim Quinn MD MS Research Director, Division of Emergency Medicine Stanford University.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Inference about a Mean Part II
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Chapter 8 Introduction to Hypothesis Testing
Sample Size Determination
Hypothesis Testing Is It Significant?. Questions What is a statistical hypothesis? What is the null hypothesis? Why is it important for statistical tests?
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 10 Hypothesis Testing
Hypothesis Testing – Introduction
Confidence Intervals and Hypothesis Testing - II
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 9 Introduction to Hypothesis Testing.
Introduction to Biostatistics and Bioinformatics
Fundamentals of Hypothesis Testing: One-Sample Tests
1/2555 สมศักดิ์ ศิวดำรงพงศ์
1 Virtual COMSATS Inferential Statistics Lecture-17 Ossam Chohan Assistant Professor CIIT Abbottabad.
Statistical Analysis Statistical Analysis
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Sample Size Determination Donna McClish. Issues in sample size determination Sample size formulas depend on –Study design –Outcome measure Dichotomous.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 10 Hypothesis Testing
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
STA Statistical Inference
Introduction to inference Use and abuse of tests; power and decision IPS chapters 6.3 and 6.4 © 2006 W.H. Freeman and Company.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Medical Statistics as a science
Education 793 Class Notes Decisions, Error and Power Presentation 8.
1 Hypothesis Testing A criminal trial is an example of hypothesis testing. In a trial a jury must decide between two hypotheses. The null hypothesis is.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
Statistical Core Didactic
Understanding Results
Hypothesis Testing – Introduction
Hypothesis Testing: Hypotheses
Chapter 9 Hypothesis Testing.
Presentation transcript:

Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010

Outline Example: lowering blood pressure Introduction to some statistical issues in sample size determination Two simple approximate formulas Descriptions of sample size calculations from the literature

Example Physicians design an intervention to reduce blood pressure in patients with high blood pressure But does it work? Need a study. How many participants are required? Too few: may not detect an effect even if there is one. Too many: may unnecessarily expose patients to risk.

The null hypothesis For intervention studies, the null hypothesis is usually this: on average there is no effect. “Innocent until proven guilty” The physicians who designed the intervention believe the null hypothesis is false. The study is designed to test the null hypothesis. Often write H 0 for the null hypothesis.

The study The population is considered to be all people who might be eligible for the intervention (might depend on age, other medical conditions, etc.) Study participants are viewed as a sample from this population. Suppose for each study participant we measure blood pressure at baseline, and after 6 weeks of intervention Outcome is change in blood pressure H 0 is that mean change in BP is 0.

Population vs. sample Population Population mean of the change in blood pressure Random sample Inference Sample mean of the change in blood pressure Calculation

Population distribution of change in blood pressure mean ± 1 standard deviation Probability distributions Recall that variance is the square of the standard deviation, often written as  

Population distribution of change in blood pressure

Sampling distribution of mean change in blood pressure (N=1)

Sampling distribution of mean change in blood pressure (N=2)

Sampling distribution of mean change in blood pressure (N=5)

Sampling distribution of mean change in blood pressure (N=10) Increasing sample size reduces the variability of the sample mean. standard deviation standard error SD N SE =

Variance and sample size As we’ve seen, increasing the sample size is akin to reducing the variance Equivalently, reducing the variance (e.g. using a more precise measurement device) can reduce the sample size requirements

Hypothesis test Sampling distribution of the mean under the null hypothesis, a.k.a. the null distribution

Hypothesis test Rejection region Observed mean Reject the null hypothesis if the observed mean is far in the tails of the null distribution, i.e. we have ruled out chance

Possible scenarios Based on the study findings we infer either … that the intervention has no effect (accept H 0 ) that the intervention has an effect (reject H 0 ) or

Possible scenarios In reality, either … the intervention has no effect (H 0 is true) the intervention has an effect (H 0 is false) Based on the study findings we infer either … that the intervention has no effect (accept H 0 ) that the intervention has an effect (reject H 0 ) or

Four possible scenarios In reality, either … the intervention has no effect (H 0 is true) the intervention has an effect (H 0 is false) Based on the study findings we infer either … that the intervention has no effect (accept H 0 ) that the intervention has an effect (reject H 0 ) or

Four possible scenarios In reality, either … the intervention has no effect (H 0 is true) the intervention has an effect (H 0 is false) Based on the study findings we infer either … that the intervention has no effect (accept H 0 ) Correctly accept H 0 that the intervention has an effect (reject H 0 ) or

Four possible scenarios In reality, either … the intervention has no effect (H 0 is true) the intervention has an effect (H 0 is false) Based on the study findings we infer either … that the intervention has no effect (accept H 0 ) Correctly accept H 0 that the intervention has an effect (reject H 0 ) Correctly reject H 0 or

Four possible scenarios In reality, either … the intervention has no effect (H 0 is true) the intervention has an effect (H 0 is false) Based on the study findings we infer either … that the intervention has no effect (accept H 0 ) Correctly accept H 0 that the intervention has an effect (reject H 0 ) Type-I errorCorrectly reject H 0 or

Four possible scenarios In reality, either … the intervention has no effect (H 0 is true) the intervention has an effect (H 0 is false) Based on the study findings we infer either … that the intervention has no effect (accept H 0 ) Correctly accept H 0 Type-II error that the intervention has an effect (reject H 0 ) Type-I errorCorrectly reject H 0 or

Type-I error If the null hypothesis is true, the rejection region of the test represents type-I error. The probability of type-I error is the area of the red region below, and is denoted by .

Type-II error Type-II error is failing to reject the null hypothesis when it is false. The probability of type-II error is denoted . It depends on how big the true effect is Sample size calculations require specification of an alternative hypothesis, which indicates the size of effect we would like to detect

Type-II error

Relationship between type-I and type-II error (alpha=0.05)

Relationship between type-I and type-II error (alpha=0.10)

Relationship between type-I and type-II error (alpha=0.20)

Relationship between type-I and type-II error Sample size calculations depend on the tradeoff between type-I and type-II error. We usually fix the probability of type-I error (alpha) at 5% and then try to minimize the probability of type-II error (beta). Define Power = 1 – beta We want to maximize power One way to do this is by increasing the sample size

How sample size affects power

Sample size (doubled)

Sample size (quintupled)

An approximate formula for the blood pressure example Suppose the variance in the change in blood pressure, sigma 2, is the same for the null and alternative hypotheses Suppose alpha is fixed at 0.05 and we use two- sided tests (allowing for the possibility that blood pressure could be either increased or decreased by the intervention) Then we will have approximately 80% power to detect a mean change in blood pressure delta if we enroll N participants, where N = 8 sigma 2 / delta 2 (approximately)

Example Suppose the standard deviation of the change in blood pressure is anticipated to be 7 mmHg (so the variance is 49) Suppose we fix alpha at 0.05 and we’d like to have approximately 80% power to detect a mean change of 5 mmHg Then we would need about 16 participants

When there are two groups So far, the example has used a single group of study participants Usually we want to compare two groups: a control group that receives “standard of care” or placebo, and an experimental group that receives a new intervention This is how most randomized controlled trials are set up In this case, delta is the difference between the means of the two groups. For simplicity, assume that the variance is the same in the two groups.

An approximate sample size formula for the case of two groups A similar approximate formula applies, again assuming alpha=0.05 and power=80%: N per group = 16 sigma 2 / delta 2 (approximately) Careful! This is the required sample size per group. Also, note that the constant is double what is was for the case of a single group. So the total sample size is 4 times as large.

Example Suppose we want to compare patients randomized to placebo with patients randomized to a new intervention Suppose the standard deviation is anticipated to again be 7 mmHg (so the variance is 49) Suppose we fix alpha at 0.05 and we’d like to have approximately 80% power to detect a change of 5 mmHg Then we would need about 32 participants per group, for a total of about 64 participants

Summary increases with variance decreases with size of effect to detect decreases with probability of type-I error, alpha decreases with probability of type-II error, beta Required sample size …

Sample size determination has many other aspects Different types of outcomes: dichotomous (e.g. mortality), time-to-event (e.g. survival time), etc. Different designs: observational studies (e.g. case-control), surveys, prevalence studies Practical considerations: e.g. costs, feasibility of recruitment

Questions?

α = Probability of type-I error (Rejecting the null hypothesis when it is in fact true.) Power = 1 – β (Rejecting the null hypothesis when it is in fact false.) Review: A comedy of errors … Probability of a false conviction Probability of a true conviction