INTRODUCTION TO CLINICAL RESEARCH Introduction to Statistical Inference Karen Bandeen-Roche, Ph.D. July 12, 2010.

Slides:



Advertisements
Similar presentations
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Advertisements

Inference Sampling distributions Hypothesis testing.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Probability Population:
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
INTRODUCTION TO CLINICAL RESEARCH Scientific Concepts for Clinical Research Karen Bandeen-Roche, Ph.D. July 12, 2010.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview Definition Hypothesis
Confidence Intervals and Hypothesis Testing - II
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
6 - 1 © 2000 Prentice-Hall, Inc. A First Course in Business Statistics Inferences Based on a Single Sample: Tests of Hypothesis Chapter 6.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 21: More About Tests “The wise man proportions his belief to the evidence.” -David Hume 1748.
Essential Statistics Chapter 131 Introduction to Inference.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
1 The Scientist Game Chris Slaughter, DrPH (courtesy of Scott Emerson) Dept of Biostatistics Vanderbilt University © 2002, 2003, 2006, 2008 Scott S. Emerson,
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Hypotheses tests for means
Testing of Hypothesis Fundamentals of Hypothesis.
CHAPTER 17: Tests of Significance: The Basics
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 9 Testing a Claim
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
A significance test or hypothesis test is a procedure for comparing our data with a hypothesis whose truth we want to assess. The hypothesis is usually.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
1 Probability and Statistics Confidence Intervals.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
+ Testing a Claim Significance Tests: The Basics.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Test of a Population Median. The Population Median (  ) The population median ( , P 50 ) is defined for population T as the value for which the following.
15 Inferential Statistics.
Unit 5: Hypothesis Testing
CHAPTER 9 Testing a Claim
When we free ourselves of desire,
CHAPTER 9 Testing a Claim
Significance Tests: The Basics
Significance Tests: The Basics
CS639: Data Management for Data Science
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

INTRODUCTION TO CLINICAL RESEARCH Introduction to Statistical Inference Karen Bandeen-Roche, Ph.D. July 12, 2010

Acknowledgements Scott Zeger Marie Diener-West ICTR Leadership / Team July 2010JHU Intro to Clinical Research2

July 2010Intro to Clinical Research Introduction to Statistical Inference 1.The game of statistical inference 2.Three key questions – three approaches to inference: likelihood; Bayesian; frequentist 3.Frequentist statistics An overview: Sampling variability Tests of hypotheses – p-values Uncertainty in estimation – confidence intervals 4. Brief introduction to data scales

4 What Is Biostatistics? The science of learning from biomedical data involving appreciable variability or uncertainty

July 2010Intro to Clinical Research Game of Statistical Inference: “Search for Truth” Truth for Population Observed Value for a Representative Sample Probability Statistical inference

July 2010Intro to Clinical Research Statistical Inference An approach for using information about the sample to make statements about the population Which population? The one from which your sample was randomly drawn The one for which your sample is reasonably thought to be representative.

July 2010Intro to Clinical Research Three Questions in Clinical Research What is the evidence in the data? –Likelihood function measures evidence What do we believe in light of the evidence? – Combine prior with likelihood to obtain posterior distribution What should we decide to do in light of the evidence? – frequentist statistics for decision rules – controlling types of errors

July 2010Intro to Clinical Research Example: Risperidone versus Haloperidol for Prevention of Schizophrenic Symptoms Randomized trial comparing risperidone to haloperidol for time- to-relapse in schizophrenic patients Reference: Csernansky, et al NEJM 367 patients followed for up to 800 days Hypothesis: relative to haloperidol, risperidone will prolong the time to relapse of schizophrenic symptoms Focus on binary indicator of relapse or not at 400 days for now

July 2010Intro to Clinical Research Three Questions What is the evidence from the Czernansky data that risperidone prolongs time to relapse relative to haloperidol? What do we believe about this difference in light of this evidence? Should the FDA give J&J an “indication” that risperidone is better than haloperidol at delaying time to relapse?

July 2010Intro to Clinical Research Evidence Measured by the “likelihood function” Is always relative: supporting one hypothesis relative to another Is what science generates Is used to update prior beliefs to address question 2

July 2010Intro to Clinical Research Data = Evidence RispHalTotal Relapse No Relapse Total x2 Table – Relapse within 400 days; from Figure 1 in Csernansky, et al. 2002

July 2010Intro to Clinical Research Observed Relative Rate of Relapse Risperidone: Relapse rate = 41/177 = Haloperidol: Relapse rate = 67/188 = Relative rate = Risp rate/Hal rate = 41/177 / 67/188 = 0.23/0.35 =

July 2010Intro to Clinical Research Likelihood Function for Relative Rate of Relapse Truth: rate of relapse for haloperidol; relative rate for risperidone versus haloperidol Probability model: an equation describing the probability of observing particular rates in one’s sample if their population has certain characteristics Statistical inference: use data above to make statements about true relative rate

July 2010Intro to Clinical Research Likelihood Function is the Probability of the Observed Data as a Function of True Parameters (Rate for halo ; relative rate) p = rate for halo rr = relative rate The likelihood function of the data is:

July 2010Intro to Clinical Research Likelihood as a function of RR Maximum Likelihood Estimate

July 2010Intro to Clinical Research Likelihood as a function of RR Range of consistent values

July 2010Intro to Clinical Research Relapse Data – Half and Twice the Sample Size RispHalTotal Relapse No Relapse Total RispHalTotal Relapse No Relapse Total HalfTwice

July 2010Intro to Clinical Research Likelihood Functions and Sample Size Twice Half

July 2010Intro to Clinical Research Notes on Likelihood Function “Best estimate” of RR is the one that maximizes the (profile) likelihood function – “maximum likelihood estimate” or mle The “spread” of the likelihood function reflects the strength of evidence – more concentrated means stronger evidence The more data, the more concentrated the likelihood function becomes around the true value

July 2010Intro to Clinical Research Summary of Evidence Value for the true, but unknown relative risk that makes the observed data most likely is 0.65, that is 35% fewer relapses for patients on risperidone A range of values for the true relative risk that make the data more than 1/32 times as likely as the best value (0.65) is (0.41 to 1.00). The data are 50 times most likely if the relative risk is 0.65 than if it is 1.0.

July 2010Intro to Clinical Research Bayesian Methods On the ascendance – see them at the FDA in near future Treat unknowns (true relative rate) as random and give probability distributions as measures of belief More in future short-courses

July 2010Intro to Clinical Research Frequentist Inference Big idea: imagine what other data might have occurred under particular hypotheses Example: assume risperidone is identical to haloperidol in rate of relapse –Relative rate = 1.0 Calculate the distribution of sample relative rates under this assumption Compare what actually occurred with what tends to occur when relative rate is 1.0

July 2010Intro to Clinical Research Procedure Play Supreme Ruler: create a world in which relative rate is really 1.0 Use computer to generate lots of data sets with same numbers of persons per group and same average rate of relapse as observed in our data (108/365) ; calculate sample relative rate for each Make a distribution of relative rate estimates over all data sets: “null distribution” for relative rate estimate

July 2010Intro to Clinical Research First Simulated Data Set RispHalTotal Relapse No Relapse Total Relative risk for this data: 56/177 / 52/188 = 1.14

July 2010Intro to Clinical Research Repeat 1000 Times Relative Rates:

July 2010Intro to Clinical Research What Have We Learned? We rarely get observed relative rates as small as 0.65 in a study the size of this one when the truth is that risperidone is not different than haloperidol (true relative rate = 1.0) –Here, as small or smaller than 0.65 ~ 1.8% of samples We therefore “reject” the null hypothesis that the true relative rate is 1.0 in favor of the alternative hypothesis that it is less than 1.0

July 2010Intro to Clinical Research What Have We Learned? When the relative risk=1.0 (null hypothesis is true), the probability of observing a sample relative rate of 0.65 or smaller is When the relative risk=1.0 (null hypothesis is true), the probability of observing a sample relative rate of 0.65 or smaller, or 1/0.65=1.54 or bigger, is Two-sided p-value =0.035

July 2010Intro to Clinical Research p-value The probability of observing a test statistic as or more extreme than occurred in sample under the null hypothesis Probability: How often as we imagine what other data might have occurred in our sample under particular hypotheses

July 2010Intro to Clinical Research p-value The probability of observing a test statistic as or more extreme than occurred in sample under the null hypothesis As or more extreme: as different or more different from the hypothesized value This is “two sided”; if “one-sided”, “probability of … a … statistic larger than…

July 2010Intro to Clinical Research p-value The probability of observing a test statistic as or more extreme than occurred in sample under the null hypothesis Under the null hypothesis: If the sample truly comes from the population we hypothesize

July 2010Intro to Clinical Research p-value: “p” is for: Publish Perish if you don’t publish aPProval of your drug by FDA Promotion to Vice-President for Research

July 2010Intro to Clinical Research Hypothesis Testing Assume a null hypothesis – straw man to knock down (no difference between risp and hal or rr=1) Choose a test statistic (relative risk) Compute the null distribution (here, by simulation; often, by probability model) Calculate p-value Reject the null if p-value less than a selected (“alpha”) level – usually 0.05

July 2010Intro to Clinical Research Repeat 1000 Times Relative Rates:

July 2010Intro to Clinical Research A name for the “distribution” of values that might have occurred: SAMPLING DISTRIBUTION These can be computed “under the null hypothesis” or under any other hypothesis There is a way to approximate it for one’s data— that is, for the population truly underlying one’s sample: “Bootstrapping”

July 2010Intro to Clinical Research Another Frequentist Method: “Bootstrapping” Imagine that the sample is the whole population Repeat the experiment a large number of times to obtain a distribution for the statistic of interest –Draw a bootstrap sample of 177 risp and 188 hal patients from the original sample, now the population –Calculate the relative risk –Repeat 1000s of times Find a central interval that includes 95% of relative risks: “confidence interval”

July 2010Intro to Clinical Research First Simulated Data Set RispHalTotal Relapse No Relapse Total Relative risk for this data: 44/177 / 54/188 = 0.87

July 2010Intro to Clinical Research Repeat 1000 Times Relative Rate:

July 2010Intro to Clinical Research Confidence Interval for True Relative Risk We are 95% confident that the interval: 0.45, 0.89 includes the true, unknown value In frequentist statistics, the sample and hence particular interval is random. The truth is fixed We don’t say: “the probability that the true value is between 0.45 to 0.89 is 0.95.”

July 2010Intro to Clinical Research Main Points Once Again - Inference The game of statistical inference is to use data to make statements about how the world works Three key questions: 1.What is the evidence – likelihood function; 2.What do we believe? – Bayesian methods 3.What do we decide - frequentist methods Frequentist statistics Tests of hypotheses – p-values Confidence intervals via bootstrapping

1/5/2016Session 1: Quant Analysis - Measuring Clinical Effects 40 Scales and Types of Data Scales –Numeric Continuous – no gaps in the possible measurements Otherwise: discrete (e.g. counts) –Categorical Binary, dichotomous – two categories Polytomous – more than two Ordinal – ordered categories (e.g. satisfaction ratings)

1/5/2016Session 1: Quant Analysis - Measuring Clinical Effects 41 Scales and Types of Data A few important categories of data –“Measures” – determined relatively precisely –Constructs – measurable only up to considerable error –Times-to-events – may be “censored”—only known to be later (or earlier) than a certain point : later in course –Variable – label for one aspect of data collected Why this all matters Data scale and type partially determine the correct analysis to use!

Typical Clinical Study with Time to Event Outcome StartEnd EnrollmentEnd Study Calendar time Loss to Follow-up Event