COSBBI 2013-07-10 DBMI Roger Day Personalized medicine Clinical trials p. 1.

Slides:



Advertisements
Similar presentations
Chapter 7 Hypothesis Testing
Advertisements

Breakout Session 4: Personalized Medicine and Subgroup Selection Christopher Jennison, University of Bath Robert A. Beckman, Daiichi Sankyo Pharmaceutical.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
MAT 1000 Mathematics in Today's World. Last Time 1.What does a sample tell us about the population? 2.Practical problems in sample surveys.
Bayesian Adaptive Methods
Hypothesis Testing making decisions using sample data.
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Chapter 10: Hypothesis Testing
Analysis of frequency counts with Chi square
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
T-tests Computing a t-test  the t statistic  the t distribution Measures of Effect Size  Confidence Intervals  Cohen’s d.
Chapter 11: Sequential Clinical Trials Descriptive Exploratory Experimental Describe Find Cause Populations Relationships and Effect Sequential Clinical.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
BS704 Class 7 Hypothesis Testing Procedures
Statistics for Health Care
Sample Size Determination
Cancer Trials. Reading instructions 6.1: Introduction 6.2: General Considerations 6.3: Single stage phase I designs 6.4: Two stage phase I designs 6.5:
Sample Size Determination Ziad Taib March 7, 2014.
Power and Non-Inferiority Richard L. Amdur, Ph.D. Chief, Biostatistics & Data Management Core, DC VAMC Assistant Professor, Depts. of Psychiatry & Surgery.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Survival Curves Marshall University Genomics Core.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Multiple Choice Questions for discussion
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Psy B07 Chapter 8Slide 1 POWER. Psy B07 Chapter 8Slide 2 Chapter 4 flashback  Type I error is the probability of rejecting the null hypothesis when it.
Clinical Trials. What is a clinical trial? Clinical trials are research studies involving people Used to find better ways to prevent, detect, and treat.
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
Clinical Trials 2015 Practical Session 1. Q1: List three parameters (quantities) necessary for the determination of sample size (n) for a Phase III clinical.
BIOE 301 Lecture Seventeen. Guest Speaker Jay Brollier World Camp Malawi.
CI - 1 Cure Rate Models and Adjuvant Trial Design for ECOG Melanoma Studies in the Past, Present, and Future Joseph Ibrahim, PhD Harvard School of Public.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Evidence-Based Medicine 3 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Background to Adaptive Design Nigel Stallard Professor of Medical Statistics Director of Health Sciences Research Institute Warwick Medical School
Statistics for Health Care Biostatistics. Phases of a Full Clinical Trial Phase I – the trial takes place after the development of a therapy and is designed.
Study design P.Olliaro Nov04. Study designs: observational vs. experimental studies What happened?  Case-control study What’s happening?  Cross-sectional.
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
Chapter 21: More About Tests “The wise man proportions his belief to the evidence.” -David Hume 1748.
01/20151 EPI 5344: Survival Analysis in Epidemiology Interpretation of Models March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
Sample Size And Power Warren Browner and Stephen Hulley  The ingredients for sample size planning, and how to design them  An example, with strategies.
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
Cancer Trials. Reading instructions 6.1: Introduction 6.2: General Considerations - read 6.3: Single stage phase I designs - read 6.4: Two stage phase.
Study Designs for Clinical and Epidemiological Research Carla J. Alvarado, MS, CIC University of Wisconsin-Madison (608)
Is the Continual Reassessment Method Superior to the Standard “3+3” dose escalation scheme? Alexia Iasonos 1 Elyn R. Riedel 1, David.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
BIOE 301 Lecture Seventeen. Progression of Heart Disease High Blood Pressure High Cholesterol Levels Atherosclerosis Ischemia Heart Attack Heart Failure.
Sampling and Probability Chapter 5. Sampling & Elections >Problems with predicting elections: Sample sizes are too small Samples are biased (also tied.
1 Statistics in Research & Things to Consider for Your Proposal May 2, 2007.
Lecture & tutorial material prepared by: Dr. Shaffi Shaikh Tutorial presented by: Dr. Rufaidah Dabbagh Dr. Nurah Al-Amro.
Math 4030 – 9a Introduction to Hypothesis Testing
Bayesian Statistics & Innovative Trial Design April 3, 2006 Jane Perlmutter
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Compliance Original Study Design Randomised Surgical care Medical care.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
CV-1 Trial 709 The ISEL Study (IRESSA ® Survival Evaluation in Lung Cancer) Summary of Data as of December 16, 2004 Kevin Carroll, MSc Summary of Data.
European Patients’ Academy on Therapeutic Innovation Ethical and practical challenges of organising clinical trials in small populations.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Sample Size and Power Considerations.
Section 9.1 First Day The idea of a significance test What is a p-value?
C HAPTER 2  Hypothesis Testing -Test for one means - Test for two means -Test for one and two proportions.
© 2010 Jones and Bartlett Publishers, LLC. Chapter 12 Clinical Epidemiology.
Statistical Significance or Hypothesis Testing. Significance testing Learning objectives of this lecture are to Understand Hypothesis: definition & types.
Critical Appraisal Course for Emergency Medicine Trainees Module 2 Statistics.
Statistical issues: Designing your trial for success ELIZABETH GARRETT-MAYER, PHD PROFESSOR OF BIOSTATISTICS HOLLINGS CANCER CENTER, MEDICAL UNIVERSITY.
Bayesian-based decision making in early oncology clinical trials
The Importance of Adequately Powered Studies
Dose-finding designs incorporating toxicity data from multiple treatment cycles and continuous efficacy outcome Sumithra J. Mandrekar Mayo Clinic Invited.
Aiying Chen, Scott Patterson, Fabrice Bailleux and Ehab Bassily
Statistical Methods for Biotechnology Products II
Statistics for Clinical Trials in Cancer Research
Presentation transcript:

COSBBI DBMI Roger Day Personalized medicine Clinical trials p. 1

Individuals and the group: the challenge of “personalized medicine” p. 2 Why do we need statistics in medicine? Because people are individuals (not because they’re all alike). It’s all about VARIABILITY On the other hand… “I take comfort in thinking of myself as a statistic” ---Nan Laird, Former Chair, Dep’t of Biostats, Harvard School of Public Health … What a medicine does to me teaches something about what it does to you.

The Lump/Split Dilemma A new treatment is given to 100 patients. Of them, only 8 respond. But there is a subgroup of 5 in which 3 patients respond, yielding a response rate of 60%! Should the treatment be recommended for people in the subgroup? (“Personalized”?) p. 3

Lump? Split? Dr. Lump: “Of course hair color has nothing to do with it. Response rate = 8/100. Don’t treat the Dark Hair people!” Dr. Split: “The latest research says personalize. Response rate = 3/5=60%. Treat the Dark Hair people!” p. 4

Some Bayesian analyses p. 5

Dr. I. DontKnow: Let the data tell us˜ p. 6

p. 7 X =Observation (Dr. Split) Green = prior belief = prior mean Red = posterior belief (prior + data) M =posterior mean

Goldilocks and the three investigators Dr. Lump: low variance, high bias Dr. Split: high variance, low bias Dr. I.Dontknow: JUST…right! –Empirical Bayes, hierarchical model The challenge for the future of medicine: Let DATA + PRIOR UNDERSTANDING dictate how much to “personalize”. p. 8

p. 9

CLINICAL TRIALS! p. 10

p. 11 Drug development process (idealized) Molecular In vitro In vivo Epidemiologic “File cabinet” { IDEA Phase IOBJ: “Safety” ENDPT: Toxicity ) Phase IIOBJ: “Efficacy” ENDPT: Clinical response Phase IIIOBJ: “Effectiveness” ENDPT: Survival Phase IVOBJ: “Outcomes” ENDPT: Pain, cost, …

p. 12 Phase I Clinical Trials: Objectives OBJECTIVES: Identification of toxicities to watch out for. Determination of a “Recommended Phase II Dose”. DEFINITION: Maximum tolerated dose (MTD): “The highest level of a dose that can be tolerated” (“Tolerated”= an “acceptable” risk of toxicity) DEFINITION: Dose-limiting toxicity (DLT): (1) “an adverse event that is counted against dose escalation” (2) “a type of adverse event associated with the drug being tested”

p. 13 Phase I Clinical Trials: Endpoints Severity grades: 1Mild 2Moderate DLT3Severe DLT 4 Life-threatening DLT 5Fatal The U.S. National Cancer Institute: Common Toxicity Criteria 1982; CTC Version ; Common Terminology Criteria for Adverse Events v3.0 (CTCAE) (an informatics data conversion nightmare)

p. 14 Phase I: Standard Design (3+3) 3 patients per dose tier If #DLT =... 0/3, then Escalate the dose 1/3, then add 3 pts if 1/6, then Escalate if any more, then Stop 2/3,then Stop (many better designs, but still the most popular)

p. 15 What’s a “good” toxicity rate? proportion of patients who will get adverse events “too toxic, no matter what!” “just right” “not toxic enough- probably won’t work.”

Development of a Phase I Dose from a Dose-Ranging Study p. 16

Let’s play Phase I Trial! You are the Patient. p. 17

p. 18

p. 19

p. 20 How much information do we really have about the “maximum tolerated dose” If 0 out of 3 DLT’s, –estimated risk of DLT is “zero” –95% confidence interval is 0% to 63%. If 1 out of 6 DLT’s, –estimated risk of DLT is 17% –95% confidence interval is 4% to 64% not much information!!!

p. 21 Main purpose: Determine whether there is sufficient evidence of efficacy Secondary: Determine (or confirm) safety with greater confidence. (Is the “MTD” or “RP2D” really “tolerable”?) Usually just one regimen. Often just one drug or other treatment at a time. Phase II: Objectives

p. 22 Efficacy vs Effectiveness: Textbook Definitions Efficacy –“true biological effect of a treatment” Effectiveness –“the effect of a treatment when widely used in practice”

p. 23 If early results say the treatment is not very good, isn’t it UNETHICAL to continue to accrue patients? ETHICS |||||||| DESIGN ||||||| BIOSTATISTICS Solution: An early stopping rule a special case of ADAPTIVE DESIGN. Phase II: Ethical Issue

p. 24 Some Phase II study designs “Simon design” - Early stopping for poor response “Bryant-Day design - Early stopping for poor response or excess toxicity

p. 25 This might be a “Type II error”. This might be a “Type I error”. Simon two-stage design, Control Clin Trials. 1989

p. 26 Study Design Jargon “Type I Error” (or “alpha”) The probability that you say “whoopee” when you shouldn’t. “Type II Error” (or “beta”) The probability that you say “poopie” … when you shouldn’t. “Statistical Power” The probability that you say “whoopee”... when you should. So.... Power = 1 – beta = 1 – Type II Error

p. 27 Response rates: What’s good enough? What’s too bad? proportion of patients who will respond we love the new treatment! indifferent worse than what we use now! p0p1

p. 28 A class of decision rules: Reject drug if # responses is... r1 out of n1 (first stage) (SHORT WALL) or... r out of n (full trial),(TALL WALL) A set of criteria: alpha = Type I error < 0.10, p0 = 0.30 beta = Type II error < p1 = 0.50 Minimize the average sample size if p0 is true: E(N | p0) Optimal design : r1/n1 r/ n E(N | p0) 7/22 17/

p. 29 This might be a “Type II error”. This might be a “Type I error”. =7 =17 =46 =22

Let’s play Phase II Trial! You are the Principal Investigator. p. 30

p. 31 Objective: Comparative, confirmatory analysis Endpoint: Clinically directly meaningful & important (Survival; Time to Progression; Symptom relief) Treatment assignment: Randomized Control: “Standard of care” usually Early stopping:Evidence for non-equivalence. - New treatment clearly better. - New treatment clearly worse. Phase III studies

p. 32 What is a “statistic”? “Test statistic” measures “surprise if the null hypothesis is true”. P-value = If P-value, “whoopee”. (“reject the null”) If not, then “poopie”. (“accept the null”)

p. 33 Strength of evidence: the “P-value” “P=0.01”: “In a long series of identical trials,if the null hypothesis is true”, such an unusual result as OUR study would only occur once in a hundred trials (0.01 of the time).” ordering of what is possible outcomes “unusual” { }  { }

p. 34 Survival Curves for Arms A and B

Let’s play Phase III Trial! You are a figment in statistician’s computer ( a SIMULATION). Except ONE of you is real. p. 35

p. 36 Historical controls can be misleading… This is one reason we randomize! Arm A = 5-FU and LV (1983 Trial) (Advanced Colorectal Cancer) Arm B = 5-FU and LV (1986 Trial) (Advanced Colorectal Cancer) Arm A is the better regimen!

Stuff we “knew” that ain’t so… (animal studies, observational “big data”, …) p. 37 Womens Health Initiative The unopposed estrogen trial was halted in February 2004, after an average follow- up period of 6.8 years, on the basis that unopposed estrogen does not appear to affect the risk of heart disease, the primary outcome, which was in contrast to the findings of previous observational studies. On the other hand, there were indications for an increased risk of stroke. The Effect of Vitamin E and Beta Carotene on the Incidence of Lung Cancer and Other Cancers in Male Smokers Unexpectedly, we observed a higher incidence of lung cancer among the men who received beta carotene than among those who did not. Randomized trials, hooray!!

p. 38 Advanced Colorectal Cancer SYMPTOM STATUS Maybe this explains why Arm A did better. Let’s do some statistical magic!

p. 39 Advanced Colorectal Cancer Cox Proportional Hazards Model “ Controlling for risk factors”

p. 40 Advanced Colorectal Cancer Cox Proportional Hazards Model

p. 41 DUKES’ B Confounding

p. 42 DUKES’ C Confounding

p. 43 ALL PATIENTS Confounding

p. 44 Confounding – “Simpson’s paradox”

Afterthoughts p. 45 New ideas in clinical trial design are growing rapidly! Clinical trials should become more ethical. “Personalized medicine”– a crisis looming. Explosion of “features”. People “like me” – fewer and fewer. Sample sizes smaller, but effect sizes bigger (we hope). The best discussion of Simpson’s Paradox is in Judea Pearl’s book, Causality, Chapter 6. A FUN READ!