Validation of predictive regression models Ewout W. Steyerberg, PhD Clinical epidemiologist Frank E. Harrell, PhD Biostatistician.

Slides:



Advertisements
Similar presentations
Residuals Residuals are used to investigate the lack of fit of a model to a given subject. For Cox regression, there’s no easy analog to the usual “observed.
Advertisements

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Logistic Regression Psy 524 Ainsworth.
Forecasting Using the Simple Linear Regression Model and Correlation
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Inference for Regression
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Departments of Medicine and Biostatistics
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Confidence Intervals Underlying model: Unknown parameter We know how to calculate point estimates E.g. regression analysis But different data would change.
Prediction Models in Medicine Clinical Decision Support The Road Ahead Chapter 10.
N-way ANOVA. 3-way ANOVA 2 H 0 : The mean respiratory rate is the same for all species H 0 : The mean respiratory rate is the same for all temperatures.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.

Statistics for Managers Using Microsoft® Excel 5th Edition
Multivariate Data Analysis Chapter 4 – Multiple Regression.
1 G Lect 11M Binary outcomes in psychology Can Binary Outcomes Be Studied Using OLS Multiple Regression? Transforming the binary outcome Logistic.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Classification and Prediction: Regression Analysis
Regression and Correlation Methods Judy Zhong Ph.D.
Xitao Fan, Ph.D. Chair Professor & Dean Faculty of Education University of Macau Designing Monte Carlo Simulation Studies.
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Simple Linear Regression
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
© Department of Statistics 2012 STATS 330 Lecture 25: Slide 1 Stats 330: Lecture 25.
3/2003 Rev 1 I – slide 1 of 33 Session I Part I Review of Fundamentals Module 2Basic Physics and Mathematics Used in Radiation Protection.
Review of Statistical Models and Linear Regression Concepts STAT E-150 Statistical Methods.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Statistics and Quantitative Analysis U4320 Segment 8 Prof. Sharyn O’Halloran.
EDRN Approaches to Biomarker Validation DMCC Statisticians Fred Hutchinson Cancer Research Center Margaret Pepe Ziding Feng, Mark Thornquist, Yingye Zheng,
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Using Resampling Techniques to Measure the Effectiveness of Providers in Workers’ Compensation Insurance David Speights Senior Research Statistician HNC.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Linear correlation and linear regression + summary of tests
Multivariate Data Analysis Chapter 5 – Discrimination Analysis and Logistic Regression.
Why Model? Make predictions or forecasts where we don’t have data.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Ilona Verburg Nicolette de Keizer Niels Peek
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Evaluating Predictive Models Niels Peek Department of Medical Informatics Academic Medical Center University of Amsterdam.
BUILDING THE REGRESSION MODEL Data preparation Variable reduction Model Selection Model validation Procedures for variable reduction 1 Building the Regression.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
1 Statistics & R, TiP, 2011/12 Neural Networks  Technique for discrimination & regression problems  More mathematical theoretical foundation  Works.
Case Selection and Resampling Lucila Ohno-Machado HST951.
Instructor Resource Chapter 15 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Assumptions of Multiple Regression 1. Form of Relationship: –linear vs nonlinear –Main effects vs interaction effects 2. All relevant variables present.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Bootstrapping James G. Anderson, Ph.D. Purdue University.
Assessing the additional value of diagnostic markers: a comparison of traditional and novel measures Ewout W. Steyerberg Professor of Medical Decision.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Logistic Regression: Regression with a Binary Dependent Variable.
Bootstrap and Model Validation
Simple Linear Regression
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
26134 Business Statistics Week 5 Tutorial
Chapter Six Normal Curves and Sampling Probability Distributions
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
VALIDATION AND UPDATING OF MODELS WITH BIOMARKERS
Simple Linear Regression
Bootstrapping Jackknifing
Regression and Clinical prediction models
Clinical prediction models
Presentation transcript:

Validation of predictive regression models Ewout W. Steyerberg, PhD Clinical epidemiologist Frank E. Harrell, PhD Biostatistician

Personal background  Ewout Steyerberg: Erasmus MC, Rotterdam, the Netherlands  Frank Harrell: Health Evaluation Sciences, Univ of Virginia, Charlottesville, VA, USA “Validation of predictions from regression models is of paramount importance”

Learning objectives: knowledge of  common types of regression models  fundamental assumptions of regression models  performance criteria of predictive models  principles of different types of validation

Performance objectives  To be able to explain why validation is necessary for predictive models  To be able to judge the adequacy of a validation procedure

Predictive models provide quantitative estimates of an outcome, e.g.  Quality of life one year after surgery  Death at 30 days after surgery  Long term survival

Predictive models are often based on regression analysis  y ~ a + sum(b i *x i ) y: outcome variable a: intercept b i : regression coefficient i x i : predictor variable i i in [1,many], usually 2 to 20

3 examples of regression  Quality of life one year after surgery: continuous outcome, linear regression  Death at 30 days after surgery: binary outcome, logistic regression  Long term survival: time-to-outcome, Cox regression

Predictive models make assumptions  Distribution  Linearity of continuous variables  Additivity of effects

Example: a simple logistic regression model  30day mortality ~ a + b 1 *sex + b 2 *age Assumptions:  Distribution of 30day mortality is binomial  Age has a linear effect  The effects of sex and age can be added

Assessing model assumptions  Examine model residuals  Perform specific tests  add nonlinear terms, e.g. age+age 2  add interaction terms, e.g. sex*age

Model assumptions and predictions  Better predictions if assumptions are met  Some violation inherent in empirical data  Evaluate predictions in new data

Evaluation of predictions  Calibration  average of predictions correct?  low and high predictions correct?  Discrimination  distinguish low risk from high risk patients?

Example: predicted probabilities

3 types of validation  Apparent: performance on sample used to develop model  Internal: performance on population underlying the sample  External: performance on related but slightly different population

Apparent validity  Easy to calculate  Results in optimistic performance estimates

Apparent estimates optimistic since same data used for:  Definition of model structure: e.g. selection and coding of variables  Estimation of model parameters: e.g. regression coefficients  Evaluation of model performance: e.g. calibration and discrimination

Internal validity  More difficult to calculate  Test model in new data, random from underlying population

Why internal validation?  Honest estimate of performance should be obtained, at least for a population similar to the development sample  Internal validated performance sets an upper limit to what may be expected in other settings (external validity)

External validity  Moderately easy to calculate when new data are available  Test model in new data, different from development population

Why external validation?  Various factors may differ from development population, including  different selection of patients  different definitions of variables  different diagnostic or therapeutic procedures

Internal validation techniques  Split-sample:  development / validation  Cross-validation:  alternating development / validation  extreme: n-1 develop / 1 validate (‘jack-knife’)  Bootstrap

Bootstrap is the preferred internal validation technique  bootstrap sample for model development: n patients drawn with replacement  original sample for validation: n patients  difference: optimism  efficiency: development and validation on n patients

Example: bootstrap results for logistic regression model  30-day mortality ~ a + b 1 *sex + b 2 *age Apparent area under the ROC curve: 0.77 Mean area of 200 bootstrap samples:0.772 Mean area of 200 tests in original: Optimism in apparent performance: 0.01 Optimism-corrected area: 0.76

External validation techniques  Temporal validation: same investigators, validate in recent years  Spatial validation (other place): same investigators, cross-validate in centers  Fully external: other investigators, other centers

Example: external validity of logistic regression model  30-day mortality ~ a + b 1 *sex + b 2 *age Apparent area in 785 patients: 0.77 Tested in 20,318 other patients: 0.74 Tested by other investigators: ?

Example: external validation

Summary  Apparent validity gives an optimistic estimate of model performance  Internal validity may be estimated by bootstrapping  External validity should be determined in other populations

Key references  tutorial and book on multivariable models (Harrell 1996, Stat Med 15:361-87; Harrell: regression modeling strategies, Springer 2001)  empirical evaluations of strategies (Steyerberg 2000: Stat Med19: )  internal validation (Steyerberg 2001:JCE 54: )  external validation (Justice 1999: Ann Intern Med 130:515-24; Altman 2000: Stat Med 19: )

Links  Interactive text book on predictive modeling  Harrell’s Regression modeling strategies